SEO Blog - Internet marketing news and views  

Google re-ranking and personalized search study

PDF  | Print |  E-mail
Monday, 18 January 2010 17:24


Once upon a time me and a few cohorts wondered about just what levels of flux there were in the Google SERPs. Has personalized search really changed the consistency of rankings? It’s an issue that has been spoken about many times in the search world. We set out to see what was up. After two rounds of research we noticed that this was unlikely to be the case. You can learn more in the post; The SEO guide to Google personalized search

It seemed only sensible, given Google’s Psearch expansion, to have another look. And so last December we started a third round. Has anything changed? How much flux is out there? Well, read on and find out what we’re seeing so far…

Google ranking flux and personalized seach study


Now, first things first; the goal wasn’t really to establish how personalized search operates. What we really wanted to know is how much movement is there in the SERPs for a given query type, in a given region (USA in this case). Yes we’re also noting some potential personalized search affects, but movement/flux was the core observation.

We must understand that even though there is some interesting data here, it is by no means a large enough sample to get nitty gritty with IMHO. It is also important to remember this is a specific task from a quasi-transactional query session (task development).

We want to go back and do more from an informational query space and one more suited to respondents (familiar, such as ‘learn SEO’ etc..). This approach, last time out, did show more movement than we saw with this space (we’ve done this round 2x now). 

As always, no magic bullets here… Just more links in the chain (pun intended of course).




The Set up

We decided to take a set of queries to build a session (task) in a space that may or may not be familiar to the respondents. The queries we used were;

‘antique lamps’
‘buy antique lamps’
‘buy lamps online’

We also asked them to tell us;

  1. Where they’re located (region/country)
  2. Which browser they’re using
  3. Cleared search history lately?
  4. Is personalized/search history on?
  5. Most common Google app used
  6. If they had searched for ‘lamps’ or furnishings in the last 60 days

The goal being to start understanding what may or may not be influencing the search result flux. While we had some questions to be answered, there was no initial bias (hypothesis) behind the analysis. It should also be noted that the entire collection process only lasted 4 days so that we could try our best to avoid any temporal anomalies.

Today we’re looking at the data from the US. At the end of the test period there was really only enough to do analysis on the US and UK markets. I shall post the UK data in the coming weeks.


US Respondent Stats;


Browsers;

  1. FireFox (FF) 88.46%
  2. Chrome (GCRM) 7.69%
  3. Internet Explorer (MS) 3.85%

Most used Google app;

  1. Gmail (GML) 53.85%
  2. GoogleTalk (GT) 11.54%
  3. Analytics (GA) 11.54%
  4. YouTube (YT) 11.54%
  5. Webmaster Tools (WMT) 7.69%
  6. Google Reader (GR) 3.85%
  7. iGoogle HomePage (iG) 0

Cleared Cache lately?

  1. Yes (Y) 42.31%
  2. No (N) 57.69%

Related searches in last 60 days?

  1. Yes (Y) 30.77%
  2. No (N) 69.23%

Is personalized search on?

  1. Yes (Y) 30.77%
  2. No (N) 69.23%




What have we learned about SERP flux?


What have we learned?


It’s a good question, one that I don’t think we have a definitive answer to. Let us remember this is but one query type in a limited set. Personalization is more likely to have an effect in a query space where the user is more active in. That being said, there were some interesting elements that emerged so far.

Much like the original round of testing, a year ago, we can say that there is no massive upheaval where SERPs are vastly different from one user to another. This much has been consistent in each of the (3) rounds in both informational and transactional query spaces.

Here are some notes on other questions we’ve had;


Are the top 3 stable?

In the initial task the top 3 placements were rock solid. It was on the secondary queries that extended the task that they would become somewhat instable, (see ‘concurrent tasks’). Now, it wasn’t huge, but there was some small movement that became more prominent with each related query performed.

Ultimately not much has changed as far as aiming for the sweet spot (above the fold). These are easily the most stable listings and it is unlikely these would be affected more than 95% of the time and even then, rare to drop too far down when they do.




Are top 10 stable?

Yes. While the movement always would be found from #4 on down, they did for the most part remain fairly consistent in content if not ranking. As was noted with the top 3, the more tasks in the chain (queries) the more instability would creep in.

I’d say it is more common to see a #4 move further below in the top 10, not altogether off of the page. We also noted that 7-10 are at the most risk and the 4-6 spots more consistent as far as a presence in the top 10. Many times it would be a plus/minus 1 or two ranking spots.



Where is the biggest re-rankings?

As noted so far, the biggest area of flux was on the expanded task elements. The first query ran, in most cases, was fairly stable. It was on further related queries in the session that higher levels of instability were noted.

We’re considering running the test again while reversing the order to see if indeed the order of the session queries created the increased movement the further we went into the process. It does seem like this is an important element to watch for in future testing.




What causes SERP flux?

I really have no conclusions at this point. We’ve likely not hit a trigger often enough to see any deeper personalization in play. I believe the next round, using more familiar queries and of the informational nature, there could be stronger Psearch elements present (as there were in the first round, late 08).

For the moment, in case you hadn’t guessed, the actual ongoing task development seems to be an important element. Much of the more gradual movement could easily be attributed to various servers the locales were hitting at the time. Many of the other elements we looked at (related searches, Psearch on/off) didn’t really seem to have a strong bearing in the SERP flux.

My guess? Query analysis will play as important a role as the Psearch does. There does seem to be a task related SERP adaptation from what we’re seeing in this round. The jury is out though until we know more.





Secondary observations



What do strong listings have in common?

Certainly the money listing is #1-3 as these seem to be the most resistant to movement. What was also interesting is that the odd 8th place or 5th even would remain more solid than some above and below them.

Indented listings would often be vulnerable but other than that, the higher the ranking, the more security one would have. Beyond that we’d need to actually dig into each of the sites for further clues.





Is related searches past 60 days making a difference?

Not any more than the avg. general flux. There doesn’t seem to be a huge difference or much of one at all from those with recent related searches to those without. I thought this was surprising as there was some expectation of this playing into SERP flux… It didn’t seem to make much of a difference ultimately.




Are concurrent tasks creating a difference?

This is the most active and interesting area that we’re seeing. All three queries the respondents ran for us were related to the same task. We were essentially building a task towards a goal. It seems that with each concurrent search, we’d see more and more flux.




Are browsers making a difference?

We’re seeing some flux in IE, FF and Chrome. Of interest is that the Chrome SERP was a model SERP (consistency wise). I really can’t see this as being a major element, but we would probably need more data for a conclusive decision.

So far, I’d say that this doesn’t play into it….or at least isn’t much of a signal of note.




Is Psearch on making a difference?

This one I still have to say not really. At least not in the traditional sense of having surfing history enabled. Actually, some of the odder SERP flux came from those that weren’t actively browsing with it turned on. If there is a strong Psearch in play, it wasn’t due to past related searches.


Is regional location creating flux?

While there is some flux, there was no major insertion of listings that could be considered regional. This being said, we’d likely want more regional data and more targted queries to better establish this to be the case.



Other notes;


SEO isn't deadPPC and Uni results. – it was also interesting to see how the Shopping results popped in and out. While it is beyond the scope if this research, they were interesting and far more irregular than the SERP flux. This does play into one’s placement in the top 10 and is something to look at down the road.


Indented listings came and went (on final query) – another finicky element were the indented listings. It seems those can be more finicky than the average flux/movement in the SERPs. Looking for this in the next round and digging deeper if it persists is another area of interest.


Click data – given the suspicions of query analysis and task related triggers, in the near future introducing random clicks on links (and return/bounce?) might be worth putting into the mix. We haven’t given many signals beyond merely a 3 query task. If we get more evidence of this being an element, deepening the tasks seems a likely next route (past further rounds like this one).



What does it mean for SEOs?

Once upon a time this journey began because I really wanted to know if ‘rankings are dead’ and if ‘everyone gets a different SERP’. A few of the age old, but poorly understood, questions of the search world. So, have they been answered? Well, sort of.


Rankings are dead? – Yes and no. The old days of ranking reports telling us one ranks in X-position, seem to not be a valid approach. Should we be tracking/watching rankings? Most certainly we want to be getting top 10 and then above the fold to ensure the best performance.

I’d also suggest that you might occasionally get people in a targeted region to check SERPs for you now and again.




Everyone sees a different SERP? – Yes and no. The rankings are reasonably consistent as far as the make up of the top 10. Even the movement isn’t massive when there is some.

But are they ALL the same… most certainly not.


We shouldn’t be changing how we approach things ultimately… Above the fold is the real estate that’s prime (what else is new?). It may be the measuring that we will have to adapt. You will need to find ways to check rankings from a few locales and discern a mean average instead of a definitive placement.


What’s next?

We really have only touched the surface so far (with this round) and are going to want to start looking at a few more things including;


  • Informational – next round we’re returning to the original informational query which is in a more familiar query space which is likely to produce some different effects.
  • Navigational – we may also consider running a round on a navigational query space to see how it reacts. I am more interested in the informational/transactional spaces, so time permitting on this one.
  • Reverse order – if the trend continues with concurrent searches (task development) we will want to reverse the order on one to see if that remains the case
  • Click data – obviously once we get past all of that, we are likely to start introducing click data into the mix to see what effect that has on things.
  • Temporal watching – another point brought to me by a information retriever was that any changes to Google’s processing would likely be incremental. This means we’ll re-test in 6 mo to see if things have changed.

As you may have started to notice, there are still a ton of elements which are unknown. This is why I am still not making any major calls on what we’re seeing. Please bear in mind that we’ve only 2 rounds last year and one recent set of data. This really only begins to give us some insight; nothing more. The last thing we need are SEOs running around taking any of this as gospel at this point.

In the short term we’re going to sort the UK data from this round and I will be posting about that in the coming weeks… Stay tuned.


Are you interested in participating in the analysis of this and other test data?
Join us in the SEO Training Dojo.

SEO training community

Comments (18)
  • Domenick  - Interesting
    Brotha I'm loving your blog more and more. This was a very interesting study, I know you said don't take it as the gospel, but your insights on what could be are great.

    So giving clients ranking reports could be in vain? For some SEO's I know conversion or ROI is more of a focus point than just getting top ranking. So do you think it's going to be more about conversion optimization than just simply getting good rankings?
  • Dave
    Hiya Domenic... good to see ya and glad yer enjoying the geekdom! I've gone off on SEOs and conversions in the past (on Search Engine Journal) so I'll leave that for the moment. But as far as ranking reports, I don't think people can use them definitively without some qualitative research in the form of having people in a target region search the term and create a 'mean average' for it.

    That being said, it goes without saying that top 10 and ultimately top 3 are still the goal. So we need them to get 'an idea' of where we are (and related traffic/revenues) but we can't state 'Your #4' beyond a potential of say 80% certainty and a qualifier stating it is a mean average, not set in stone.

    Back to the conversions. There are a crap load of factors that effect conversions and in many cases SEOs don't have the access nor the training to make that their domain. I think we need to consider 'targeted traffic' - ie; what terms, what revenue it's generating and overall search referrer increases. If the client's site sucks, product is crap/over priced... this screws the conversions...is that MY fault if I don't control such aspects? Certainly not. So try to establish the most qualified terms for targeted traffic and get them... That is the job of the search marketer.

    Anyway, I hope to get the next round going this week at some point... We shall see what we find next time out.
  • peter  - Above the fold the new no. 1
    Thanks for the research.

    I agree with the ranking obsession. I think companies should be thinking in terms of above the fold and below the fold, instead of position number 1 or 5 or 10.
  • Aussiewebmaster  - other questions
    Can you isolate what spot the indent would be in for influence of movement... there is a way to isolate each spot. Is it more likely to detach if second is in certain positions.

    Can you do movement (click set ones up) test of Psearch... is there an impact over time if this is done on a wide scale?

    Is sample really typical given high usage of FF? Could that be a confounding variable?

    Can this be tested by language? Is English more set than say Spanish, Arabic etc.

  • Dave
    @Peter... most certainly we're not talking 'We rank #x' for sure. It is now more of, as you eluded to, a game of above/below the fold. Also of note, is the effect Universal search elements have on a query space. This is an area we're going to look at more in the coming months.

    @Frank...sorry to wreck yer holiday mate!

    Ok.. 1. As I noted the indented listing was certainly prone to change. Sometimes it as present and other not. If we include the indented of a #1 listing, then it was one of the few times a top 3 would end up out of the top 10 altogether (remained #1 but indented listing was tossed). To actually test indented listings we'd need to pick query spaces that are known to produce them..Which is entirely a different study alltogether..lol...Methinks we want to look at Uni search as well.

    Click data - we also have decided that introducing some actual click data into the mix is important as well. Once we're past the next round (informational query, semi-relevant) of static query task development, we most certainly plan on doing that. The elements we can introduce is nearly endless... But that is one area we plan on looking at.

    Browser; The FF usage was a boner on my part as I suggested people use a FF bookmarklet to cull results. That kinda screwed the browser info. That being said, over the three rounds we've done (first two was cleaner) there really isn't any discernible action relating to browser type. Nothing that would cause me to want to dig into that area deeper.

    Language; while we did open the doors to all as far as sending us data, we really only had enough respondents from the US and UK (region and country). We we would have to make a more concerted effort to target a specific country at some point to see such affects. At this point I have maybe 4-5 from some regions which really isn't enough to look at as more than anecdotal evidence.

    Drop me an email Frank if you want to chat more on it and future rounds/other things to test.
  • Everett
    Even when we see a heavy personalization of SERPS, the information feeding results for our queries are still going to be in an index that forms a starting point. At least for the foreseeable future, there is going to be a "base" from which to begin putting together the order of documents show in a SERP. Thus, our goal should be to rank as highly as we possibly can. Period. If we could access that "base" SERP (&pws=0 etc.) then we could continue to track our progress relative to other websites.

    Should we do anything to leverage Psearch? That's an entirely different question. In short "yes" but it would be much more difficult than optimizing for the base index.

    Great experiment, by the way! You've just one a new subscriber. :D
  • Dave
    Hi Everett, glad to have ye aboard! I am not entirely sure how the old &pws is going to work now if everyone is automatically part of the Psearch program. hehe... something else worth testing some. My instinct is that it's still going to show flux with that as well.

    As for what we can do, outside of 'business as usual - go for the gold' I'd be hard pressed to make definitive statements quite yet. Not only do I have the UK data to look at, but further rounds as well. Hopefully by the end of this journey there will be better answers.

    Thanks for dropping in!
  • Renaud Joly  - SEO personnalized search in France
    Tks for this post.
    I'm agree, no revolution in SERP, the top 10 stays the top 10.
    A good tuning, local results are less often ranked than they were during 2009 first quarter. Pertinence wins some points.
    Sometimes a website appears from top 5 to top 3, probably for clic-through analysis.
  • Axel Scheuering  - great post
    thank's a lot for this little study. Most interessting in my opinion is the fact that the top 10 in serps are mainly stable.
  • Jonah Stein  - Stability in 5/8 spot
    In other search phrases I have analyzed, particularly brand/reputation queries, you see the sites in the 5th and/or 8th spot that are unphased by psearch because Google occasionally inserts sites into the top 10 based on site TYPE.

    Go to the search options sidebard and indicate you want only Forums or perhaps Blogs. Suddenly, that locked in 5th or 8th spot is the top of the vertical results. Google has all sorts of diversity and many results get added that way.

    Sadly, this insight doesn't help for "buy antique lamps"

    PS. Collector's Weekly RULES.
  • SEO Company
    This is an excellent think piece which should be shared with all that are concerned. It is important all aspects are brought to light so that issue can be understood in its entirety. SEO Company
  • Samuel Lavoie  - great + more testing
    Great to see this, somebody taking real step to debunk the madness around the "SEO is dead" scheme... looking forward to the next round.

    If you need someone from Quebec/Canada to test, I can help :)
  • Nick  - 4 Days
    Was there any difference on a day by day basis? You did the test over 4 days ....... I get a daily report of my ranking against some terms and I am quite used to seeing massive changes which are reversed a day or 2 later (or not!). That sort of random thing would obscure any trends over 4 days - you'd either have to look at results over a few hours, or several weeks.
  • Dave
    @Samuel - thanks on that, be sure to keep an eye on this blog as I will be announcing the next round shortly. We had a handful of responses from Canada, but not enough to really to formal analysis (under 10 respondents). But it does in itself show (yet another) reason why SEO has become more complicated if anything, not DEAD!!

    @Nick - well, several weeks would just make things more problematic considering temporal ranking functions including the ol QDF. As far as testing in a shorter time period, that ain't gonna happen either as we can't get enough people responding in that period of time. We'd need AT LEAST 100 peeps in that space and over 3 rounds of this, it ain't gonna happen. But you are making my case why ALL SEO testing is somewhat crap and all we can do is get some insight. There are FAR too many variables (including temporal ones) at play.
  • pbandjbandit  - geo terms
    on 12/15, a few days after google reportedly began using personalize search, i found a strong effect in a search using geo terms.

    i am in the process of starting a new business, so i am looking for lots of varied resources in the metro boston area.

    first, someone recommended an accountant in lexington, ma. so my search query contained their business name and lexington ma.

    immediately after, i was searching for a bank in boston, and searched "business banking boston, ma"

    in response to this second query that clearly had geo terms "boston, ma" the SERP contained 3 PPC ads (out of 9 on the page) with headlines containing the term "lexington."

    so google carried over the geo terms from one search query into the following search query that contained very different, specific geo terms.

    my physical location at the time was needham, ma - about 15 miles east of boston, and 15 miles south of lexington, so the results were not a reflection of my geographic location at the time of the query.

    if i were a lexington business running PPC ads, and they were displayed against a search query that specified boston, i'd be pissed because they're not likely to be clicked, thus impacing my click-thru-rate, quality score, etc.

    i suppose it's possible that these businesses purchased the term "boston" but it would be extraordinarily unlikely that 1/3 of the advertisers were in lexington, and none in any other surrounding communities - many of which are closer to my location and have thriving banking communities.

    this is obviously an anecdotal example; i was not endeavoring to measure or compare personalized results, but it was striking.
  • Dave
    Interesting insight and thanks for posting it. One area we haven't dug into, but plan on, is looking at Geo and Universal results. For quite some time Google has been using more geo/behavioural signals in the PPC side of things so that is something we've been aware of for some time now. Now we're starting to dig around the SERPs to see what effects it may be having on the rankings of the organic listings. Stay Tuned!

    =D
  • Ian  - Nothing changed?
    So basically nothing really changed in terms of ranking reports. You still need to rank above the fold (the higher the better) to get the clicks. And you still need to keep track of your progress to hit the front page. If you don't rank there in non-personalized search results, then the chances are you won't rank in Psearch either.

    This situation reminds me of the PageRank thing. You can't check the real PR, so the only option left is to go with the toolbar pagerank. Same here, you can't really tell how many people will see your site in the same position that your rank checker shows you, but it's the best option we have.

    The only thing that changes is the approach towards ranking reports and search volumes. Even if you rank #1 for a keyword with X search volume you need to keep in mind that a certain proportion of these searchers will have different SERPs and your site may not be first there.

    Anyway, thanks for your time and efforts. Your report should be of great help to the SEO community.

    Kudos
  • learn seo  - REply
    You have done a good research, Its a great idea to think in terms of above the fold and below the fold, instead of position number 1 or 5 or 10.
Write comment
Your Contact Details:
Comment:
[b] [i] [u] [url] [quote] [code] [img]   
Security
Please input the anti-spam code that you can read in the image.
 

Search the Site

SEO Training

Trail Maps

Please update your Flash Player to view content.

On Twitter

Follow me on Twitter

Socialize


Trail Badges

Weapons for Warriors - My fav supporters

Check out the full line of SEO and PPC keyword toolsRaven - SEO ToolsSEM Rush - keyword research toolsMajestic SEOLink AssitantAngies professional copywriting servicesNew Media - programming and plugins