Life is a mystery

Home arrow Search Engines  
Saturday, 05 July 2008
Guess Who


Latest Adventures
Dave's Favs
Fire Horse Features
180+ SEO Tools and resources
the SEOsearch engine
All about Google's personalized search
Social Media Marketing search
Search Top Social sites
Tweed the Twitter and FriendFeed search engine

Young Guns of SEO
Most Travelled
Syndicate

Follow the Trail of the Fire Horse

Get The FireHorse Feed

Add to Google

Add to Technorati Favorites
My FriendFeed
View David Harry's profile on LinkedIn Stumble Along With Me

Do link spammers leave footprints?
Tuesday, 24 June 2008
Digg!

Microsoft on link spam; using temporal tracking

A recent Microsoft search patent came out for a system which detects spam websites by looking at the changes in link information on a given page/set of pages over time. We recently covered some potential ways of going about this with some analysis of Google in ‘Historical ranking factors for link builders’ – be sure to give that a read as well if you’re in the mood to saunter down some related journeys. This time we have;

Detecting web spam from changes to links of websites
Filed
December 14, 2006 : Published June 19, 2008

As we know from the last excursion, link activity over time can unlock potential spam signals for search engines to use. This can be done by looking at a variety of features of the associated link information. As with any analysis the system uses a probabilistic model to judge what is and is not considered to be a spammy link profile.

This is also not limited to inbound links, but to outbound links as well (because a link profile is more than just inbounds right?).  The main problems that web spam creates for a search engine are the obvious lack of meaningful search results, but also bandwidth/spidering resources spent crawling/indexing spammy sites. So some yummy SERPs and good for the bottom line as well!!

 

Link spam temporal footprints

“"Spamming" in general refers to a deliberate action taken to unjustifiably increase the popularity or importance of a web page or web site. In the case of link spamming, a spammer can manipulate links to unjustifiably increase the importance of a web page. For example, a spammer may increase a web page's hub score by adding out links to the spammer's web page.”

Link Spam DetectionSome examples of tactics link spammers may use also included;

  1. create a copy of an existing link directory to quickly create a very large out link structure.
  2. a spammer may provide a web page of useful information with hidden links to spam web pages.
  3.  many web sites, such as blogs and web directories, allow visitors to post links. Spammers can post links to their spam web pages to directly or indirectly increase the importance of the spam web pages.
  4. a group of spammers may set up a link exchange mechanism in which their web sites point to each other to increase the importance of the web pages of the spammers' web sites.

 By looking at the link profiles of spam sites the search engine can create a template of it's linking activity to enable further algorithmic seek and destroy adaptations

As with many probabilistic systems, a set of training documents/websites can be used to train valuations of a spammy link profile. These can come from inputted sites that received a manual review and were deemed to be a spam website. These become the base set used for teaching the algorithm(s) what look for when crawling.

 

Read more...
 
Hey Google NoFollow this!
Monday, 09 June 2008
Digg!

 

A Monday morning rant

I love Google as much as the next guy or maybe more than most… Not only the search engine and many apps and services I play with, but I also enjoy how the engineers think (Patent hound here ya know)… but the whole link FUD has finally started to irk me.

I have yet to write about the ‘nofollow’ fiasco… today it changes.

Are we simply a gutless bunch of weenies under the spell of the mighty G? Methinks it’s true.

Matt McGee is a guy I enjoy reading and have respected for some time … and still do. BUT this morning I noticed a post of his that was thanking some advertisers and highlighting some of there posts. He not only nofollowed the links to some of their recent posts, but felt he had to state that they were nofollowed. Good lord, is this what we’ve come to?

Eat my shorts

Read more...
 
Link Builders Guide to Historical Ranking Factors
Tuesday, 29 April 2008
Digg!

Because it's about time 

When in comes to doing SEO for Google, one thing is certain; you’d better know about backlinks. Given the fact that the core algorithm is based largely on ranking and detecting spam via links, it is certainly an area to be familiar with.

A while ago there was a re-release of a Google patent on temporal ranking factors which really was a watershed back in 2005 when it first came out, as were subsequent related patents. Analyzing the recent (re-released) publication for new anomalies gave life to the thought that it would be interesting to highlight a few areas as a bit of a refresher course.

For this post I am merely going to address the implications of temporal factors on link building programs.

 

The basics of historical factors

Just because a web page is 10 years old, doesn’t mean it is still relevant. It also doesn’t ensure that it is. In some situations fresh content can be more relevant than older content. The discovery data can be used to re-rank a document in a positive or negative manner…

Search engines, (such as Google) can use temporal data to analyze link profiles for anomalies based on discovery, (inception) dates;

…it may be assumed that a document with a fairly recent inception date will not have a significant number of links from other documents

And can be used for Spam detection;

While a spiky rate of growth in the number of back links may be a factor used by search engine to score documents, it may also signal an attempt to spam search engine. Accordingly, in this situation, search engine may actually lower the score of a document(s) to reduce the effect of spamming.”

 

Thus inception dates and historical data can be used for link velocity analysis.

Read more...
 
You are not paranoid, someone IS watching you
Wednesday, 23 April 2008
Digg!

 

Microsoft’s take on user behavioural data

A search related patent released by Microsoft the other day touches on a popular theme of late, (with me at least); user behaviour analysis. In simplest terms, they look at various interactions with the search results (SERPs) and listing pages, to try and determine the relevance of a set of results. This has been a common theme among the Big 3 as noted by these recent posts;

Google confirms using query analysis – use of past queries in the regular index
Google on User Performance Metrics – trio of patents on the subject
Yahoo Personalized PageRank – user behaviour and PageRank
Yahoo on Personalized Networks – user annotations used to populate networks

..you get the idea...

While concepts utilizing user behaviour relating to AdServing have been around for while, there is increasing interest in ways to harness it within the main SERPs. That is beyond mere personal search implications – though they work best there.

 

The Microsoft approach

The patent; Search system using user behaviour data - Filed; December 2003 – Published; April 22 2008 

We're watching you

Read more...
 
The Quest for Social Search Sensibilities
Monday, 21 April 2008
Digg!

Another chat with Bill Slawski

Where is the future of Search headed? Will algorithmic, link based, search engines continue to rule the roost? Or will the Web 2.0 world produce a social search engine that takes the world by storm? What about a mix of both?

These are some of the questions that have been bantered about with a few of my friends in the SEO and SMM worlds over the last few months. As you might imagine, the views are as varied as the individuals I talk to. Many of my mates in the social media marketing world tend to believe that people are the answer to search relevance and the death of spam. I personally can’t see a straight social search approach ultimately providing the most relevant results. Granted it is hard to say as there have been no widely-accepted/used social search engines and the premise requires a larger data set than we can see so far. So for me the jury is still out.

For me, search engines are always taking in implied or passive data, which in many ways is a form of social search engineering (behavioural targeting at very least).

Talking to the experts

I don’t really do a lot of interviews here, but I do enjoy bringing conversations public and over the next few weeks shall be asking a few friends to talk about this with me here on the Trail. First up; technical search geek and fellow algo-holic – Bill Slawski (we last chatted here)

 

Read more...
 
<< Start < Prev 1 2 3 4 5 6 Next > End >>

Results 1 - 13 of 74
 

Find your way
Trail Maps

Learn Search Engine Optimization


Joomla SEO Club

Great Destinations

Fire Horse Friends

Search Sensations

Marketing Mavens

Social Butterflies


Trail Riders

Trail Badges

Big list of Search Marketing Blogs

All the Top sites on the Web

Columbo Award Runner Up