Life is a mystery

Home arrow Search Engines arrow Stay off the LSI bandwagon  
Saturday, 05 July 2008
Guess Who


Latest Adventures
Dave's Favs
Fire Horse Features
180+ SEO Tools and resources
the SEOsearch engine
All about Google's personalized search
Social Media Marketing search
Search Top Social sites
Tweed the Twitter and FriendFeed search engine

Young Guns of SEO
Most Travelled
Syndicate

Follow the Trail of the Fire Horse

Get The FireHorse Feed

Add to Google

Add to Technorati Favorites
My FriendFeed
View David Harry's profile on LinkedIn Stumble Along With Me

Stay off the LSI bandwagon
Thursday, 04 October 2007
Digg!

I wanted to talk about something that has been bugging me lately… SEO folks that are quoting or selling LSI related advice or services. It’s starting to really get on my nerves.

googsmall.jpgFor the uninitiated, (lucky peeps) it stands for Latent Semantic Indexing which was worked on by Applied Semantics from Latent Semantic Analysis concepts. Then waaaaayyyyy back in April of 2003 AS was acquired by Google.  For the most part, it would seem the main usage, at the time, was geared towards the AdWords AdSense aspects.

From the press release at the time;

Applied Semantics is a proven innovator in semantic text processing and online advertising," said Sergey Brin, Google's co-founder and president of Technology. "This acquisition will enable Google to create new technologies that make online advertising more useful to users, publishers, and advertisers alike.”

 

Enter the Bandwagon

Somewhere along the line the sooth seers and prognosticators of SEO far and wide began to relate LSI with the organic SERPs to the point those would claim it could superseded the traditional back link heavy model (PageRank). Lately I see the term used loosely by SEO types and find it on websites espousing how they can use it to make your site rank better.

Now, I have no problem with folks making pages that are more relevant, instead of whacking the keywords they target all over the freakin place. That makes the web better as far as I am concerned. The problem I am having is the lack of general understanding. There is no direct evidence that LSI ever ended up in whole or in part, in the organic indexing and retrieval system. Actually, Michael Duz makes a good case for the LSI Myth.

“So why would anybody claim that Google or any other search engine was using LSI? Two possible reasons, simple ignorance or as Dr. E. Garcia (information retrieval researcher) puts it “snake oil marketers”, SEO firms and individuals who find some commercial value in pretending they have an understanding of LSI. Here are some typical quotes right off their web pages:”

So what IS Google doing?


Who cares? We are not going to know one way or the other definitively, so let’s not get worked up over it. There are many other examples of methods search engineers are looking at to find relevance. Such as;

Phrase based indexing and retrieval methods

Incorporating data based upon user query sessions

Probabilistic latent semantic analysis

Latent Dirichlet allocation 

Hidden Topic Markov Models

… just to name a few…..

So let’s give it a break

rant.jpgIn the end it is learning about various models and looking for common elements that, for me at least, is the best way to look at things as far as your studies into search engines is concerned. It is a multi-layered system that could have any number of methods at work. Without knowing exactly which are being used and what thresholds and weighting is given to each, we cannot run around stating things such as LSI in Google being a certainty. I certainly don’t feel people should be selling services based upon it, nor blogging  about how to use LSI to improve ones rankings.

The next time some one tells you about the secret sauce that is LSI, shake your head gently, pat them on the head and walk away… very slowly… the believers can be a touchy bunch.

</rant>

Latent Semantic Indexing Resources;

LSI Explained - Meaning-based advertising and document relevance determination - Meaning-based information organization and retrieval - Dr. E. Garcia - SVD and LSI Tutorial 5: LSI Keyword Research and Co-Occurrence Theory - LSI Implementation and Scale-ability issues

Comments
Add New Search
Paul Smith  - semantics     |2007-10-05 19:32:14
The semantic issue is over-played largely because it isn't possible to get computers to parse the language the way humans want to. You're probably right, until a computer can make sense of a sentence.
theGypsy   |2007-10-05 19:40:44
While the computational shortcomings are part of why folks like Micheal poo-poo the whole thing, I don’t have a problem with people in the SOE world learning about such methods.

What does start to get bothersome is people providing services and using LSI as some sort of catch phrase to blind clients with fancy lexicons.

I love reading the technical stuff, but there is no ‘magic bullet’ there IMO. Some internet marketers jump to easily to conclusions. It’s simply part of the education/passion for me - not magic bullets.

(thanks for dropping by )
Write comment
Name:
Email:
 
Website:
Title:
UBBCode:
[b] [i] [u] [url] [quote] [code] [img] 
 
 
:angry::0:confused::cheer:B):evil::silly::dry::lol::kiss::D:pinch:
:(:shock::X:side::):P:unsure::woohoo::huh::whistle:;):s
:!::?::idea::arrow:
 
Please input the anti-spam code that you can read in the image.

3.20 Copyright (C) 2007 Alain Georgette / Copyright (C) 2006 Frantisek Hliva. All rights reserved."





Reddit!Del.icio.us!Facebook!Slashdot!Netscape!Technorati!StumbleUpon!Newsvine!Furl!Blogmarks!Yahoo!Ma.gnolia!PlugIM!Squidoo!BlogMemes!
 
< Prev   Next >
 

Find your way
Trail Maps

Learn Search Engine Optimization


Joomla SEO Club

Great Destinations

Fire Horse Friends

Search Sensations

Marketing Mavens

Social Butterflies


Trail Riders

Trail Badges

Big list of Search Marketing Blogs

All the Top sites on the Web

Columbo Award Runner Up