Core Technologies Behind Google Ranking

July 18, 2008

Searchers expectations in the last decade has not only rightly increased but has also changed from “give me what I said to give me what I want.” Google has been successful where others have failed as they work hard to fulfil the expectations of each and every search. Google core technology behind its searching algorithm and ranking system, in the academic world, the field of search is known as Information Retrieval (IR). The IR researchers uses statistical signals of word salience, like word frequency, to organize and deliver highly relevant pages, similar to how a human brain can interoperate road signs, IR provides a solid foundation, for Googles tremendous searching algorithm, which can be easily adapted to incorporate other new developing technologies, principal theories have pushed the technologies for understanding these three components (of the search process) to completely new dimensions.

Understanding pages:

  • Over years google has invested heavily in their ability to crawl and index, as a direct result Googles serp’s are both plentiful and fresh.
  • One of the key technologies we have developed to understand pages is associating important concepts to a page even when they are not obvious on the page or even if the page does not mention the term anywhere on the page.
  • Other technologies include the ability to make a distinction between important and less important words in a particular page.

Understanding queries:

  • A highly accurate spelling suggestion system. i.e. “Did you mean:”
  • A highly intelligent synonyms system. i.e. differentiate between “Dr Who” and “Rodeo Dr”
  • A very strong concept analysis system. Identifying critical concepts in the query i.e. further enhance query for “PC impact on people” to “Impact of computer on Society”

Understanding users:

  • Personalisation tailors search results to individual users who are logged-in and have signed up for Web History get results that are more relevant for them than the general Google results i.e. someone who does a lot of football related searches might get more football related results for “giants”, while other users might get results related to the baseball.
  • Universal Search attempts to interpret user intent i.e. a search “India” not only gets the important web pages, but also maps, videos and weather in India.

Finally, the latest advance in search: Cross Language Information Retrieval (CLIR). I will cover CLIR in more detail in my next post, however CLIR allows users to first discover information that is not in their language, and then using Google’s translation technology, to make information accessible, where by a user is prompted and presented with a link to search the English web.

NSEO Programmer
Vipul

  • Digg
  • del.icio.us
  • Netvouz
  • DZone
  • ThisNext
  • Wists
  • Furl
  • Spurl
  • Technorati
  • YahooMyWeb
  • Ma.gnolia
  • Reddit
  • Simpy
  • StumbleUpon
Listen to this podcast Listen to this podcast

No Comments

No comments yet.

RSS feed for comments on this post
TrackBack URI

Leave a comment