Articles on Export Import Trade over Internet

How to find information from Internet - Part VI Search Engine Results - Method in the Madness ?

Irony of Internet search is that - there never is a paucity of information - but an overdose of it. With databases that can keep the entire Web at its fingertips - search engines almost always can retrieve relevant pages but the challenge lies in separating wheat from the chaff - keeping out unwanted stuff. Most engines find more sites from a typical search query than you could ever wade through and so finding the relevant pages from its search result looks more like proverbial needle in the haystack situation.

We have discussed Boolean search in last issue. Its a great tool in terms of simplicity and speed - but incapable of differentiating search expressions which have same keywords but in different order (hence different meaning). So, Search expressions 'Dog Bites Man' and 'Man bites Dog' retrieves virtually same result (unless using exact phrase).

Search Engines are aware of this problem and have tried to solve it in different ways. Directory type search engines display search result in alphabetic order. But they are extremely selective - so the search result seldom goes beyond one or two pages.

Spider based search engines have no such luck - so they employ what is called 'relevance score' to sort search results.

Relevance score is a measure to bring the most relevant pages at top of any search result. Many search engines display relevant score of each retrieved page.

Relevance scores reflect the number of times a search term appears, where it appears (e.g. in the title, in the meta tags, towards the beginning of the document etc.), if all the search terms are near each other and many other relevance parameters. Each parameter has a different weightage. The pages are sorted by final relevance score.

Since each search engine has its own system of calculating relevance score - you find different search result from different search engines even when the search expression is same.

Related Links:

Source: FAIDA - Newsletter on Business Opportunties from India and Abroad Vol: 3, Issue 13 July 18' 2002

Author : Dr. Amit K. Chatterjee
(Amit worked in blue-chip Indian and MNCs for 15 years in various capacities like Research and Information Analysis, Market Development, MIS, R&D Information Systems etc. before starting his e-commerce venture in 1997. The views expressed in this columns are of his own. He may be reached at amit@infobanc.com )

Copyright � All Rights Reserved. Limited permission is granted to publish this article in a web-site or printed in a journal/ newspaper/ magazine provided the publisher takes prior permission from author, do not make any change in the article (i.e. keep it exactly same as displayed above) and cite the Source of this article as The Great Indian Bazaar with a link to this page.

Back

Sign In

Call Request