Webmasters favour Google over other search engines: Study
November 16th, 2007 - 4:23 pm ICT by admin ( Leave a comment )
Washington, November 16 (ANI): Penn State researchers have found that website policy makers who use robots.txt files to specify what is open and what is off limits to web crawlers favour Google over other search engines
The study involved more than 7,500 websites, said C. Lee Giles, the David Reese Professor of Information Sciences and Technology, whose team developed a new search engine called BotSeer for the study.
We expected that robots.txt files would treat all search engines equally or maybe disfavour certain obnoxious bots, so we were surprised to discover a strong correlation between the robots favoured and the search engines market share, said Giles of Penn States College of Information Sciences and Technology (IST).
Robots.txt files are known for regulating Web crawlers, also known as spiders and bots, which mine the Web 24/7 for everything from the latest news to e-mail addresses. Web policy makers use the files found in a websites directory to restrict crawler access to non-public information. These files are also used to reduce server load which can result in denial of service and shut down Web sites.
The researchers have now found that some web policy makers and administrators are writing robots.txt files that are not uniformly blocking access. They say that such files give access to Google, Yahoo and MSN while restricting other search engines.
Although the study did not reveal any reason as to why web policy makers opt to favour Google, the researchers believe that the choice was made consciously.
Robots.txt files are written by Web policy makers and administrators who have to intentionally specify Google as the favoured search engine, Giles said.
The finding has been described in a paper titled Determining Bias to Search Engines from Robots.txt, given at the recent 2007 IEEE/WIC/ACM International Conference on Web Intelligence in Silicon Valley. (ANI)
- Can't control content, Google, Facebook tell court (Lead) - Jan 16, 2012
- Court puts off hearing on Google, Facebook pleas (Lead) - Jan 19, 2012
- Google now restricts Facebook access to Gmail contacts - Nov 06, 2010
- U.S. orders Google to fulfill requirements before acquisition of ITA Software - Apr 09, 2011
- Filter content or face blackout, court warns Facebook, Google (Lead) - Jan 12, 2012
- Will rivals do to Google what Facebook did to MySpace? - Jul 19, 2011
- 'Google killer' search engine to 'clean up web searches' - Nov 06, 2010
- Don't deny access to Google: Pakistani court - Sep 20, 2011
- Google fires engineer for privacy violation - Sep 15, 2010
- Facebook, Google join web future warning - Jan 16, 2011
- 171 million Facebook users info leaked - Jul 29, 2010
- Google staff to get servants as perks - Nov 02, 2010
- Girl vents ire on ex-beau's images on Google - Feb 07, 2011
- Google to launch its own social network 'Google Circles' in May - Mar 15, 2011
- Google in risk of losing license to operate in China - Jun 29, 2010
Tags: acm international conference, administrators, favour, google, information sciences, lee giles, mail, msn, obnoxious, penn state researchers, penn states, robots txt, search engine, search engines, txt files, web crawlers, web intelligence, web policy, wic