« Hur nyheter sprids på Internet | Main | Infontology »

mars 15, 2004

Deep Web

Salon-artikeln In search of the deep Web handlar om Deep Web, dvs den del av webben som sökmotorerna av olika skäl inte kommer åt.

The next generation of Web search engines will do more than give you a longer list of search results. They will disrupt the information economy.
...
Those of us who place our faith in the Googlebot may be surprised to learn that the big search engines crawl less than 1 percent of the known Web. Beneath the surface layer of company sites, blogs and porn lies another, hidden Web. The "deep Web" is the great lode of databases, flight schedules, library catalogs, classified ads, patent filings, genetic research data and another 90-odd terabytes of data that never find their way onto a typical search results page.
...
As new search spiders penetrate the thickets of corporate databases, government documents and scholarly research databanks, they will not only help users retrieve better search results but also siphon transactions away from the organizations that traditionally mediate access to that data. As organizations commingle more of their data with the deep Web search engines, they are entering into a complex bargain, one they may not fully understand.

I artikeln står inte så mycket om begreppet Deep Web, så här är några länkar för vidare läsning.

Undersökningen som refereras till gjordes 2001. Det vit-papper som då skrevs
är Deep Web White Paper (PDF).

InternetBrus.com skrev tidigt (15 apr 2001) en svensk summering om "den osynliga webben" i Chris Sherman, Gary Price: The Invisible Web.

En senare sammanfattning finns i The Deep Web".

För den som gillar sådant, finns det även en typisk slashdot-diskussion med anledning av Salonartikeln.


(Inspiration från Simon Winter på Infontology.)

Posted by hakank at mars 15, 2004 09:57 FM Posted to Sökmotorer