Computer Science – Digital Libraries
Scientific paper
2007-03-15
5 pages, 9 figures, IS&T Archiving 2007, May, 2007
Search engines provide cached copies of indexed content so users will have something to "click on" if the remote resource is temporarily or permanently unavailable. Depending on their proprietary caching strategies, search engines will purge their indexes and caches of resources that exceed a threshold of unavailability. Although search engine caches are provided only as an aid to the interactive user, we are interested in building reliable preservation services from the aggregate of these limited caching services. But first, we must understand the contents of search engine caches. In this paper, we have examined the cached contents of Ask, Google, MSN and Yahoo to profile such things as overlap between index and cache, size, MIME type and "staleness" of the cached resources. We also examined the overlap of the various caches with the holdings of the Internet Archive.
Frank McCown
Michael L. Nelson
Characterization of Search Engine Caches