Characterization of Search Engine Caches

Computer Science – Digital Libraries

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

5 pages, 9 figures, IS&T Archiving 2007, May, 2007

Scientific paper

Search engines provide cached copies of indexed content so users will have something to "click on" if the remote resource is temporarily or permanently unavailable. Depending on their proprietary caching strategies, search engines will purge their indexes and caches of resources that exceed a threshold of unavailability. Although search engine caches are provided only as an aid to the interactive user, we are interested in building reliable preservation services from the aggregate of these limited caching services. But first, we must understand the contents of search engine caches. In this paper, we have examined the cached contents of Ask, Google, MSN and Yahoo to profile such things as overlap between index and cache, size, MIME type and "staleness" of the cached resources. We also examined the overlap of the various caches with the holdings of the Internet Archive.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Characterization of Search Engine Caches does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Characterization of Search Engine Caches, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Characterization of Search Engine Caches will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-363625

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.