Analyzing the Persistence of Referenced Web Resources with Memento

Computer Science – Digital Libraries

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

4 pages, 5 figures. Accepted to Open Repositories 2011 Conference

Scientific paper

In this paper we present the results of a study into the persistence and availability of web resources referenced from papers in scholarly repositories. Two repositories with different characteristics, arXiv and the UNT digital library, are studied to determine if the nature of the repository, or of its content, has a bearing on the availability of the web resources cited by that content. Memento makes it possible to automate discovery of archived resources and to consider the time between the publication of the research and the archiving of the referenced URLs. This automation allows us to process more than 160000 URLs, the largest known such study, and the repository metadata allows consideration of the results by discipline. The results are startling: 45% (66096) of the URLs referenced from arXiv still exist, but are not preserved for future generations, and 28% of resources referenced by UNT papers have been lost. Moving forwards, we provide some initial recommendations, including that repositories should publish URL lists extracted from papers that could be used as seeds for web archiving systems.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Analyzing the Persistence of Referenced Web Resources with Memento does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Analyzing the Persistence of Referenced Web Resources with Memento, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Analyzing the Persistence of Referenced Web Resources with Memento will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-727990

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.