Investigating the Change of Web Pages' Titles Over Time

Computer Science – Information Retrieval

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

6 pages, 8 figures, 1 table, 1 appendix

Scientific paper

Inaccessible web pages are part of the browsing experience. The content of these pages however is often not completely lost but rather missing. Lexical signatures (LS) generated from the web pages' textual content have been shown to be suitable as search engine queries when trying to discover a (missing) web page. Since LSs are expensive to generate, we investigate the potential of web pages' titles as they are available at a lower cost. We present the results from studying the change of titles over time. We take titles from copies provided by the Internet Archive of randomly sampled web pages and show the frequency of change as well as the degree of change in terms of the Levenshtein score. We found very low frequencies of change and high Levenshtein scores indicating that titles, on average, change little from their original, first observed values (rooted comparison) and even less from the values of their previous observation (sliding).

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Investigating the Change of Web Pages' Titles Over Time does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Investigating the Change of Web Pages' Titles Over Time, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Investigating the Change of Web Pages' Titles Over Time will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-696925

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.