C-Rank: A Link-based Similarity Measure for Scientific Literature Databases

Computer Science – Digital Libraries

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Scientific paper

As the number of people who use scientific literature databases grows, the demand for literature retrieval services has been steadily increased. One of the most popular retrieval services is to find a set of papers similar to the paper under consideration, which requires a measure that computes similarities between papers. Scientific literature databases exhibit two interesting characteristics that are different from general databases. First, the papers cited by old papers are often not included in the database due to technical and economic reasons. Second, since a paper references the papers published before it, few papers cite recently-published papers. These two characteristics cause all existing similarity measures to fail in at least one of the following cases: (1) measuring the similarity between old, but similar papers, (2) measuring the similarity between recent, but similar papers, and (3) measuring the similarity between two similar papers: one old, the other recent. In this paper, we propose a new link-based similarity measure called C-Rank, which uses both in-link and out-link by disregarding the direction of references. In addition, we discuss the most suitable normalization method for scientific literature databases and propose an evaluation method for measuring the accuracy of similarity measures. We have used a database with real-world papers from DBLP and their reference information crawled from Libra for experiments and compared the performance of C-Rank with those of existing similarity measures. Experimental results show that C-Rank achieves a higher accuracy than existing similarity measures.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

C-Rank: A Link-based Similarity Measure for Scientific Literature Databases does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with C-Rank: A Link-based Similarity Measure for Scientific Literature Databases, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and C-Rank: A Link-based Similarity Measure for Scientific Literature Databases will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-92787

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.