Coherent Keyphrase Extraction via Web Mining

Computer Science – Learning

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

6 pages, related work available at http://purl.org/peter.turney/

Scientific paper

Keyphrases are useful for a variety of purposes, including summarizing, indexing, labeling, categorizing, clustering, highlighting, browsing, and searching. The task of automatic keyphrase extraction is to select keyphrases from within the text of a given document. Automatic keyphrase extraction makes it feasible to generate keyphrases for the huge number of documents that do not have manually assigned keyphrases. A limitation of previous keyphrase extraction algorithms is that the selected keyphrases are occasionally incoherent. That is, the majority of the output keyphrases may fit together well, but there may be a minority that appear to be outliers, with no clear semantic relation to the majority or to each other. This paper presents enhancements to the Kea keyphrase extraction algorithm that are designed to increase the coherence of the extracted keyphrases. The approach is to use the degree of statistical association among candidate keyphrases as evidence that they may be semantically related. The statistical association is measured using web mining. Experiments demonstrate that the enhancements improve the quality of the extracted keyphrases. Furthermore, the enhancements are not domain-specific: the algorithm generalizes well when it is trained on one domain (computer science documents) and tested on another (physics documents).

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Coherent Keyphrase Extraction via Web Mining does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Coherent Keyphrase Extraction via Web Mining, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Coherent Keyphrase Extraction via Web Mining will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-541674

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.