Not As Easy As It Seems: Automating the Construction of Lexical Chains Using Roget's Thesaurus

Computer Science – Computation and Language

Scientific paper

5 pages

Morris and Hirst present a method of linking significant words that are about the same topic. The resulting lexical chains are a means of identifying cohesive regions in a text, with applications in many natural language processing tasks, including text summarization. The first lexical chains were constructed manually using Roget's International Thesaurus. Morris and Hirst wrote that automation would be straightforward given an electronic thesaurus. All applications so far have used WordNet to produce lexical chains, perhaps because adequate electronic versions of Roget's were not available until recently. We discuss the building of lexical chains using an electronic version of Roget's Thesaurus. We implement a variant of the original algorithm, and explain the necessary design decisions. We include a comparison with other implementations.
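As a rough illustration of the idea of linking words about the same topic, the sketch below groups words into chains when they share a thesaurus category and occur close together. It is not the algorithm from the paper. The toy thesaurus, its category labels, the function name build_chains, and the max_gap parameter are all assumptions made for this example, and real thesaurus relations and word-sense handling are considerably richer.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical toy thesaurus: word -> set of category labels.
# The labels are invented for illustration; they are not Roget's actual heads.
TOY_THESAURUS: Dict[str, Set[str]] = {
    "train": {"vehicle"},
    "rail": {"vehicle"},
    "car": {"vehicle"},
    "school": {"teaching"},
    "teacher": {"teaching"},
    "apple": {"food"},
}

Chain = List[Tuple[int, str]]  # (token position, word)


def build_chains(
    tokens: List[str],
    thesaurus: Dict[str, Set[str]],
    max_gap: int = 5,  # assumed distance limit, in tokens
) -> List[Chain]:
    """Greedily assign each known word to the first compatible chain."""
    chains: List[Chain] = []
    for pos, word in enumerate(tokens):
        heads = thesaurus.get(word)
        if not heads:
            # Words missing from the thesaurus are skipped; this stands in
            # for candidate-word selection and stop-word filtering.
            continue
        for chain in chains:
            last_pos, _ = chain[-1]
            if pos - last_pos > max_gap:
                continue  # chain's most recent word is too far back
            # Related here means: shares a category with any chain member.
            if any(heads & thesaurus[w] for _, w in chain):
                chain.append((pos, word))
                break
        else:
            # No compatible chain was found, so start a new one.
            chains.append([(pos, word)])
    return chains


if __name__ == "__main__":
    text = "the train left the rail yard near the school and the teacher ate an apple"
    for chain in build_chains(text.split(), TOY_THESAURUS):
        print([word for _, word in chain])
```

A greedy, first-match assignment keeps the sketch short. It ignores ambiguity: a word listed under several categories would simply join the first chain it matches.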
