A syntax-based part-of-speech analyser

Computer Science – Computation and Language

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

EACL95, uuencoded and gzipped .ps. (Bibliographic mistake corrected.)

Scientific paper

There are two main methodologies for constructing the knowledge base of a natural language analyser: the linguistic and the data-driven. Recent state-of-the-art part-of-speech taggers are based on the data-driven approach. Because of the known feasibility of the linguistic rule-based approach at related levels of description, the success of the data-driven approach in part-of-speech analysis may appear surprising. In this paper, a case is made for the syntactic nature of part-of-speech tagging. A new tagger of English that uses only linguistic distributional rules is outlined and empirically evaluated. Tested against a benchmark corpus of 38,000 words of previously unseen text, this syntax-based system reaches an accuracy of above 99%. Compared to the 95-97% accuracy of its best competitors, this result suggests the feasibility of the linguistic approach also in part-of-speech analysis.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

A syntax-based part-of-speech analyser does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with A syntax-based part-of-speech analyser, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and A syntax-based part-of-speech analyser will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-531473

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.