Exploring the Statistical Derivation of Transformational Rule Sequences for Part-of-Speech Tagging

Computer Science – Computation and Language

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

10 pages, in proceedings of the ACL Balancing Act workshop

Scientific paper

Eric Brill has recently proposed a simple and powerful corpus-based language modeling approach that can be applied to various tasks including part-of-speech tagging and building phrase structure trees. The method learns a series of symbolic transformational rules, which can then be applied in sequence to a test corpus to produce predictions. The learning process only requires counting matches for a given set of rule templates, allowing the method to survey a very large space of possible contextual factors. This paper analyses Brill's approach as an interesting variation on existing decision tree methods, based on experiments involving part-of-speech tagging for both English and ancient Greek corpora. In particular, the analysis throws light on why the new mechanism seems surprisingly resistant to overtraining. A fast, incremental implementation and a mechanism for recording the dependencies that underlie the resulting rule sequence are also described.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Exploring the Statistical Derivation of Transformational Rule Sequences for Part-of-Speech Tagging does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Exploring the Statistical Derivation of Transformational Rule Sequences for Part-of-Speech Tagging, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Exploring the Statistical Derivation of Transformational Rule Sequences for Part-of-Speech Tagging will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-39356

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.