Computer Science – Computation and Language
Scientific paper
1994-06-03
ACL Balancing Act Workshop proceedings, July 94, pp. 86-95
Computer Science
Computation and Language
10 pages, in proceedings of the ACL Balancing Act workshop
Scientific paper
Eric Brill has recently proposed a simple and powerful corpus-based language modeling approach that can be applied to various tasks including part-of-speech tagging and building phrase structure trees. The method learns a series of symbolic transformational rules, which can then be applied in sequence to a test corpus to produce predictions. The learning process only requires counting matches for a given set of rule templates, allowing the method to survey a very large space of possible contextual factors. This paper analyses Brill's approach as an interesting variation on existing decision tree methods, based on experiments involving part-of-speech tagging for both English and ancient Greek corpora. In particular, the analysis throws light on why the new mechanism seems surprisingly resistant to overtraining. A fast, incremental implementation and a mechanism for recording the dependencies that underlie the resulting rule sequence are also described.
Marcus Mitchell P.
Ramshaw Lance A.
No associations
LandOfFree
Exploring the Statistical Derivation of Transformational Rule Sequences for Part-of-Speech Tagging does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.
If you have personal experience with Exploring the Statistical Derivation of Transformational Rule Sequences for Part-of-Speech Tagging, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Exploring the Statistical Derivation of Transformational Rule Sequences for Part-of-Speech Tagging will most certainly appreciate the feedback.
Profile ID: LFWR-SCP-O-39356