Computer Science – Computation and Language
Scientific paper
2000-07-13
Proceedings of the 2nd International Conference on Language Resources and Evaluation (LREC 2000), pp. 17--20
Computer Science
Computation and Language
4 pages
Scientific paper
This paper describes a new method, Combi-bootstrap, to exploit existing taggers and lexical resources for the annotation of corpora with new tagsets. Combi-bootstrap uses existing resources as features for a second level machine learning module, that is trained to make the mapping to the new tagset on a very small sample of annotated corpus material. Experiments show that Combi-bootstrap: i) can integrate a wide variety of existing resources, and ii) achieves much higher accuracy (up to 44.7 % error reduction) than both the best single tagger and an ensemble tagger constructed out of the same small training sample.
Daelemans Walter
Zavrel Jakub
No associations
LandOfFree
Bootstrapping a Tagged Corpus through Combination of Existing Heterogeneous Taggers does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.
If you have personal experience with Bootstrapping a Tagged Corpus through Combination of Existing Heterogeneous Taggers, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Bootstrapping a Tagged Corpus through Combination of Existing Heterogeneous Taggers will most certainly appreciate the feedback.
Profile ID: LFWR-SCP-O-415539