Exploiting Diversity for Natural Language Parsing

Computer Science – Computation and Language

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Ph.D. Thesis, Johns Hopkins University. Advisor: Eric Brill. 169 pages

Scientific paper

The popularity of applying machine learning methods to computational linguistics problems has produced a large supply of trainable natural language processing systems. Most problems of interest have an array of off-the-shelf products or downloadable code implementing solutions using various techniques. Where these solutions are developed independently, it is observed that their errors tend to be independently distributed. This thesis is concerned with approaches for capitalizing on this situation in a sample problem domain, Penn Treebank-style parsing. The machine learning community provides techniques for combining outputs of classifiers, but parser output is more structured and interdependent than classifications. To address this discrepancy, two novel strategies for combining parsers are used: learning to control a switch between parsers and constructing a hybrid parse from multiple parsers' outputs. Off-the-shelf parsers are not developed with an intention to perform well in a collaborative ensemble. Two techniques are presented for producing an ensemble of parsers that collaborate. All of the ensemble members are created using the same underlying parser induction algorithm, and the method for producing complementary parsers is only loosely constrained by that chosen algorithm.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Exploiting Diversity for Natural Language Parsing does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Exploiting Diversity for Natural Language Parsing, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Exploiting Diversity for Natural Language Parsing will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-482399

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.