Pearl: A Probabilistic Chart Parser

Computer Science – Computation and Language

Scientific paper

Details

7 pages

This paper describes a natural language parsing algorithm for unrestricted text which uses a probability-based scoring function to select the "best" parse of a sentence. The parser, Pearl, is a time-asynchronous bottom-up chart parser with Earley-type top-down prediction which pursues the highest-scoring theory in the chart, where the score of a theory represents the extent to which the context of the sentence predicts that interpretation. This parser differs from previous attempts at stochastic parsers in that it uses a richer form of conditional probabilities based on context to predict likelihood. Pearl also provides a framework for incorporating the results of previous work in part-of-speech assignment, unknown word models, and other probabilistic models of linguistic features into one parsing tool, interleaving these techniques instead of using the traditional pipeline architecture. In preliminary tests, Pearl has been successful at resolving part-of-speech and word (in speech processing) ambiguity, determining categories for unknown words, and selecting correct parses first using a very loosely fitting covering grammar.
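
To make the idea of "pursuing the highest-scoring theory in the chart" concrete, here is a minimal sketch of an agenda-driven, best-first bottom-up chart parser. It is not Pearl's implementation. It assumes a plain probabilistic context-free grammar in Chomsky normal form, scores each theory by the product of its rule probabilities, and omits both the Earley-type top-down prediction and the context-conditioned probabilities the abstract describes. The grammar, lexicon, probabilities, and the name `best_first_parse` are all invented for illustration.

```python
import heapq
import math

# Hypothetical toy grammar in Chomsky normal form; all probabilities are invented.
BINARY = {  # (left child, right child) -> [(parent, probability)]
    ("NP", "VP"): [("S", 1.0)],
    ("Det", "N"): [("NP", 0.6)],
    ("V", "NP"): [("VP", 1.0)],
}
LEXICON = {  # word -> [(part of speech, probability)]
    "the": [("Det", 1.0)],
    "dog": [("N", 0.5)],
    "saw": [("V", 0.7), ("N", 0.1)],  # ambiguous part of speech
    "cat": [("N", 0.4)],
}


def best_first_parse(words, goal="S"):
    """Return (probability, tree) for the best-scoring parse, or None."""
    n = len(words)
    agenda = []   # min-heap of (cost, tiebreak, label, start, end, tree)
    counter = 0   # unique tiebreak so the heap never compares trees

    def push(cost, label, start, end, tree):
        nonlocal counter
        heapq.heappush(agenda, (cost, counter, label, start, end, tree))
        counter += 1

    # Seed the agenda with every part-of-speech hypothesis for every word.
    for i, word in enumerate(words):
        for tag, p in LEXICON.get(word, []):
            push(-math.log(p), tag, i, i + 1, (tag, word))

    chart = {}     # (label, start, end) -> (cost, tree), finalized edges only
    by_start = {}  # start position -> [(label, end)]
    by_end = {}    # end position -> [(label, start)]

    while agenda:
        # Always pursue the cheapest (most probable) theory next.
        cost, _, label, i, j, tree = heapq.heappop(agenda)
        if (label, i, j) in chart:
            continue  # a version at least as good was already finalized
        chart[(label, i, j)] = (cost, tree)
        if label == goal and i == 0 and j == n:
            return math.exp(-cost), tree
        by_start.setdefault(i, []).append((label, j))
        by_end.setdefault(j, []).append((label, i))

        # Combine with finalized edges that start where this one ends.
        for right_label, k in by_start.get(j, []):
            right_cost, right_tree = chart[(right_label, j, k)]
            for parent, p in BINARY.get((label, right_label), []):
                push(cost + right_cost - math.log(p), parent, i, k,
                     (parent, tree, right_tree))
        # Combine with finalized edges that end where this one starts.
        for left_label, h in by_end.get(i, []):
            left_cost, left_tree = chart[(left_label, h, i)]
            for parent, p in BINARY.get((left_label, label), []):
                push(left_cost + cost - math.log(p), parent, h, j,
                     (parent, left_tree, tree))
    return None
```

Calling `best_first_parse("the dog saw the cat".split())` returns the probability and a nested-tuple tree for the sentence. Because costs are negative log probabilities and therefore never negative, the first complete `S` edge taken off the agenda is the most probable one under this toy grammar. The part-of-speech choice for "saw" is resolved during parsing rather than by a separate tagging pass, a small-scale analogue of the interleaving the abstract describes.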

Profile ID: LFWR-SCP-O-8785

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.