Probabilistic top-down parsing and language modeling

Computer Science – Computation and Language

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

28 pages, 6 tables, 8 figures. To appear in Computational Linguistics 27(2), June 2001

Scientific paper

This paper describes the functioning of a broad-coverage probabilistic top-down parser, and its application to the problem of language modeling for speech recognition. The paper first introduces key notions in language modeling and probabilistic parsing, and briefly reviews some previous approaches to using syntactic structure for language modeling. A lexicalized probabilistic top-down parser is then presented, which performs very well, in terms of both the accuracy of returned parses and the efficiency with which they are found, relative to the best broad-coverage statistical parsers. A new language model which utilizes probabilistic top-down parsing is then outlined, and empirical results show that it improves upon previous work in test corpus perplexity. Interpolation with a trigram model yields an exceptional improvement relative to the improvement observed by other models, demonstrating the degree to which the information captured by our parsing model is orthogonal to that captured by a trigram model. A small recognition experiment also demonstrates the utility of the model.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Probabilistic top-down parsing and language modeling does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Probabilistic top-down parsing and language modeling, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Probabilistic top-down parsing and language modeling will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-455075

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.