Exploiting Syntactic Structure for Natural Language Modeling

Computer Science – Computation and Language

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Advisor: Frederick Jelinek, Ph.D. Thesis, 122 pages; removed unused .eps file

Scientific paper

The thesis presents an attempt at using the syntactic structure in natural language for improved language models for speech recognition. The structured language model merges techniques in automatic parsing and language modeling using an original probabilistic parameterization of a shift-reduce parser. A maximum likelihood reestimation procedure belonging to the class of expectation-maximization algorithms is employed for training the model. Experiments on the Wall Street Journal, Switchboard and Broadcast News corpora show improvement in both perplexity and word error rate - word lattice rescoring - over the standard 3-gram language model. The significance of the thesis lies in presenting an original approach to language modeling that uses the hierarchical - syntactic - structure in natural language to improve on current 3-gram modeling techniques for large vocabulary speech recognition.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Exploiting Syntactic Structure for Natural Language Modeling does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Exploiting Syntactic Structure for Natural Language Modeling, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Exploiting Syntactic Structure for Natural Language Modeling will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-308843

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.