Factorization of Language Models through Backing-Off Lattices

Computer Science – Computation and Language

Scientific paper

Details

7 pages and 1 figure; technical report; added one more related work and corrected some errors

Factorization of statistical language models is the task of resolving the most discriminative model into factored models and combining them into a new model that provides better estimates. Most previous work focuses on factorizing models of sequential events, each of which allows only one manner of factorization. To enable parallel factorization, which allows a model event to be resolved in more than one way at the same time, we propose a general framework with three parts: a backing-off lattice that reflects parallel factorizations and defines the paths along which a model is resolved into factored models; a mixture model that combines parallel paths in the lattice; and a generalization of Katz's backing-off method that integrates all the mixture models obtained by traversing the entire lattice. Based on this framework, we formulate two types of model factorization that are used in natural language modeling.
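
The abstract gives no formulas, so the following is only a minimal sketch of how the three ingredients (lattice, mixture, back-off) might fit together, not the authors' formulation. Everything in it is an assumption made for illustration: the names `children` and `LatticeLM`, the toy data, the uniform mixture weights, the fixed absolute discount used in place of Katz's Good-Turing discounting, and the add-one smoothing at the bottom of the lattice. A context is a tuple of features; each lattice edge drops one feature (marked `None`), so a node with two features has two parallel back-off paths.

```python
from collections import Counter
from itertools import combinations


def children(ctx):
    """Parallel factorizations of a node: drop any one remaining feature."""
    return [ctx[:i] + (None,) + ctx[i + 1:]
            for i, f in enumerate(ctx) if f is not None]


class LatticeLM:
    """Toy back-off lattice (illustrative assumptions, see lead-in)."""

    def __init__(self, data, vocab, discount=0.5):
        self.vocab = sorted(vocab)
        self.d = discount            # assumed absolute discount, 0 < d < 1
        self.counts = Counter()
        self.cache = {}
        for ctx, w in data:
            # Count the event at every lattice node below the full context.
            n = len(ctx)
            for r in range(n + 1):
                for keep in combinations(range(n), r):
                    node = tuple(ctx[i] if i in keep else None
                                 for i in range(n))
                    self.counts[(node, w)] += 1

    def dist(self, ctx):
        """Return {w: P(w | ctx)} for the lattice node ctx."""
        if ctx in self.cache:
            return self.cache[ctx]
        c = {w: self.counts[(ctx, w)] for w in self.vocab}
        total = sum(c.values())
        kids = children(ctx)
        if not kids:
            # Bottom node (no features left): add-one smoothed estimate.
            p = {w: (c[w] + 1) / (total + len(self.vocab))
                 for w in self.vocab}
        else:
            # Mixture model over the parallel paths (uniform weights assumed).
            kid_dists = [self.dist(k) for k in kids]
            mix = {w: sum(d[w] for d in kid_dists) / len(kid_dists)
                   for w in self.vocab}
            if total == 0:
                p = mix              # context never observed: back off fully
            else:
                seen = [w for w in self.vocab if c[w] > 0]
                unseen = [w for w in self.vocab if c[w] == 0]
                # Seen events keep a discounted relative frequency.
                p = {w: (c[w] - self.d) / total for w in seen}
                if unseen:
                    # Katz-style back-off weight: the freed mass is spread
                    # over unseen events in proportion to the mixture.
                    left = 1.0 - sum(p.values())
                    alpha = left / sum(mix[w] for w in unseen)
                    for w in unseen:
                        p[w] = alpha * mix[w]
                else:
                    z = sum(p.values())
                    p = {w: v / z for w, v in p.items()}
        self.cache[ctx] = p
        return p


# Hypothetical toy data: context = (previous word, its tag), event = next word.
data = [(("the", "DET"), "cat"), (("the", "DET"), "dog"),
        (("a", "DET"), "cat"), (("big", "ADJ"), "dog")]
vocab = {"cat", "dog", "fish"}
lm = LatticeLM(data, vocab)
print(lm.dist(("a", "DET")))
```

In this sketch, an event seen in the full context keeps its discounted estimate, and an unseen one receives probability from both lower nodes at once (word only and tag only) instead of from a single back-off chain. The distribution at every node sums to one by construction.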
