A Word-to-Word Model of Translational Equivalence

Computer Science – Computation and Language

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

8 pages; this version has some typos corrected

Scientific paper

Many multilingual NLP applications need to translate words between different languages, but cannot afford the computational expense of inducing or applying a full translation model. For these applications, we have designed a fast algorithm for estimating a partial translation model, which accounts for translational equivalence only at the word level. The model's precision/recall trade-off can be directly controlled via one threshold parameter. This feature makes the model more suitable for applications that are not fully statistical. The model's hidden parameters can be easily conditioned on information extrinsic to the model, providing an easy way to integrate pre-existing knowledge such as part-of-speech, dictionaries, word order, etc.. Our model can link word tokens in parallel texts as well as other translation models in the literature. Unlike other translation models, it can automatically produce dictionary-sized translation lexicons, and it can do so with over 99% accuracy.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

A Word-to-Word Model of Translational Equivalence does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with A Word-to-Word Model of Translational Equivalence, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and A Word-to-Word Model of Translational Equivalence will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-70057

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.