Resolution of Unidentified Words in Machine Translation

Computer Science – Computation and Language

Scientific paper

Details

4 pages, 2 figures, 2 tables, The 4th annual International New Exploratory Technologies Conference 2007 (NEXT 2007), Seoul, So

This paper presents a mechanism for resolving unidentified lexical units in Text-based Machine Translation (TBMT). A Machine Translation (MT) system is unlikely to have a complete lexicon, so it needs a mechanism for handling unidentified words. These unknown words may be abbreviations, names, acronyms, or newly introduced terms. We propose an algorithm for resolving them. The algorithm takes the discourse unit (primitive discourse) as its unit of analysis and updates the lexicon in real time. We applied the algorithm manually to newspaper fragments. Along with resolving anaphora and cataphora, this added many unknown words, especially names and abbreviations, to the lexicon.
