Computer Science – Computation and Language
Scientific paper
1994-09-22
Proc. AMTA-94
8 pages, uuencoded, compressed PostScript
We propose a new algorithm, DK-vec, for aligning Asian/Indo-European pairs of noisy parallel texts without sentence boundaries. DK-vec improves on previous alignment algorithms in that it better handles the non-linear nature of noisy corpora. The algorithm uses frequency, position, and recency information as features for pattern matching, with Dynamic Time Warping as the technique for matching word pairs. It produces a small bilingual lexicon, which provides anchor points for alignment.
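The matching step named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that a word's recency feature is the list of gaps between its successive occurrence positions, and it scores a candidate word pair with the textbook Dynamic Time Warping recurrence. The function names `recency_vector` and `dtw_distance` and the sample positions are hypothetical.

```python
from typing import List, Sequence


def recency_vector(positions: Sequence[int]) -> List[int]:
    """Gaps between successive occurrence positions of one word.

    Assumption: this is one plausible encoding of "recency"; the
    abstract does not define the feature precisely.
    """
    return [later - earlier for earlier, later in zip(positions, positions[1:])]


def dtw_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Textbook dynamic time warping cost between two numeric sequences."""
    inf = float("inf")
    n, m = len(a), len(b)
    # cost[i][j] is the minimal cumulative cost of aligning a[:i] with b[:j].
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            step = abs(a[i - 1] - b[j - 1])
            cost[i][j] = step + min(
                cost[i - 1][j],      # advance in a only
                cost[i][j - 1],      # advance in b only
                cost[i - 1][j - 1],  # advance in both
            )
    return cost[n][m]


# Hypothetical word positions in the two texts (illustration only).
source_word = recency_vector([2, 10, 25, 40])   # gaps: 8, 15, 15
candidate_a = recency_vector([3, 12, 26, 43])   # gaps: 9, 14, 17
candidate_b = recency_vector([5, 6, 7, 50])     # gaps: 1, 1, 43

print(dtw_distance(source_word, candidate_a))  # 4.0
print(dtw_distance(source_word, candidate_b))  # 49.0
```

In this toy setting the lower warping cost marks `candidate_a` as the more plausible translation of the source word. Word pairs selected this way would be candidates for the small bilingual lexicon whose entries serve as anchor points.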
Fung, Pascale
McKeown, Kathleen
Aligning Noisy Parallel Corpora Across Language Groups: Word Pair Feature Matching by Dynamic Time Warping