Word Sense Disambiguation Using English-Spanish Aligned Phrases over Comparable Corpora

Computer Science – Computation and Language

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

latex2e, 8 pages, 1 figure, published in the Proceedings of Cross-Language Knowledge Induction Workshop, 2005 Cluj-Napoca, hel

Scientific paper

In this paper we describe a WSD experiment based on bilingual English-Spanish comparable corpora in which individual noun phrases have been identified and aligned with their respective counterparts in the other language. The evaluation of the experiment has been carried out against SemCor. We show that, with the alignment algorithm employed, potential precision is high (74.3%), however the coverage of the method is low (2.7%), due to alignments being far less frequent than we expected. Contrary to our intuition, precision does not rise consistently with the number of alignments. The coverage is low due to several factors; there are important domain differences, and English and Spanish are too close languages for this approach to be able to discriminate efficiently between senses, rendering it unsuitable for WSD, although the method may prove more productive in machine translation.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Word Sense Disambiguation Using English-Spanish Aligned Phrases over Comparable Corpora does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Word Sense Disambiguation Using English-Spanish Aligned Phrases over Comparable Corpora, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Word Sense Disambiguation Using English-Spanish Aligned Phrases over Comparable Corpora will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-405401

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.