Manual Annotation of Translational Equivalence: The Blinker Project

Computer Science – Computation and Language

Scientific paper


Bilingual annotators were paid to link roughly sixteen thousand corresponding words between on-line versions of the Bible in modern French and modern English. These annotations are freely available to the research community from http://www.cis.upenn.edu/~melamed . The annotations can be used for several purposes. First, they can be used as a standard data set for developing and testing translation lexicons and statistical translation models. Second, researchers in lexical semantics will be able to mine the annotations for insights about cross-linguistic lexicalization patterns. Third, the annotations can be used in research into certain recently proposed methods for monolingual word-sense disambiguation. This paper describes the annotated texts, the specially designed annotation tool, and the strategies employed to increase the consistency of the annotations. The annotation process was repeated five times by different annotators. Inter-annotator agreement rates indicate that the annotations are reasonably reliable and that the method is easy to replicate.
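
The abstract does not say how the inter-annotator agreement rates were computed. As an illustration only, the sketch below assumes each annotator's output for a verse pair is a set of (French word position, English word position) links and scores agreement with the Dice coefficient over those sets, averaged across all annotator pairs. The function names, the link representation, and the toy data are hypothetical and are not taken from the paper or its annotation tool.

```python
from itertools import combinations
from typing import Dict, Set, Tuple

# Assumed representation: one link = (French word index, English word index).
Link = Tuple[int, int]


def dice_agreement(a: Set[Link], b: Set[Link]) -> float:
    """Dice coefficient between two annotators' link sets (1.0 = identical)."""
    if not a and not b:
        # Two empty annotations agree trivially.
        return 1.0
    return 2 * len(a & b) / (len(a) + len(b))


def mean_pairwise_agreement(annotations: Dict[str, Set[Link]]) -> float:
    """Average Dice agreement over every unordered pair of annotators."""
    if len(annotations) < 2:
        raise ValueError("need at least two annotators to measure agreement")
    pairs = list(combinations(annotations.values(), 2))
    return sum(dice_agreement(a, b) for a, b in pairs) / len(pairs)


# Hypothetical toy data for a single verse pair, three annotators.
toy_annotations = {
    "annotator_1": {(0, 0), (1, 1), (2, 3)},
    "annotator_2": {(0, 0), (1, 1), (2, 2)},
    "annotator_3": {(0, 0), (1, 1), (2, 3), (3, 2)},
}

if __name__ == "__main__":
    print(round(mean_pairwise_agreement(toy_annotations), 3))
```

With five annotation passes, as described above, the same function would average over ten annotator pairs. A set-overlap measure is only one option; a chance-corrected statistic would be another reasonable choice.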
