Unsupervised Discovery of Morphemes

Computer Science – Computation and Language

Scientific paper

Details

10 pages, to appear in Proceedings of Morphological and Phonological Learning Workshop of ACL'02

We present two methods for unsupervised segmentation of words into morpheme-like units. The model utilized is especially suited for languages with a rich morphology, such as Finnish. The first method is based on the Minimum Description Length (MDL) principle and works online. In the second method, Maximum Likelihood (ML) optimization is used. The quality of the segmentations is measured using an evaluation method that compares the segmentations produced to an existing morphological analysis. Experiments on both Finnish and English corpora show that the presented methods perform well compared to a current state-of-the-art system.
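
The MDL idea mentioned in the abstract can be illustrated with a small sketch. This is not the authors' model: it assumes a simple two-part cost (bits to spell out each distinct morph once, plus bits to encode the corpus as a sequence of morph tokens under unigram maximum-likelihood probabilities). The function name `description_length`, the constant `BITS_PER_CHAR`, and the toy English words are hypothetical choices made for illustration only.

```python
import math
from collections import Counter

# Assumption: every letter in the morph lexicon costs a fixed number of bits.
BITS_PER_CHAR = 5


def description_length(segmented_words):
    """Return a two-part cost in bits: lexicon cost plus corpus cost.

    segmented_words is a list of words, each given as a list of morph strings.
    """
    tokens = [morph for word in segmented_words for morph in word]
    counts = Counter(tokens)
    total = len(tokens)
    # Corpus cost: negative log2-likelihood of all morph tokens, using
    # relative frequencies as the maximum-likelihood probabilities.
    corpus_cost = -sum(c * math.log2(c / total) for c in counts.values())
    # Lexicon cost: each distinct morph is spelled out once.
    lexicon_cost = sum(BITS_PER_CHAR * len(morph) for morph in counts)
    return corpus_cost + lexicon_cost


# Two candidate analyses of the same four-word toy corpus.
unsplit = [["walked"], ["walking"], ["talked"], ["talking"]]
split = [["walk", "ed"], ["walk", "ing"], ["talk", "ed"], ["talk", "ing"]]

cost_unsplit = description_length(unsplit)
cost_split = description_length(split)
print(cost_unsplit, cost_split)
```

Under these assumptions the segmented analysis is cheaper, because the shared stems and suffixes are stored only once in the lexicon. A search procedure would compare such costs across candidate splits and keep the cheapest; the search strategy itself is not shown here.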
