Soft Uncoupling of Markov Chains for Permeable Language Distinction: A New Algorithm

Computer Science – Computation and Language

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

6 pages, 7 embedded figures, LaTeX 2e using the ecai2006.cls document class and the algorithm2e.sty style file (+ standard pac

Scientific paper

Without prior knowledge, distinguishing different languages may be a hard task, especially when their borders are permeable. We develop an extension of spectral clustering -- a powerful unsupervised classification toolbox -- that is shown to resolve accurately the task of soft language distinction. At the heart of our approach, we replace the usual hard membership assignment of spectral clustering by a soft, probabilistic assignment, which also presents the advantage to bypass a well-known complexity bottleneck of the method. Furthermore, our approach relies on a novel, convenient construction of a Markov chain out of a corpus. Extensive experiments with a readily available system clearly display the potential of the method, which brings a visually appealing soft distinction of languages that may define altogether a whole corpus.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Soft Uncoupling of Markov Chains for Permeable Language Distinction: A New Algorithm does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Soft Uncoupling of Markov Chains for Permeable Language Distinction: A New Algorithm, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Soft Uncoupling of Markov Chains for Permeable Language Distinction: A New Algorithm will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-558112

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.