A Universal Part-of-Speech Tagset

Computer Science – Computation and Language

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Scientific paper

To facilitate future research in unsupervised induction of syntactic structure and to standardize best-practices, we propose a tagset that consists of twelve universal part-of-speech categories. In addition to the tagset, we develop a mapping from 25 different treebank tagsets to this universal set. As a result, when combined with the original treebank data, this universal tagset and mapping produce a dataset consisting of common parts-of-speech for 22 different languages. We highlight the use of this resource via two experiments, including one that reports competitive accuracies for unsupervised grammar induction without gold standard part-of-speech tags.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

A Universal Part-of-Speech Tagset does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with A Universal Part-of-Speech Tagset, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and A Universal Part-of-Speech Tagset will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-730929

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.