Annotation graphs as a framework for multidimensional linguistic data analysis

Computer Science – Computation and Language

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

10 pages, 10 figures, Towards Standards and Tools for Discourse Tagging, Proceedings of the Workshop. pp. 1-10. Association fo

Scientific paper

In recent work we have presented a formal framework for linguistic annotation based on labeled acyclic digraphs. These `annotation graphs' offer a simple yet powerful method for representing complex annotation structures incorporating hierarchy and overlap. Here, we motivate and illustrate our approach using discourse-level annotations of text and speech data drawn from the CALLHOME, COCONUT, MUC-7, DAMSL and TRAINS annotation schemes. With the help of domain specialists, we have constructed a hybrid multi-level annotation for a fragment of the Boston University Radio Speech Corpus which includes the following levels: segment, word, breath, ToBI, Tilt, Treebank, coreference and named entity. We show how annotation graphs can represent hybrid multi-level structures which derive from a diverse set of file formats. We also show how the approach facilitates substantive comparison of multiple annotations of a single signal based on different theoretical models. The discussion shows how annotation graphs open the door to wide-ranging integration of tools, formats and corpora.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Annotation graphs as a framework for multidimensional linguistic data analysis does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Annotation graphs as a framework for multidimensional linguistic data analysis, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Annotation graphs as a framework for multidimensional linguistic data analysis will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-31015

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.