ATLAS: A flexible and extensible architecture for linguistic annotation

Computer Science – Computation and Language

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

8 pages, 9 figures

Scientific paper

We describe a formal model for annotating linguistic artifacts, from which we derive an application programming interface (API) to a suite of tools for manipulating these annotations. The abstract logical model provides for a range of storage formats and promotes the reuse of tools that interact through this API. We focus first on ``Annotation Graphs,'' a graph model for annotations on linear signals (such as text and speech) indexed by intervals, for which efficient database storage and querying techniques are applicable. We note how a wide range of existing annotated corpora can be mapped to this annotation graph model. This model is then generalized to encompass a wider variety of linguistic ``signals,'' including both naturally occuring phenomena (as recorded in images, video, multi-modal interactions, etc.), as well as the derived resources that are increasingly important to the engineering of natural language processing systems (such as word lists, dictionaries, aligned bilingual corpora, etc.). We conclude with a review of the current efforts towards implementing key pieces of this architecture.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

ATLAS: A flexible and extensible architecture for linguistic annotation does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with ATLAS: A flexible and extensible architecture for linguistic annotation, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and ATLAS: A flexible and extensible architecture for linguistic annotation will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-415564

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.