A Memory-Based Approach to Learning Shallow Natural Language Patterns

Computer Science – Computation and Language

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

27 pages. This is a revised and extended version of the paper presented in COLING-ACL '98. To appear in Journal of Experimenta

Scientific paper

Recognizing shallow linguistic patterns, such as basic syntactic relationships between words, is a common task in applied natural language and text processing. The common practice for approaching this task is by tedious manual definition of possible pattern structures, often in the form of regular expressions or finite automata. This paper presents a novel memory-based learning method that recognizes shallow patterns in new text based on a bracketed training corpus. The training data are stored as-is, in efficient suffix-tree data structures. Generalization is performed on-line at recognition time by comparing subsequences of the new text to positive and negative evidence in the corpus. This way, no information in the training is lost, as can happen in other learning systems that construct a single generalized model at the time of training. The paper presents experimental results for recognizing noun phrase, subject-verb and verb-object patterns in English. Since the learning approach enables easy porting to new domains, we plan to apply it to syntactic patterns in other languages and to sub-language patterns for information extraction.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

A Memory-Based Approach to Learning Shallow Natural Language Patterns does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with A Memory-Based Approach to Learning Shallow Natural Language Patterns, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and A Memory-Based Approach to Learning Shallow Natural Language Patterns will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-438071

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.