Corpus-Driven Knowledge Acquisition for Discourse Analysis

Computer Science – Computation and Language

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

6 pages, AAAI-94

Scientific paper

The availability of large on-line text corpora provides a natural and promising bridge between the worlds of natural language processing (NLP) and machine learning (ML). In recent years, the NLP community has been aggressively investigating statistical techniques to drive part-of-speech taggers, but application-specific text corpora can be used to drive knowledge acquisition at much higher levels as well. In this paper we will show how ML techniques can be used to support knowledge acquisition for information extraction systems. It is often very difficult to specify an explicit domain model for many information extraction applications, and it is always labor intensive to implement hand-coded heuristics for each new domain. We have discovered that it is nevertheless possible to use ML algorithms in order to capture knowledge that is only implicitly present in a representative text corpus. Our work addresses issues traditionally associated with discourse analysis and intersentential inference generation, and demonstrates the utility of ML algorithms at this higher level of language analysis. The benefits of our work address the portability and scalability of information extraction (IE) technologies. When hand-coded heuristics are used to manage discourse analysis in an information extraction system, months of programming effort are easily needed to port a successful IE system to a new domain. We will show how ML algorithms can reduce this

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Corpus-Driven Knowledge Acquisition for Discourse Analysis does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Corpus-Driven Knowledge Acquisition for Discourse Analysis, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Corpus-Driven Knowledge Acquisition for Discourse Analysis will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-218287

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.