Computer Science – Computation and Language
Scientific paper
1997-05-20
In Roche E. and Y. Schabes, eds., Finite-State Language Processing, The MIT Press, Cambridge, MA, 1997, pages 383-406.
Computer Science
Computation and Language
22 pages. In E. Roche and Y. Schabes, eds., Finite State Devices for Natural Language Processing, MIT Press, Cambridge, Massac
Scientific paper
FASTUS is a system for extracting information from natural language text for entry into a database and for other applications. It works essentially as a cascaded, nondeterministic finite-state automaton. There are five stages in the operation of FASTUS. In Stage 1, names and other fixed form expressions are recognized. In Stage 2, basic noun groups, verb groups, and prepositions and some other particles are recognized. In Stage 3, certain complex noun groups and verb groups are constructed. Patterns for events of interest are identified in Stage 4 and corresponding ``event structures'' are built. In Stage 5, distinct event structures that describe the same event are identified and merged, and these are used in generating database entries. This decomposition of language processing enables the system to do exactly the right amount of domain-independent syntax, so that domain-dependent semantic and pragmatic processing can be applied to the right larger-scale structures. FASTUS is very efficient and effective, and has been used successfully in a number of applications.
Appelt Douglas
Bear John
Hobbs Jerry R.
Israel David
Kameyama Megumi
No associations
LandOfFree
FASTUS: A Cascaded Finite-State Transducer for Extracting Information from Natural-Language Text does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.
If you have personal experience with FASTUS: A Cascaded Finite-State Transducer for Extracting Information from Natural-Language Text, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and FASTUS: A Cascaded Finite-State Transducer for Extracting Information from Natural-Language Text will most certainly appreciate the feedback.
Profile ID: LFWR-SCP-O-219534