Computer Science – Computation and Language
Scientific paper
1997-10-30
Recent Advances in Natural Language Processing (RANLP-97), European Commission, DG XIII, Tzigov Chark, Bulgaria, September 199
Computer Science
Computation and Language
Scientific paper
This paper describes the automation of a new text categorization task. The categories assigned in this task are more syntactically, semantically, and contextually complex than those typically assigned by fully automatic systems that process unseen test data. Our system for assigning these categories is a probabilistic classifier, developed with a recent method for formulating a probabilistic model from a predefined set of potential features. This paper focuses on feature selection. It presents a number of fully automatic features. It identifies and evaluates various approaches to organizing collocational properties into features, and presents the results of experiments covarying type of organization and type of property. We find that one organization is not best for all kinds of properties, so this is an experimental parameter worth investigating in NLP systems. In addition, the results suggest a way to take advantage of properties that are low frequency but strongly indicative of a class. The problems of recognizing and organizing the various kinds of contextual information required to perform a linguistically complex categorization task have rarely been systematically investigated in NLP.
Bruce Rebecca
Duan Lei
Wiebe Janyce
No associations
LandOfFree
Probabilistic Event Categorization does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.
If you have personal experience with Probabilistic Event Categorization, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Probabilistic Event Categorization will most certainly appreciate the feedback.
Profile ID: LFWR-SCP-O-591305