Computer Science – Computation and Language
Scientific paper
1997-09-15
ACL/EACL Workshop on Automatic Extraction and Building of Lexical Semantic Resources for Natural Language Applications, 1997
12 pages, 3 figures (2 tables)
Automatic text categorization is a complex and useful task for many natural language processing applications. Recent approaches to text categorization focus more on algorithms than on the resources involved in this operation. In contrast to this trend, we present an approach based on the integration of widely available resources, such as lexical databases and training collections, to overcome current limitations of the task. Our approach makes use of WordNet synonymy information to increase evidence for poorly trained categories. When testing a direct categorization approach, a WordNet-based one, a training algorithm, and our integrated approach, the latter exhibits better performance than any of the others. Incidentally, the performance of the WordNet-based approach is comparable to that of the training approach.
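The idea of adding synonymy evidence to poorly trained categories can be illustrated with a minimal sketch. This is not the authors' implementation: the bag-of-words vectors, the cosine similarity, the `lexical_weight` parameter, and the tiny `SYNONYMS` table (a hypothetical stand-in for WordNet synsets) are all assumptions made for illustration, and the category names and training sentences are invented toy data.

```python
from collections import Counter
from math import sqrt

# Hypothetical stand-in for WordNet synonymy: category name -> synonyms.
SYNONYMS = {
    "earnings": ["earnings", "profit", "income"],
    "wheat": ["wheat", "grain", "cereal"],
}

def tokenize(text):
    """Lowercase whitespace tokenizer (an assumption, kept deliberately simple)."""
    return text.lower().split()

def lexical_vector(category, weight=1.0):
    """Category vector built only from the category name and its synonyms."""
    return Counter({term: weight for term in SYNONYMS.get(category, [category])})

def trained_vector(docs):
    """Category vector built only from term counts in the training documents."""
    vec = Counter()
    for doc in docs:
        vec.update(tokenize(doc))
    return vec

def integrated_vector(category, docs, lexical_weight=2.0):
    """Training counts plus extra weight on the category's synonyms."""
    vec = trained_vector(docs)
    for term, w in lexical_vector(category, lexical_weight).items():
        vec[term] += w
    return vec

def cosine(a, b):
    """Cosine similarity between two sparse term vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def categorize(text, category_vectors):
    """Return the category whose vector is most similar to the document."""
    doc = Counter(tokenize(text))
    return max(category_vectors, key=lambda c: cosine(doc, category_vectors[c]))

# Toy training collection: "wheat" has no examples, so it is poorly trained.
TRAINING = {
    "earnings": ["company reports quarterly earnings rise",
                 "net earnings fell this quarter"],
    "wheat": [],
}

trained_only = {c: trained_vector(d) for c, d in TRAINING.items()}
integrated = {c: integrated_vector(c, d) for c, d in TRAINING.items()}
```

In this toy setting, `categorize("grain exports rise", trained_only)` has no evidence at all for the untrained category, whereas `categorize("grain exports rise", integrated)` can match the document through the synonym "grain".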
Buenaga Rodriguez Manuel de
Gomez Hidalgo Jose Maria
Integrating a Lexical Database and a Training Collection for Text Categorization