Integrating a Lexical Database and a Training Collection for Text Categorization

Computer Science – Computation and Language

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

12 pages, 3 figures (2 tables)

Scientific paper

Automatic text categorization is a complex and useful task for many natural language processing applications. Recent approaches to text categorization focus more on algorithms than on resources involved in this operation. In contrast to this trend, we present an approach based on the integration of widely available resources as lexical databases and training collections to overcome current limitations of the task. Our approach makes use of WordNet synonymy information to increase evidence for bad trained categories. When testing a direct categorization, a WordNet based one, a training algorithm, and our integrated approach, the latter exhibits a better perfomance than any of the others. Incidentally, WordNet based approach perfomance is comparable with the training approach one.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Integrating a Lexical Database and a Training Collection for Text Categorization does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Integrating a Lexical Database and a Training Collection for Text Categorization, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Integrating a Lexical Database and a Training Collection for Text Categorization will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-696121

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.