Computer Science – Digital Libraries
Scientific paper
2000-07-04
Computer Science
Digital Libraries
10 pages. A short form published in DCC2000
Scientific paper
Text mining is about looking for patterns in natural language text, and may be defined as the process of analyzing text to extract information from it for particular purposes. In previous work, we claimed that compression is a key technology for text mining, and backed this up with a study that showed how particular kinds of lexical tokens---names, dates, locations, etc.---can be identified and located in running text, using compression models to provide the leverage necessary to distinguish different token types (Witten et al., 1999)
Bainbridge David
Witten Ian H.
Yeates Stuart
No associations
LandOfFree
Using compression to identify acronyms in text does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.
If you have personal experience with Using compression to identify acronyms in text, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Using compression to identify acronyms in text will most certainly appreciate the feedback.
Profile ID: LFWR-SCP-O-121320