Applying Winnow to Context-Sensitive Spelling Correction

Computer Science – Computation and Language

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

9 pages

Scientific paper

Multiplicative weight-updating algorithms such as Winnow have been studied extensively in the COLT literature, but only recently have people started to use them in applications. In this paper, we apply a Winnow-based algorithm to a task in natural language: context-sensitive spelling correction. This is the task of fixing spelling errors that happen to result in valid words, such as substituting {\it to\/} for {\it too}, {\it casual\/} for {\it causal}, and so on. Previous approaches to this problem have been statistics-based; we compare Winnow to one of the more successful such approaches, which uses Bayesian classifiers. We find that: (1)~When the standard (heavily-pruned) set of features is used to describe problem instances, Winnow performs comparably to the Bayesian method; (2)~When the full (unpruned) set of features is used, Winnow is able to exploit the new features and convincingly outperform Bayes; and (3)~When a test set is encountered that is dissimilar to the training set, Winnow is better than Bayes at adapting to the unfamiliar test set, using a strategy we will present for combining learning on the training set with unsupervised learning on the (noisy) test set.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Applying Winnow to Context-Sensitive Spelling Correction does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Applying Winnow to Context-Sensitive Spelling Correction, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Applying Winnow to Context-Sensitive Spelling Correction will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-242085

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.