Computer Science – Computation and Language
Scientific paper
1995-09-07
Eighth Australian Joint Conference on Artificial Intelligence, Canberra, 1995.
Computer Science
Computation and Language
8 pages
Scientific paper
In this paper I address the practical concern of predicting how much training data is sufficient for a statistical language learning system. First, I briefly review earlier results and show how these can be combined to bound the expected accuracy of a mode-based learner as a function of the volume of training data. I then develop a more accurate estimate of the expected accuracy function under the assumption that inputs are uniformly distributed. Since this estimate is expensive to compute, I also give a close but cheaply computable approximation to it. Finally, I report on a series of simulations exploring the effects of inputs that are not uniformly distributed. Although these results are based on simplistic assumptions, they are a tentative step toward a useful theory of data requirements for SLL systems.
No associations
LandOfFree
Conserving Fuel in Statistical Language Learning: Predicting Data Requirements does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.
If you have personal experience with Conserving Fuel in Statistical Language Learning: Predicting Data Requirements, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Conserving Fuel in Statistical Language Learning: Predicting Data Requirements will most certainly appreciate the feedback.
Profile ID: LFWR-SCP-O-158148