Scientific paper
2006-07-11
Computer Science
Learning
14 pages
A standard approach in pattern classification is to estimate the distributions of the label classes, and then to apply the Bayes classifier to the estimates of the distributions in order to classify unlabeled examples. As one might expect, the better our estimates of the label class distributions, the better the resulting classifier will be. In this paper we make this observation precise by identifying risk bounds of a classifier in terms of the quality of the estimates of the label class distributions. We show how PAC learnability relates to estimates of the distributions that have a PAC guarantee on their $L_1$ distance from the true distribution, and we bound the increase in negative log likelihood risk in terms of PAC bounds on the KL-divergence. We give an inefficient but general-purpose smoothing method for converting an estimated distribution that is good under the $L_1$ metric into a distribution that is good under the KL-divergence.
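The pipeline the abstract describes (estimate each label class distribution, plug the estimates into the Bayes classifier, and measure estimate quality by $L_1$ distance or KL-divergence) can be illustrated on a small finite domain. The following is a minimal sketch under stated assumptions, not the paper's construction: the four-point domain, the example probabilities, the function names, and in particular the choice of smoothing (mixing with the uniform distribution, controlled by a hypothetical parameter `gamma`) are all illustrative. It shows why smoothing matters: an estimate can be close in $L_1$ yet have infinite KL-divergence when it assigns zero mass to a point the true distribution can produce.

```python
import math


def l1_distance(p, q):
    """L1 distance between two distributions given as equal-length lists."""
    return sum(abs(a - b) for a, b in zip(p, q))


def kl_divergence(p, q):
    """KL(p || q) in nats; infinite if q has zero mass where p does not."""
    total = 0.0
    for a, b in zip(p, q):
        if a == 0.0:
            continue  # terms with p(x) = 0 contribute nothing
        if b == 0.0:
            return math.inf
        total += a * math.log(a / b)
    return total


def smooth(q, gamma):
    """Mix q with the uniform distribution.

    Illustrative assumption only: this is one simple smoothing scheme,
    not necessarily the method given in the paper.
    """
    n = len(q)
    return [(1.0 - gamma) * b + gamma / n for b in q]


def bayes_classify(x, priors, class_dists):
    """Plug-in Bayes classifier over a finite domain.

    Returns the label maximizing prior[label] * estimated P(x | label).
    """
    return max(priors, key=lambda label: priors[label] * class_dists[label][x])


# Hypothetical true distribution and an estimate that misses the last point.
true_p = [0.4, 0.3, 0.2, 0.1]
estimate = [0.45, 0.35, 0.2, 0.0]
smoothed = smooth(estimate, gamma=0.1)

# Hypothetical estimated class distributions for two labels.
priors = {"a": 0.5, "b": 0.5}
class_dists = {
    "a": [0.7, 0.2, 0.1, 0.0],
    "b": [0.1, 0.2, 0.3, 0.4],
}

if __name__ == "__main__":
    print("L1(true, estimate):", l1_distance(true_p, estimate))
    print("KL(true || estimate):", kl_divergence(true_p, estimate))
    print("KL(true || smoothed):", kl_divergence(true_p, smoothed))
    print("label for x=0:", bayes_classify(0, priors, class_dists))
    print("label for x=3:", bayes_classify(3, priors, class_dists))
```

In this sketch the unsmoothed estimate has infinite KL-divergence from the true distribution, while the smoothed one has finite KL-divergence. Mixing with the uniform distribution moves the estimate by at most `2 * gamma` in $L_1$, so a small `gamma` trades a small $L_1$ perturbation for a finite KL-divergence.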
Goldberg Paul W.
Palmer Nick
PAC Classification based on PAC Estimates of Label Class Distributions