Statistics – Applications
Scientific paper
2007-12-12
Annals of Applied Statistics 2007, Vol. 1, No. 2, 459-479
Statistics
Applications
Published in at http://dx.doi.org/10.1214/07-AOAS121 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Ins
Scientific paper
10.1214/07-AOAS121
Estimation of the allele frequency at genetic markers is a key ingredient in biological and biomedical research, such as studies of human genetic variation or of the genetic etiology of heritable traits. As genetic data becomes increasingly available, investigators face a dilemma: when should data from other studies and population subgroups be pooled with the primary data? Pooling additional samples will generally reduce the variance of the frequency estimates; however, used inappropriately, pooled estimates can be severely biased due to population stratification. Because of this potential bias, most investigators avoid pooling, even for samples with the same ethnic background and residing on the same continent. Here, we propose an empirical Bayes approach for estimating allele frequencies of single nucleotide polymorphisms. This procedure adaptively incorporates genotypes from related samples, so that more similar samples have a greater influence on the estimates. In every example we have considered, our estimator achieves a mean squared error (MSE) that is smaller than either pooling or not, and sometimes substantially improves over both extremes. The bias introduced is small, as is shown by a simulation study that is carefully matched to a real data example. Our method is particularly useful when small groups of individuals are genotyped at a large number of markers, a situation we are likely to encounter in a genome-wide association study.
Coram Marc
Tang Hua
No associations
LandOfFree
Improving population-specific allele frequency estimates by adapting supplemental data: an empirical Bayes approach does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.
If you have personal experience with Improving population-specific allele frequency estimates by adapting supplemental data: an empirical Bayes approach, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Improving population-specific allele frequency estimates by adapting supplemental data: an empirical Bayes approach will most certainly appreciate the feedback.
Profile ID: LFWR-SCP-O-661583