Statistics – Applications
Scientific paper
2011-01-05
Annals of Applied Statistics 2010, Vol. 4, No. 4, 1722-1748
Statistics
Applications
Published in at http://dx.doi.org/10.1214/10-AOAS349 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Ins
Scientific paper
10.1214/10-AOAS349
Massive datasets in the gigabyte and terabyte range combined with the availability of increasingly sophisticated statistical tools yield analyses at the boundary of what is computationally feasible. Compromising in the face of this computational burden by partitioning the dataset into more tractable sizes results in stratified analyses, removed from the context that justified the initial data collection. In a Bayesian framework, these stratified analyses generate intermediate realizations, often compared using point estimates that fail to account for the variability within and correlation between the distributions these realizations approximate. However, although the initial concession to stratify generally precludes the more sensible analysis using a single joint hierarchical model, we can circumvent this outcome and capitalize on the intermediate realizations by extending the dynamic iterative reweighting MCMC algorithm. In doing so, we reuse the available realizations by reweighting them with importance weights, recycling them into a now tractable joint hierarchical model. We apply this technique to intermediate realizations generated from stratified analyses of 687 influenza A genomes spanning 13 years allowing us to revisit hypotheses regarding the evolutionary history of influenza within a hierarchical statistical framework.
Sinsheimer Janet S.
Suchard Marc A.
Tom Jennifer A.
No associations
LandOfFree
Reuse, recycle, reweigh: Combating influenza through efficient sequential Bayesian computation for massive data does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.
If you have personal experience with Reuse, recycle, reweigh: Combating influenza through efficient sequential Bayesian computation for massive data, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Reuse, recycle, reweigh: Combating influenza through efficient sequential Bayesian computation for massive data will most certainly appreciate the feedback.
Profile ID: LFWR-SCP-O-266940