Mathematics – Probability
Scientific paper
2011-07-31
Mathematics
Probability
Scientific paper
Mutation rate variation across loci is well known to cause difficulties, notably identifiability issues, in the reconstruction of evolutionary trees from molecular sequences. Here we introduce a new approach for estimating general rates-across-sites models. Our results imply, in particular, that large phylogenies are typically identifiable under rate variation. We also derive sequence-length requirements for high-probability reconstruction. Our main contribution is a novel algorithm that clusters sites according to their mutation rate. Following this site clustering step, standard reconstruction techniques can be used to recover the phylogeny. Our results rely on a basic insight: that, for large trees, certain site statistics experience concentration-of-measure phenomena.
Mossel Elchanan
Roch Sebastien
No associations
LandOfFree
Identifiability and inference of non-parametric rates-across-sites models on large-scale phylogenies does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.
If you have personal experience with Identifiability and inference of non-parametric rates-across-sites models on large-scale phylogenies, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Identifiability and inference of non-parametric rates-across-sites models on large-scale phylogenies will most certainly appreciate the feedback.
Profile ID: LFWR-SCP-O-702485