Principal components analysis in the space of phylogenetic trees

Mathematics – Statistics Theory

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Published in at http://dx.doi.org/10.1214/11-AOS915 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of

Scientific paper

10.1214/11-AOS915

Phylogenetic analysis of DNA or other data commonly gives rise to a collection or sample of inferred evolutionary trees. Principal Components Analysis (PCA) cannot be applied directly to collections of trees since the space of evolutionary trees on a fixed set of taxa is not a vector space. This paper describes a novel geometrical approach to PCA in tree-space that constructs the first principal path in an analogous way to standard linear Euclidean PCA. Given a data set of phylogenetic trees, a geodesic principal path is sought that maximizes the variance of the data under a form of projection onto the path. Due to the high dimensionality of tree-space and the nonlinear nature of this problem, the computational complexity is potentially very high, so approximate optimization algorithms are used to search for the optimal path. Principal paths identified in this way reveal and quantify the main sources of variation in the original collection of trees in terms of both topology and branch lengths. The approach is illustrated by application to simulated sets of trees and to a set of gene trees from metazoan (animal) species.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Principal components analysis in the space of phylogenetic trees does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Principal components analysis in the space of phylogenetic trees, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Principal components analysis in the space of phylogenetic trees will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-659500

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.