Statistics – Methodology
Scientific paper
2012-02-27
Statistics
Methodology
21 pages, including 6 figures and 5 pages of appendices
Scientific paper
Two challenging problems in the clinical study of cancer are the characterization of cancer subtypes and the classification of individual patients according to those subtypes. Statistical approaches addressing these problems are hampered by population heterogeneity and challenges inherent in data integration across high-dimensional, diverse covariates. We have developed a survival-supervised latent Dirichlet allocation (survLDA) modeling framework to address these concerns. LDA models have proven extremely effective at identifying themes common across large collections of text, but applications to genomics have been limited. Our framework extends LDA to the genome by considering each patient as a `document' with `text' constructed from clinical and high-dimensional genomic measurements. We then further extend the framework to allow for supervision by a time-to-event response. The model enables the efficient identification of collections of clinical and genomic features that co-occur within patient subgroups, and then characterizes each patient by those features. An application of survLDA to The Cancer Genome Atlas (TCGA) ovarian project identifies informative patient subgroups that are characterized by different propensities for exhibiting abnormal mRNA expression and methylations, corresponding to differential rates of survival from primary therapy.
Dawson John A.
Kendziorski Christina
No associations
LandOfFree
Survival-supervised latent Dirichlet allocation models for genomic analysis of time-to-event outcomes does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.
If you have personal experience with Survival-supervised latent Dirichlet allocation models for genomic analysis of time-to-event outcomes, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Survival-supervised latent Dirichlet allocation models for genomic analysis of time-to-event outcomes will most certainly appreciate the feedback.
Profile ID: LFWR-SCP-O-263969