Statistics – Applications
Scientific paper
2012-04-17
Gaussian graphical models are often used to infer gene networks based on microarray expression data. Currently, however, many scientists are using high-throughput sequencing technologies to measure gene expression levels of all genes for a given sample. As the resulting high-dimensional data consist of counts of sequencing reads for each gene, Gaussian graphical models are not optimal for modeling gene networks based on these discrete data. We develop a novel method for estimating high-dimensional Poisson graphical models, the Log-Linear Graphical Model, allowing us to infer networks based on high-throughput sequencing data. Our model assumes that, conditional on all other genes, each gene is Poisson, jointly defining a pairwise Poisson Markov random field. We estimate our genetic networks via neighborhood selection by fitting ℓ1-norm penalized log-linear models, an approach we call the Poisson Graphical Lasso. Additionally, we develop a fast parallel algorithm, permitting us to fit our graphical models to high-dimensional genomic data sets. In simulations and a novel application of Markov networks to microRNA sequencing data, we illustrate the effectiveness of our methods for recovering genetic networks. Our estimated microRNA networks find known regulators of breast cancer genes and discover novel microRNA clusters and hubs that are targets for future research.
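The neighborhood-selection idea in the abstract can be illustrated with a small sketch: each gene's counts are regressed on all other genes with an ℓ1-penalized Poisson (log-linear) model, and nonzero coefficients are read as edges. This is not the authors' implementation. The solver (proximal gradient with backtracking), the standardization of predictors, the "OR" rule for symmetrizing edges, the penalty value, the simulated data, and all function names are assumptions made for illustration only.

```python
import numpy as np


def soft_threshold(z, t):
    """Proximal operator of t * ||.||_1 (elementwise shrinkage toward zero)."""
    return np.sign(z) * np.maximum(np.abs(z) - t, 0.0)


def poisson_loss(X, y, b0, b):
    """Mean Poisson negative log-likelihood, up to a constant in y."""
    eta = b0 + X @ b
    with np.errstate(over="ignore"):
        return np.mean(np.exp(eta) - y * eta)


def fit_l1_poisson(X, y, lam, n_iter=300):
    """L1-penalized Poisson regression; the intercept b0 is not penalized."""
    n, p = X.shape
    b0, b = np.log(y.mean() + 1e-8), np.zeros(p)
    step = 1.0
    for _ in range(n_iter):
        mu = np.exp(b0 + X @ b)
        g0, g = np.mean(mu - y), X.T @ (mu - y) / n
        f = poisson_loss(X, y, b0, b)
        step *= 2.0  # try a larger step first, then backtrack
        while step > 1e-12:
            nb0 = b0 - step * g0
            nb = soft_threshold(b - step * g, step * lam)
            d0, d = nb0 - b0, nb - b
            # Quadratic upper bound used as the sufficient-decrease test.
            bound = f + g0 * d0 + g @ d + (d0 ** 2 + d @ d) / (2.0 * step)
            if poisson_loss(X, y, nb0, nb) <= bound + 1e-12:
                break
            step *= 0.5
        b0, b = nb0, nb
    return b0, b


def poisson_neighborhood_selection(counts, lam):
    """Fit one penalized regression per gene; return a boolean adjacency matrix."""
    counts = np.asarray(counts, dtype=float)
    n, p = counts.shape
    sd = counts.std(axis=0)
    # Assumption of this sketch: predictors are standardized for stability.
    Z = (counts - counts.mean(axis=0)) / np.where(sd > 0, sd, 1.0)
    selected = np.zeros((p, p), dtype=bool)
    for j in range(p):  # the per-gene fits do not depend on each other
        others = np.delete(np.arange(p), j)
        _, b = fit_l1_poisson(Z[:, others], counts[:, j], lam)
        selected[j, others] = np.abs(b) > 1e-8
    # Assumption: "OR" rule, keep an edge if either regression selects it.
    return selected | selected.T


# Simulated toy data (hypothetical): gene 1 depends on gene 0, the rest are noise.
rng = np.random.default_rng(0)
n = 500
x0 = rng.poisson(3.0, size=n)
x1 = rng.poisson(np.exp(0.5 + 0.3 * (x0 - 3.0)))
noise = rng.poisson(3.0, size=(n, 3))
counts = np.column_stack([x0, x1, noise])
adjacency = poisson_neighborhood_selection(counts, lam=0.3)
print(adjacency.astype(int))
```

Because the loop over genes consists of independent regressions, it is a natural place to distribute work across processes, which is in the spirit of the parallel algorithm the abstract mentions; the sketch runs the fits serially for simplicity.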
Allen Genevera I.
Liu Zhandong
A Log-Linear Graphical Model for Inferring Genetic Networks from High-Throughput Sequencing Data