Statistics – Applications
Scientific paper
2012-04-06
24 pages, 9 figures
Biological sequences may contain patterns that signal important biomolecular functions; a classical example is the regulation of gene expression by transcription factors that bind to specific patterns in genomic promoter regions. In motif discovery we are given a set of sequences that share a common motif and aim to identify not only the motif composition, but also the binding sites in each sequence of the set. We present a Bayesian model that extends the model adopted by the Gibbs motif sampler, and propose a new centroid estimator that arises from a refined and meaningful loss function for binding site inference. We discuss the main advantages of centroid estimation for motif discovery, including computational convenience, and how its principled derivation offers further insights into the posterior distribution of binding site configurations. We also illustrate, using simulated and real datasets, that the centroid estimator can differ from the maximum a posteriori estimator.
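To make the contrast between the two estimators concrete, here is a minimal sketch in Python. It is not the model or the refined loss function of the paper: it assumes a plain Hamming loss over binary indicator vectors (1 meaning "a binding site starts at this position") and a made-up three-position posterior. All names (`posterior`, `map_estimate`, `centroid_estimate`, `marginal_threshold`) are hypothetical and chosen only for illustration.

```python
from itertools import product

# Hypothetical toy posterior over binary site-indicator vectors.
# The probabilities are invented for illustration and sum to 1.
posterior = {
    (0, 0, 0): 0.4,
    (1, 1, 0): 0.3,
    (1, 0, 1): 0.3,
}


def hamming(a, b):
    """Number of positions at which two configurations disagree."""
    return sum(x != y for x, y in zip(a, b))


def expected_loss(candidate, posterior):
    """Posterior expected Hamming loss of a candidate configuration."""
    return sum(p * hamming(candidate, config) for config, p in posterior.items())


def map_estimate(posterior):
    """Maximum a posteriori estimate: the single most probable configuration."""
    return max(posterior, key=posterior.get)


def centroid_estimate(posterior):
    """Centroid under Hamming loss, found by brute force over all binary vectors.

    Exhaustive search is only feasible for tiny examples like this one.
    """
    n = len(next(iter(posterior)))
    candidates = product((0, 1), repeat=n)
    return min(candidates, key=lambda c: expected_loss(c, posterior))


def marginal_threshold(posterior):
    """Shortcut for unconstrained Hamming loss: keep positions whose
    marginal posterior probability exceeds 1/2."""
    n = len(next(iter(posterior)))
    marginals = [
        sum(p for config, p in posterior.items() if config[i]) for i in range(n)
    ]
    return tuple(int(m > 0.5) for m in marginals)


if __name__ == "__main__":
    print("MAP:     ", map_estimate(posterior))
    print("Centroid:", centroid_estimate(posterior))
    print("Marginal:", marginal_threshold(posterior))
```

In this toy case the most probable configuration has no site at any position, yet the first position carries a site in 60% of the posterior mass, so the Hamming-loss centroid includes it and the two estimators disagree. The marginal-thresholding shortcut is what makes this kind of estimator computationally convenient in the unconstrained Hamming setting; whether the same shortcut applies under the paper's refined loss is not stated in the abstract.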
Title: Bayesian Centroid Estimation for Motif Discovery