On Weight Matrix and Free Energy Models for Sequence Motif Detection

Biology – Quantitative Biology – Genomics

Scientific paper


Details

23 pages, 1 figure and 4 tables


10.1089/cmb.2009.0142

The problem of motif detection can be formulated as the construction of a discriminant function that separates sequences matching a specific pattern from background sequences. In computational biology, motif detection is used to predict the DNA binding sites of a transcription factor (TF), mostly based on the weight matrix (WM) model or the Gibbs free energy (FE) model. However, despite the wide application of these two models, theoretical analysis of them and of their predictions is still lacking. We derive asymptotic error rates of prediction procedures based on these models under different data generation assumptions. This allows a theoretical comparison between the WM-based and the FE-based predictions in terms of asymptotic efficiency. Applications of the theoretical results are demonstrated with empirical studies on ChIP-seq data and protein binding microarray data. We find that, irrespective of the underlying data generation mechanism, the FE approach shows higher or comparable predictive power relative to the WM approach when the number of observed binding sites used for constructing a discriminant decision is not too small.
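
To make the idea of a discriminant function concrete, the following is a minimal Python sketch and not the procedure analyzed in the paper. It assumes a textbook log-odds weight matrix estimated from aligned sites with a uniform background and a pseudocount, and a two-state occupancy form for the free energy model. All function names, the pseudocount, the chemical potential `mu`, and the decision threshold are hypothetical choices made for illustration. In particular, the sketch reuses the negative WM score as the binding energy, whereas FE model parameters would normally be estimated separately.

```python
import math

BASES = "ACGT"


def build_weight_matrix(sites, pseudocount=1.0, background=None):
    """Estimate a log-odds weight matrix from aligned sites of equal length.

    Assumption: a uniform background unless one is supplied.
    """
    if background is None:
        background = {b: 0.25 for b in BASES}
    width = len(sites[0])
    matrix = []
    for pos in range(width):
        # Start every count at the pseudocount so no probability is zero.
        counts = {b: pseudocount for b in BASES}
        for site in sites:
            counts[site[pos]] += 1
        total = sum(counts.values())
        matrix.append(
            {b: math.log((counts[b] / total) / background[b]) for b in BASES}
        )
    return matrix


def wm_score(matrix, seq):
    """Sum the position-specific log-odds weights along the sequence."""
    return sum(column[base] for column, base in zip(matrix, seq))


def wm_predict(matrix, seq, threshold=0.0):
    """WM-style discriminant: call a site when the score exceeds a threshold."""
    return wm_score(matrix, seq) > threshold


def fe_binding_probability(matrix, seq, mu=0.0):
    """Two-state occupancy 1 / (1 + exp(E - mu)), energies in units of kT.

    Assumption: E is taken as the negative WM score, for illustration only.
    """
    energy = -wm_score(matrix, seq)
    return 1.0 / (1.0 + math.exp(energy - mu))


if __name__ == "__main__":
    sites = ["ACGT", "ACGT", "ACGA", "ACGT"]  # toy aligned binding sites
    matrix = build_weight_matrix(sites)
    for seq in ("ACGT", "TTTT"):
        print(seq, wm_predict(matrix, seq), fe_binding_probability(matrix, seq))
```

In this sketch the WM rule thresholds an additive score directly, while the FE rule passes an energy through a saturating occupancy function, which is one way the two models can rank candidate sites differently.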
