Di-nucleotide Entropy as a Measure of Genomic Sequence Functionality

Biology – Quantitative Biology – Genomics

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

10 pages, 7 figures, grammatical revision

Scientific paper

Considering vast amounts of genomic sequences of mostly unknown functionality, in-silico prediction of functional regions is an important enterprise. Many genomic browsers employ GC content, which was observed to be elevated in gene-rich functional regions. This report shows that the entropy of di- and tri-nucleotides distributions provides a superior measure of genomic sequence functionality, and proposes an explanation on why the GC content must be elevated (closer to 50%) in functional regions. Regions with high entropy strongly co-localize with exons and provide genome-wide evidences of purifying selection acting on non-coding regions, such as decreased SNPs density. The observations suggest that functional non-coding regions are optimised for mutation load in a way, that transition mutations have less impact on functionality than transversions, leading to the decrease in transversions to transitions ratio in functional regions.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Di-nucleotide Entropy as a Measure of Genomic Sequence Functionality does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Di-nucleotide Entropy as a Measure of Genomic Sequence Functionality, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Di-nucleotide Entropy as a Measure of Genomic Sequence Functionality will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-338584

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.