Robust and accurate data enrichment statistics via distribution function of sum of weights

Scientific paper

Details

Revised manuscript. 22 pages, 9 figures, 1 table

DOI: 10.1093/bioinformatics/btq511

Term enrichment analysis facilitates biological interpretation by assigning annotation, drawn from terms in controlled vocabularies, to experimentally or computationally obtained data. This process usually involves computing a statistical significance for each vocabulary term and using the most significant terms to describe a given set of biological entities, which often carry weights. Many existing enrichment methods require selecting an arbitrary number of the most significant entities and/or do not account for entity weights. Others either require extensive simulations to obtain statistics or assume that the weights are normally distributed. In addition, most methods have difficulty assigning correct statistical significance to terms with few entities. By implementing the well-known Lugannani-Rice formula, we have developed a novel approach, called SaddleSum, that is free from all of the aforementioned constraints, and we have evaluated it against several existing methods. With entity weights properly taken into account, SaddleSum is internally consistent and stable with respect to the number of most significant entities selected. Because it makes few assumptions about the input data, the proposed method is general and can be applied to areas beyond the analysis of microarrays. By employing an asymptotic approximation, SaddleSum provides a term-size-dependent score distribution function that yields accurate statistical significance even for terms with few entities. As a consequence, researchers can place confidence in the significance SaddleSum assigns to small terms, which are often the most biologically specific.
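
To make the idea in the abstract concrete, the following is a minimal sketch of a Lugannani-Rice tail approximation for a sum of weights. It is not the SaddleSum implementation. It assumes that a term's score is the sum of m weights drawn independently from the empirical distribution of all entity weights, and it omits any lattice or multiple-testing corrections. The function names `_tilted_moments` and `lugannani_rice_tail` are hypothetical and chosen only for illustration.

```python
import math


def _tilted_moments(weights, t):
    """Return K(t), K'(t), K''(t) for the empirical weight distribution,
    where K(t) = log(mean(exp(t * w))) is the cumulant generating function."""
    shift = max(t * w for w in weights)  # log-sum-exp shift avoids overflow
    exps = [math.exp(t * w - shift) for w in weights]
    z = sum(exps)
    mean = sum(w * e for w, e in zip(weights, exps)) / z
    var = sum((w - mean) ** 2 * e for w, e in zip(weights, exps)) / z
    k = shift + math.log(z / len(weights))
    return k, mean, var


def lugannani_rice_tail(weights, m, score, tol=1e-12, max_iter=200):
    """Approximate P(S >= score), where S is the sum of m independent draws
    from `weights` (an assumption of this sketch, not a claim about SaddleSum)."""
    lo_w, hi_w = min(weights), max(weights)
    if not (m * lo_w < score < m * hi_w):
        raise ValueError("score must lie strictly inside the attainable range")
    target = score / m  # required mean weight per entity
    mu = _tilted_moments(weights, 0.0)[1]
    if abs(target - mu) < 1e-6 * (hi_w - lo_w):
        raise ValueError("the formula is singular at the mean of the weights")

    # Solve the saddlepoint equation K'(t) = target by bisection;
    # K' is increasing in t, so a sign-changing bracket always exists.
    lo, hi = -1.0, 1.0
    while _tilted_moments(weights, lo)[1] > target:
        lo *= 2.0
    while _tilted_moments(weights, hi)[1] < target:
        hi *= 2.0
    for _ in range(max_iter):
        mid = 0.5 * (lo + hi)
        if _tilted_moments(weights, mid)[1] < target:
            lo = mid
        else:
            hi = mid
        if hi - lo < tol:
            break
    t = 0.5 * (lo + hi)

    k, _, var = _tilted_moments(weights, t)
    # Lugannani-Rice: P(S >= s) ~ 1 - Phi(w) + phi(w) * (1/u - 1/w)
    w = math.copysign(math.sqrt(2.0 * m * (t * target - k)), t)
    u = t * math.sqrt(m * var)
    phi = math.exp(-0.5 * w * w) / math.sqrt(2.0 * math.pi)
    upper_normal = 0.5 * math.erfc(w / math.sqrt(2.0))
    return upper_normal + phi * (1.0 / u - 1.0 / w)
```

A call such as `lugannani_rice_tail(all_weights, m=5, score=4.0)` would return an approximate P-value for a term with five entities whose weights sum to 4.0. Because m enters both `w` and `u`, the approximated distribution depends on term size, which is the property the abstract highlights for terms with few entities.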
