Empirical distribution of k-word matches in biological sequences

Biology – Quantitative Biology – Quantitative Methods

Scientific paper

Rate now

[ 0.00 ] – not rated yet Voters 0 Comments 0

Details Empirical distribution of k-word matches in biological sequences Empirical distribution of k-word matches in biological sequences

: 2008-03-14
: arxiv.org/abs/0803.2085v1
: Pattern Recognition 42 (2009) 539-548
: Biology
: Quantitative Biology
: Quantitative Methods

: 23 pages, 10 figures
: Scientific paper
: 10.1016/j.patcog.2008.06.026
: This study focuses on an alignment-free sequence comparison method: the number of words of length k shared between two sequences, also known as the D_2 statistic. The advantages of the use of this statistic over alignment-based methods are firstly that it does not assume that homologous segments are contiguous, and secondly that the algorithm is computationally extremely fast, the runtime being proportional to the size of the sequence under scrutiny. Existing applications of the D_2 statistic include the clustering of related sequences in large EST databases such as the STACK database. Such applications have typically relied on heuristics without any statistical basis. Rigorous statistical characterisations of the distribution of D_2 have subsequently been undertaken, but have focussed on the distribution's asymptotic behaviour, leaving the distribution of D_2 uncharacterised for most practical cases. The work presented here bridges these two worlds to give usable approximations of the distribution of D_2 for ranges of parameters most frequently encountered in the study of biological sequences.

Affiliated with

Burden Conrad J.

Physics – High Energy Physics – High Energy Physics - Theory

Scientist

[ 0.00 ] – not rated yet Voters 0 Comments 0

Foret Sylvain

Biology – Quantitative Biology – Quantitative Methods

Scientist

[ 0.00 ] – not rated yet Voters 0 Comments 0

Wilson Stephen R.

Biology – Quantitative Biology – Quantitative Methods

Scientist

[ 0.00 ] – not rated yet Voters 0 Comments 0

Also associated with

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Empirical distribution of k-word matches in biological sequences does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.
If you have personal experience with Empirical distribution of k-word matches in biological sequences, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Empirical distribution of k-word matches in biological sequences will most certainly appreciate the feedback.

Rate now

Comments { 0 }

Profile ID: LFWR-SCP-O-246346

All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.

Canada

Charities
Companies
MP Candidates
Patents
Employee Salary Disclosure

World

Places of the World
Scientific Papers

United States

Banks
Companies
Counties
Patents
Employee Salary Disclosure