Reference Sequence Construction for Relative Compression of Genomes

Biology – Quantitative Biology – Quantitative Methods

Scientific paper

Details

12 pages, 2 figures; to appear in the Proceedings of SPIRE 2011 as a short paper

Relative compression, where a set of similar strings is compressed with respect to a reference string, is a very effective method of compressing DNA datasets containing multiple similar sequences. Relative compression is fast to perform and also supports rapid random access to the underlying data. The main difficulty of relative compression lies in selecting an appropriate reference sequence. In this paper, we explore using the dictionaries of repeats generated by the Comrad, Re-pair and Dna-x algorithms as reference sequences for relative compression. We show that this technique achieves better compression while supporting random access equally well. It also allows more general repetitive datasets to be compressed using relative compression.
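
To make the idea of relative compression concrete, the following is a minimal sketch, not the paper's implementation. It assumes a naive greedy parse that encodes the target as (position, length) copies from the reference, plus a prefix-sum index for random access. All function names are hypothetical, and the reference here is an arbitrary example string rather than a dictionary produced by Comrad, Re-pair or Dna-x.

```python
from bisect import bisect_right
from itertools import accumulate


def rlz_compress(reference, target):
    """Greedily parse target into (position, length) copies from reference.

    A character that never occurs in the reference is stored as a
    literal factor (char, 0). The search is naive and meant only to
    illustrate the parse, not to be efficient.
    """
    factors = []
    i = 0
    while i < len(target):
        pos, length = -1, 0
        # Grow the match while the longer substring still occurs in reference.
        while i + length < len(target):
            found = reference.find(target[i:i + length + 1])
            if found < 0:
                break
            pos, length = found, length + 1
        if length == 0:
            factors.append((target[i], 0))  # literal
            i += 1
        else:
            factors.append((pos, length))
            i += length
    return factors


def rlz_decompress(reference, factors):
    """Rebuild the target by copying each factor out of the reference."""
    return "".join(
        pos if length == 0 else reference[pos:pos + length]
        for pos, length in factors
    )


def build_index(factors):
    """Return the offset in the decoded text at which each factor starts."""
    lengths = [max(length, 1) for _, length in factors]
    return [0] + list(accumulate(lengths))[:-1]


def access(reference, factors, starts, i):
    """Return character i of the target without decoding the whole string."""
    k = bisect_right(starts, i) - 1  # factor that covers position i
    pos, length = factors[k]
    return pos if length == 0 else reference[pos + (i - starts[k])]


reference = "ACGTACGTTAGC"
target = "ACGTTAGCNACGT"
factors = rlz_compress(reference, target)
starts = build_index(factors)
```

In this sketch the quality of the parse depends entirely on how much of the target can be found in the reference, which is why the choice of reference sequence matters. Random access costs one binary search over the factor offsets followed by a single lookup in the reference.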
