Faster Approximate String Matching for Short Patterns

Computer Science – Data Structures and Algorithms

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

To appear in Theory of Computing Systems

Scientific paper

We study the classical approximate string matching problem, that is, given strings $P$ and $Q$ and an error threshold $k$, find all ending positions of substrings of $Q$ whose edit distance to $P$ is at most $k$. Let $P$ and $Q$ have lengths $m$ and $n$, respectively. On a standard unit-cost word RAM with word size $w \geq \log n$ we present an algorithm using time $$ O(nk \cdot \min(\frac{\log^2 m}{\log n},\frac{\log^2 m\log w}{w}) + n) $$ When $P$ is short, namely, $m = 2^{o(\sqrt{\log n})}$ or $m = 2^{o(\sqrt{w/\log w})}$ this improves the previously best known time bounds for the problem. The result is achieved using a novel implementation of the Landau-Vishkin algorithm based on tabulation and word-level parallelism.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Faster Approximate String Matching for Short Patterns does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Faster Approximate String Matching for Short Patterns, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Faster Approximate String Matching for Short Patterns will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-163117

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.