Computer Science – Distributed – Parallel – and Cluster Computing
Scientific paper
2009-05-12
as: F. Saeed, A. Khokhar, A domain decomposition strategy for alignment of multiple biological sequences on multiprocessor pla
Computer Science
Distributed, Parallel, and Cluster Computing
36 pages, 17 figures, Accepted manuscript in Journal of Parallel and Distributed Computing(JPDC)
Scientific paper
10.1016/j.jpdc.2009.03.006
Multiple Sequences Alignment (MSA) of biological sequences is a fundamental problem in computational biology due to its critical significance in wide ranging applications including haplotype reconstruction, sequence homology, phylogenetic analysis, and prediction of evolutionary origins. The MSA problem is considered NP-hard and known heuristics for the problem do not scale well with increasing number of sequences. On the other hand, with the advent of new breed of fast sequencing techniques it is now possible to generate thousands of sequences very quickly. For rapid sequence analysis, it is therefore desirable to develop fast MSA algorithms that scale well with the increase in the dataset size. In this paper, we present a novel domain decomposition based technique to solve the MSA problem on multiprocessing platforms. The domain decomposition based technique, in addition to yielding better quality, gives enormous advantage in terms of execution time and memory requirements. The proposed strategy allows to decrease the time complexity of any known heuristic of O(N)^x complexity by a factor of O(1/p)^x, where N is the number of sequences, x depends on the underlying heuristic approach, and p is the number of processing nodes. In particular, we propose a highly scalable algorithm, Sample-Align-D, for aligning biological sequences using Muscle system as the underlying heuristic. The proposed algorithm has been implemented on a cluster of workstations using MPI library. Experimental results for different problem sizes are analyzed in terms of quality of alignment, execution time and speed-up.
Khokhar Ashfaq
Saeed Fahad
No associations
LandOfFree
A Domain Decomposition Strategy for Alignment of Multiple Biological Sequences on Multiprocessor Platforms does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.
If you have personal experience with A Domain Decomposition Strategy for Alignment of Multiple Biological Sequences on Multiprocessor Platforms, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and A Domain Decomposition Strategy for Alignment of Multiple Biological Sequences on Multiprocessor Platforms will most certainly appreciate the feedback.
Profile ID: LFWR-SCP-O-336571