ABPSE: DNA Aligner Based on Bit-level Parallelism and the Seed and Extend Strategy

Authors

DOI:

https://doi.org/10.17488/RMIB.40.1.4

Keywords:

DNA, bioinformatics, Myers, seed-and-extend, FM index

Abstract

DNA alignment is a key process in the assembly of genomes from the millions of short reads produced by massively parallel sequencing machines. This process usually relies on algorithms of high spatial and temporal complexity, which take hours to deliver results and require tens of GB of RAM. This has prompted the search for new algorithms and strategies that achieve shorter runtimes with a minimal memory footprint. In this article, we present ABPSE, a new DNA aligner that combines the Ferragina and Manzini algorithm (the FM index) and the Myers algorithm by means of the seed-and-extend strategy. In the seeding stage, the FM index allows rapid calculation of the regions with a high probability of alignment. In the extension stage, the Myers algorithm refines the alignment using bit-vector operations that compute several cells of the dynamic programming matrix simultaneously. The results show 96.1% of reads correctly aligned, a speedup of 2.45x relative to BWA-SW, and a memory footprint of only 7.6 GB when aligning against the entire human genome.
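As an illustration of the bit-level parallelism mentioned for the extension stage, the following is a minimal sketch of Myers' bit-vector approximate matching in Python. It is not the ABPSE implementation: the function name `myers_end_scores` is hypothetical, and the sketch assumes unit costs for substitutions, insertions, and deletions, with the pattern (read) allowed to start anywhere in the text (reference region). Each loop iteration updates a whole column of the dynamic programming matrix with a handful of bitwise operations.

```python
def myers_end_scores(pattern, text):
    """Return, for each text position j, the minimum edit distance between
    `pattern` and any substring of `text` ending at j (hypothetical helper)."""
    m = len(pattern)
    if m == 0:
        return [0] * len(text)

    # peq[c] has bit i set when pattern[i] == c.
    peq = {}
    for i, c in enumerate(pattern):
        peq[c] = peq.get(c, 0) | (1 << i)

    mask = (1 << m) - 1      # keep only the m low bits (Python ints are unbounded)
    high = 1 << (m - 1)      # bit of the last pattern row
    pv, mv = mask, 0         # vertical deltas of +1 / -1 in the current column
    score = m                # value in the last row of the current column
    scores = []

    for c in text:
        eq = peq.get(c, 0)
        xv = eq | mv
        xh = ((((eq & pv) + pv) ^ pv) | eq) & mask
        ph = (mv | ~(xh | pv)) & mask   # horizontal deltas of +1
        mh = pv & xh                    # horizontal deltas of -1

        # The horizontal delta in the last row updates the running score.
        if ph & high:
            score += 1
        elif mh & high:
            score -= 1

        # Shift in a 0 for the top row: matches may start at any text position.
        ph = (ph << 1) & mask
        mh = (mh << 1) & mask
        pv = (mh | ~(xv | ph)) & mask
        mv = ph & xv
        scores.append(score)

    return scores


if __name__ == "__main__":
    ends = myers_end_scores("ACGT", "TTACGTTT")
    best = min(range(len(ends)), key=ends.__getitem__)
    print("best end position:", best, "edit distance:", ends[best])
```

With a fixed machine word (for example 64 bits) the same updates process up to 64 matrix rows per operation; Python's arbitrary-precision integers stand in for that word here, so the sketch shows the logic rather than the performance.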

Published

2018-12-04

How to Cite

Pacheco-Bautista, D., Martínez-Oviedo, J., Carreño-Aguilera, R., Algredo-Badillo, I., & Sánchez-Sánchez, S. (2018). ABPSE: DNA Aligner Based on Bit-level Parallelism and the Seed and Extend Strategy. Revista Mexicana De Ingenieria Biomedica, 40(1), 1–13. https://doi.org/10.17488/RMIB.40.1.4

Issue

Section

Research Articles
