Perfect Hashing Structures for Parallel Similarity Searches

Tuan Tu Tran; Mathieu Giraud; Jean-Stéphane Varré

doi:10.1109/IPDPSW.2015.105

Communication Dans Un Congrès Année : 2015

Perfect Hashing Structures for Parallel Similarity Searches

(1) , (2, 3) , (3, 2)

1
2
3

Tuan Tu Tran

Fonction : Auteur

Johannes Gutenberg - Universität Mainz = Johannes Gutenberg University

Mathieu Giraud

Fonction : Auteur
PersonId : 279
IdHAL : magiraud
ORCID : 0000-0003-2741-8047
IdRef : 094640610

Bioinformatics and Sequence Analysis

Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189

Jean-Stéphane Varré

Fonction : Auteur
PersonId : 152
IdHAL : jeanstephanevarre
ORCID : 0000-0001-6322-0519
IdRef : 060904208

Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189

Bioinformatics and Sequence Analysis

Résumé

Seed-based heuristics have proved to be efficient for studying similarity between genetic databases with billions of base pairs. This paper focuses on algorithms and data structures for the filtering phase in seed-based heuristics, with an emphasis on efficient parallel GPU/manycores implementa- tion. We propose a 2-stage index structure which is based on neighborhood indexing and perfect hashing techniques. This structure performs a filtering phase over the neighborhood regions around the seeds in constant time and avoid as much as possible random memory accesses and branch divergences. Moreover, it fits particularly well on parallel SIMD processors, because it requires intensive but homogeneous computational operations. Using this data structure, we developed a fast and sensitive OpenCL prototype read mapper.

Mots clés

OpenCL GPU parallelism perfect hash function read mapper seed-based heuristics

Domaines

Bio-informatique [q-bio.QM] Calcul parallèle, distribué et partagé [cs.DC]

Mathieu Giraud : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01153893

Soumis le : mercredi 20 mai 2015-16:02:26

Dernière modification le : mardi 9 avril 2024-11:18:02

Dates et versions

hal-01153893 , version 1 (20-05-2015)

Identifiants

HAL Id : hal-01153893 , version 1
DOI : 10.1109/IPDPSW.2015.105

Citer

Tuan Tu Tran, Mathieu Giraud, Jean-Stéphane Varré. Perfect Hashing Structures for Parallel Similarity Searches. International Workshop on High Performance Computational Biology (HiCOMB 2015) / International Parallel and Distributed Processing Symposium (IPDPS 2015), 2015, Hyderabad, India. pp.332-341, ⟨10.1109/IPDPSW.2015.105⟩. ⟨hal-01153893⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA CRISTAL INRIA2 CRISTAL-BONSAI UNIV-LILLE ANR

385 Consultations

0 Téléchargements

Perfect Hashing Structures for Parallel Similarity Searches

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager