ExBLAS: Reproducible and Accurate BLAS Library - INRIA - Institut National de Recherche en Informatique et en Automatique
Conference poster, Year: 2015

ExBLAS: Reproducible and Accurate BLAS Library

Abstract

Because floating-point operations are non-associative and parallel architectures schedule work dynamically, obtaining a bitwise reproducible floating-point result across multiple executions of the same code, on different or even similar parallel architectures, is challenging. We address the problem of reproducibility in the context of fundamental linear algebra operations, like the ones included in the Basic Linear Algebra Subprograms (BLAS) library, and propose algorithms that yield both reproducible and accurate (rounded to nearest) results. We present implementations of these reproducible and accurate algorithms for the BLAS routines in parallel environments such as Intel server CPUs, Intel Xeon Phi, and both NVIDIA and AMD GPUs. We show that the performance of our implementations is comparable to that of the standard ones.
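As a small illustration of the problem described in the abstract, the following Python sketch shows how the order of additions changes a naive floating-point sum, and how a correctly rounded summation removes that dependence. It uses the standard library's `math.fsum` as a stand-in for an accurate summation; this is an assumption made for illustration only and is not the algorithm or the API of ExBLAS. The helper `naive_sum` and the sample data are hypothetical.

```python
import math
import random


def naive_sum(values):
    """Left-to-right accumulation; the result depends on the order of the terms."""
    total = 0.0
    for v in values:
        total += v
    return total


# The same four numbers in two different orders, as a dynamic scheduler
# might deliver them to a parallel reduction.
values = [1e16, 1.0, -1e16, 1.0]
reordered = [1e16, -1e16, 1.0, 1.0]

naive_a = naive_sum(values)      # 1.0: the first 1.0 is absorbed by 1e16
naive_b = naive_sum(reordered)   # 2.0: the large terms cancel first

# math.fsum returns the correctly rounded sum, so the order does not matter.
exact_a = math.fsum(values)      # 2.0
exact_b = math.fsum(reordered)   # 2.0

# The same property on a larger input with widely varying magnitudes
# (hypothetical data): every shuffle must produce the same bits.
rng = random.Random(0)
data = [rng.uniform(-1, 1) * 10.0 ** rng.randint(-8, 8) for _ in range(1000)]
results = set()
for _ in range(20):
    rng.shuffle(data)
    results.add(math.fsum(data))

print(naive_a, naive_b, exact_a, exact_b, len(results))
```

The point of the sketch is only that a result rounded to nearest from the exact sum is, by construction, independent of evaluation order, which is why accuracy and reproducibility can be obtained together.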
Main file
tikzp.pdf (386.33 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-01140280, version 1 (8 April 2015)

Identifiers

  • HAL Id: hal-01140280, version 1

Cite

Roman Iakymchuk, Stef Graillat, Caroline Collange, David Defour. ExBLAS: Reproducible and Accurate BLAS Library. RAIM: Rencontres Arithmétiques de l’Informatique Mathématique, Apr 2015, Rennes, France. 7ème Rencontre Arithmétique de l'Informatique Mathématique, 2015. ⟨hal-01140280⟩
