Performance Analysis and Optimization of the Tiled Cholesky Factorization on NUMA Machines - INRIA - Institut National de Recherche en Informatique et en Automatique Accéder directement au contenu
Communication Dans Un Congrès Année : 2012

Performance Analysis and Optimization of the Tiled Cholesky Factorization on NUMA Machines

Résumé

We discuss some performance issues of the tiled Cholesky factorization on non-uniform memory access-time (NUMA) shared memory machines. We show how to optimize thread placement and data placement in order to achieve performance gain up to 50% compared to state-of-the-art libraries such as Plasma or MKL.
Fichier principal
Vignette du fichier
jeannot.pdf (428.24 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00772790 , version 1 (11-01-2013)

Identifiants

  • HAL Id : hal-00772790 , version 1

Citer

Emmanuel Jeannot. Performance Analysis and Optimization of the Tiled Cholesky Factorization on NUMA Machines. PAAP 2012 - IEEE International Symposium on Parallel Architectures, Algorithms and Programming, Dec 2012, Taipei, Taiwan. ⟨hal-00772790⟩
99 Consultations
159 Téléchargements

Partager

Gmail Facebook X LinkedIn More