Concurrent Number Cruncher : An Efficient Sparse Linear Solver on the GPU - INRIA - Institut National de Recherche en Informatique et en Automatique Accéder directement au contenu
Communication Dans Un Congrès Année : 2007

Concurrent Number Cruncher : An Efficient Sparse Linear Solver on the GPU

Luc Buatois
  • Fonction : Auteur
  • PersonId : 758906
  • IdRef : 137447272
Bruno Lévy

Résumé

A wide class of geometry processing and PDE resolution methods needs to solve a linear system, where the non-zero pattern of the matrix is dictated by the connectivity matrix of the mesh. The advent of GPUs with their ever-growing amount of parallel horsepower makes them a tempting resource for such numerical computations. This can be helped by new APIs (CTM from ATI and CUDA from NVIDIA) which give a direct access to the multithreaded computational resources and associated memory bandwidth of GPUs; CUDA even provides a BLAS implementation but only for dense matrices (CuBLAS). However, existing GPU linear solvers are restricted to specific types of matrices, or use non-optimal compressed row storage strategies. By combining recent GPU programming techniques with supercomputing strategies (namely block compressed row storage and register blocking), we implement a sparse generalpurpose linear solver which outperforms leading-edge CPU counterparts (MKL / ACML).
Fichier principal
Vignette du fichier
HPCC_number_cruncher.pdf (3.45 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

inria-00186833 , version 1 (12-11-2007)

Identifiants

Citer

Luc Buatois, Guillaume Caumon, Bruno Lévy. Concurrent Number Cruncher : An Efficient Sparse Linear Solver on the GPU. High Performance Computation Conference - HPCC'07, University of Houston, Sep 2007, Houston, United States. pp.358-371, ⟨10.1007/978-3-540-75444-2_37⟩. ⟨inria-00186833⟩
185 Consultations
362 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More