High performance BLAS formulation of the multipole-to-local operator in the Fast Multipole Method - INRIA - Institut National de Recherche en Informatique et en Automatique Accéder directement au contenu
Article Dans Une Revue SIAM Journal on Scientific Computing Année : 2005

High performance BLAS formulation of the multipole-to-local operator in the Fast Multipole Method

Résumé

The multipole-to-local (M2L) operator is the most time-consuming part of the far field computation in the Fast Multipole Method. Its natural expression, though commonly used, does not respect a sharp error bound: we here first prove the correctness of a second but less efficient writing. We then propose the use of BLAS (Basic Linear Algebra Subprograms) to speed up its computation for these two expressions. The more efficient level 3 BLAS are obtained through recopies but this additional cost can be avoided thanks to special data storages. This BLAS version is finally compared, theorically and practically with uniform distributions, to other M2L improvements such as block FFT and rotations. When considering runtime, extra memory storage and numerical stability, the BLAS version appears as the best one.
Fichier principal
Vignette du fichier
CoulaudFortinRoman05.pdf (419.81 Ko) Télécharger le fichier

Dates et versions

inria-00000957 , version 1 (19-12-2005)
inria-00000957 , version 2 (04-01-2007)

Identifiants

  • HAL Id : inria-00000957 , version 1

Citer

Olivier Coulaud, Pierre Fortin, Jean Roman. High performance BLAS formulation of the multipole-to-local operator in the Fast Multipole Method. SIAM Journal on Scientific Computing, 2005. ⟨inria-00000957v1⟩
214 Consultations
256 Téléchargements

Partager

Gmail Facebook X LinkedIn More