On Numerical Resiliency in Numerical Linear Algebra Solvers - INRIA - Institut National de Recherche en Informatique et en Automatique
Conference paper, Year: 2015

On Numerical Resiliency in Numerical Linear Algebra Solvers

Abstract

In this talk we will discuss possible numerical remedies to survive data loss in some numerical linear algebra solvers, namely Krylov subspace linear solvers and some widely used eigensolvers. We will present a new class of application-level numerical fault tolerance algorithms that require no extra resources, i.e., computational units or computing time, when no fault occurs. Assuming that a separate mechanism ensures fault detection, we propose numerical algorithms to extract relevant information from the data still available after a fault. After data extraction, a well-chosen part of the missing data is regenerated through interpolation strategies to constitute meaningful inputs for numerically restarting the algorithm. We have designed these methods, called interpolation-restart techniques, for the solution of linear systems and for eigensolvers. We will also present some preliminary investigations into soft error detection, again at the application level, in the conjugate gradient framework. Finally, we will present the numerous open questions that we are facing, which we hope will lead to fruitful discussions.
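The abstract does not spell out the interpolation strategies, so the following is only a minimal sketch of one plausible variant for a linear system Ax = b, not necessarily the method presented in the talk. It assumes that the matrix A and right-hand side b survive the fault (or can be reloaded), that only some entries of the current iterate x are lost, and that the diagonal block of A associated with the lost entries is nonsingular. The lost entries are regenerated by solving the small local system A[lost, lost] x_lost = b[lost] - A[lost, kept] x[kept], and a plain conjugate gradient loop is then restarted from the repaired iterate. The names `interpolate_lost_entries` and `conjugate_gradient` are hypothetical and chosen for illustration only.

```python
import numpy as np


def interpolate_lost_entries(A, b, x, lost):
    """Regenerate the entries of x listed in `lost` from the surviving ones.

    Assumption (not from the abstract): A and b are intact after the fault
    and the block A[lost, lost] is nonsingular.
    """
    n = A.shape[0]
    lost = np.asarray(lost, dtype=int)
    kept = np.setdiff1d(np.arange(n), lost)
    # Rows of Ax = b restricted to the lost indices, surviving entries moved
    # to the right-hand side.
    rhs = b[lost] - A[np.ix_(lost, kept)] @ x[kept]
    x_new = np.array(x, dtype=float)
    x_new[lost] = np.linalg.solve(A[np.ix_(lost, lost)], rhs)
    return x_new


def conjugate_gradient(A, b, x0, max_iters, tol=1e-10):
    """Textbook conjugate gradient for symmetric positive definite A."""
    x = np.array(x0, dtype=float)
    r = b - A @ x
    p = r.copy()
    rs = r @ r
    b_norm = np.linalg.norm(b)
    for _ in range(max_iters):
        if np.sqrt(rs) <= tol * b_norm:
            break
        Ap = A @ p
        alpha = rs / (p @ Ap)
        x += alpha * p
        r -= alpha * Ap
        rs_new = r @ r
        p = r + (rs_new / rs) * p
        rs = rs_new
    return x


# Illustrative run on a 1D Laplacian (symmetric positive definite).
n = 50
A = 2.0 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)
b = np.ones(n)

x_before = conjugate_gradient(A, b, np.zeros(n), max_iters=10)

# Simulated fault: entries 20..29 of the iterate are lost.
lost = np.arange(20, 30)
x_faulty = x_before.copy()
x_faulty[lost] = 0.0

# Interpolate the lost entries, then restart the solver from the repaired iterate.
x_repaired = interpolate_lost_entries(A, b, x_faulty, lost)
x_final = conjugate_gradient(A, b, x_repaired, max_iters=200)

print("final residual norm:", np.linalg.norm(b - A @ x_final))
```

In this sketch the fault-free path runs the unmodified solver, so the recovery code costs nothing until a fault is reported, which matches the property claimed in the abstract. For symmetric positive definite A, this particular interpolation makes the residual vanish on the lost rows and cannot increase the A-norm of the error relative to the pre-fault iterate.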
No file deposited

Dates and versions

hal-01162627, version 1 (11-06-2015)

Identifiers

  • HAL Id: hal-01162627, version 1

Cite

Emmanuel Agullo, Luc Giraud, Pablo Salas, Emrullah Fatih Yetkin, Mawussi Zounon. On Numerical Resiliency in Numerical Linear Algebra Solvers. Salishan Conference on High-Speed Computing, DOE laboratories, Apr 2015, Salishan, United States. ⟨hal-01162627⟩