A new flexible Checkpoint/Restart model - INRIA - Institut National de Recherche en Informatique et en Automatique
Report (Research Report) Year: 2008

A new flexible Checkpoint/Restart model

Abstract

The use of new-generation computing platforms such as computational grids and desktop grids introduces challenging new problems. In particular, because of the huge number of processors involved, security and fault tolerance are key issues that must be taken into account. Coordinated checkpointing is one of the most popular techniques for dealing with failures on such platforms. Application-directed checkpointing for fault tolerance puts a heavy strain on the storage system and on communications. This results in large overheads on application execution times, which severely impact performance and scalability. This work presents a new model of a coordinated checkpoint/restart mechanism for several types of computing platforms. Its main feature is that it is independent of the failure law, which makes it very flexible. We show that such a model may be used to determine the optimal periodic checkpoint interval and to reduce the checkpoint overhead through mathematical analysis of reliability. Moreover, unlike most existing checkpointing models, the proposed model can take a variable checkpoint cost into account. Finally, we report experiments based on simulations for random failure distributions corresponding to the two most popular laws, namely the Poisson process and the Weibull law.
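To make the idea of a periodic checkpoint interval concrete, the following is a minimal simulation sketch, not the model proposed in the report. It assumes a fixed checkpoint cost, failure-free restarts, and failures that form a renewal process with either exponential (Poisson process) or Weibull inter-failure times. The starting interval uses Young's classical first-order approximation, which is a swapped-in textbook formula derived for exponential failures. All function and parameter names are hypothetical.

```python
import math
import random


def young_interval(checkpoint_cost, mtbf):
    """Young's first-order approximation: sqrt(2 * C * MTBF).

    Classical formula for exponential failures; NOT the report's model.
    """
    return math.sqrt(2.0 * checkpoint_cost * mtbf)


def simulate_makespan(work, interval, checkpoint_cost, restart_cost, draw_failure):
    """Simulate one run and return the wall-clock time needed to finish `work`.

    Assumptions (illustrative only): constant checkpoint cost, no checkpoint
    after the final segment, and no failure during the restart itself.
    """
    clock = 0.0
    saved = 0.0                       # work protected by the last checkpoint
    time_to_failure = draw_failure()  # remaining time until the next failure
    while saved < work:
        segment = min(interval, work - saved)
        is_last = saved + segment >= work
        attempt = segment + (0.0 if is_last else checkpoint_cost)
        if time_to_failure >= attempt:
            # Segment and its checkpoint complete before the next failure.
            clock += attempt
            time_to_failure -= attempt
            saved += segment
        else:
            # Failure: the unsaved segment is lost, pay the restart cost,
            # then draw a fresh inter-failure time (renewal assumption).
            clock += time_to_failure + restart_cost
            time_to_failure = draw_failure()
    return clock


def exponential_failures(mtbf):
    """Factory for exponential inter-failure times with the given mean."""
    return lambda rng: (lambda: rng.expovariate(1.0 / mtbf))


def weibull_failures(mtbf, shape):
    """Factory for Weibull inter-failure times, scaled to the given mean."""
    scale = mtbf / math.gamma(1.0 + 1.0 / shape)
    return lambda rng: (lambda: rng.weibullvariate(scale, shape))


def mean_makespan(work, interval, checkpoint_cost, restart_cost,
                  make_draw, runs=2000, seed=0):
    """Average the simulated makespan over `runs` independent runs."""
    rng = random.Random(seed)
    draw = make_draw(rng)
    total = sum(
        simulate_makespan(work, interval, checkpoint_cost, restart_cost, draw)
        for _ in range(runs)
    )
    return total / runs


if __name__ == "__main__":
    work, cost, restart, mtbf = 1000.0, 5.0, 2.0, 500.0
    base = young_interval(cost, mtbf)
    laws = {
        "exponential": exponential_failures(mtbf),
        "weibull(shape=0.7)": weibull_failures(mtbf, 0.7),
    }
    for name, law in laws.items():
        for factor in (0.5, 1.0, 2.0):
            avg = mean_makespan(work, factor * base, cost, restart, law)
            print(f"{name:20s} interval={factor * base:7.1f} mean makespan={avg:8.1f}")
```

Running the script prints the average completion time for a few candidate intervals under each failure law, which gives a rough empirical view of how the choice of interval trades checkpoint overhead against lost work. Both laws are given the same mean time between failures so that the comparison isolates the effect of the distribution shape.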
Main file
RR-6751.pdf (261.24 KB)
Origin: Files produced by the author(s)

Dates and versions

inria-00348135, version 1 (17-12-2008)

Identifiers

  • HAL Id: inria-00348135, version 1

Cite

Mohamed Slim Bouguerra, Denis Trystram, Thierry Gautier, Jean-Marc Vincent. A new flexible Checkpoint/Restart model. [Research Report] RR-6751, INRIA. 2008. ⟨inria-00348135⟩
