Availability-based methods for distributed storage systems

Anne-Marie Kermarrec; Erwan Le Merrer; Gilles Straub; Alexandre van Kempen

Preprint, Working Paper. Year: 2010


Anne-Marie Kermarrec

Role: Author
PersonId : 830783

As Scalable As Possible: foundations of large scale dynamic distributed systems

Erwan Le Merrer

Role: Author
PersonId : 874498

Technicolor R & I [Cesson Sévigné]

Gilles Straub

Role: Author
PersonId : 879051

Technicolor R & I [Cesson Sévigné]

Alexandre van Kempen

Role: Author
PersonId : 879703

Technicolor R & I [Cesson Sévigné]

Abstract

Distributed storage systems rely heavily on replication to ensure data availability as well as durability. In networked systems subject to intermittent node unavailability, replicas need to be maintained (i.e., replicated and/or relocated upon failure). Repairs are known to be extremely bandwidth-consuming, and it has been shown that, without care, they may significantly congest the system. In this paper, we propose an approach to replica management that accounts for node heterogeneity with respect to availability. We show that, by using the availability history of nodes, the performance of two important facets of distributed storage (replica placement and repair) can be significantly improved. Replica placement relies on choosing nodes with complementary availability patterns, which improves overall data availability. Repairs are scheduled using an adaptive per-node timeout set according to each node's availability, so as to decrease the number of repairs while reaching comparable availability. We propose practical heuristics for both issues. We evaluate our approach through extensive simulations based on real, well-known availability traces. Results clearly show the benefits of our approach with regard to the critical trade-off between data availability, load balancing, and bandwidth consumption.
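The abstract does not spell out the heuristics themselves, so the following is only a minimal illustrative sketch of the two ideas it names, not the authors' method. It assumes that each node's availability history is a list of booleans over fixed time slots, that placement greedily picks the node adding the most not-yet-covered slots, and that the per-node timeout is a quantile of that node's past downtime durations. All function names, parameters, and the quantile rule are hypothetical.

```python
from typing import Dict, List, Sequence


def coverage(history: Dict[str, Sequence[bool]], nodes: Sequence[str]) -> float:
    """Fraction of time slots in which at least one of `nodes` was online."""
    if not history:
        return 0.0
    n_slots = len(next(iter(history.values())))
    if n_slots == 0:
        return 0.0
    up_slots = sum(any(history[n][t] for n in nodes) for t in range(n_slots))
    return up_slots / n_slots


def place_replicas(history: Dict[str, Sequence[bool]], k: int) -> List[str]:
    """Assumed heuristic: greedily choose k nodes with complementary uptime."""
    if not history:
        return []
    n_slots = len(next(iter(history.values())))
    covered = [False] * n_slots
    chosen: List[str] = []
    candidates = set(history)

    def gain(node: str) -> int:
        # Number of slots this node would newly cover.
        return sum(1 for up, c in zip(history[node], covered) if up and not c)

    while candidates and len(chosen) < k:
        # Sorting makes tie-breaking deterministic (first name wins).
        best = max(sorted(candidates), key=gain)
        chosen.append(best)
        candidates.remove(best)
        covered = [c or up for c, up in zip(covered, history[best])]
    return chosen


def adaptive_timeout(downtimes: Sequence[float], default: float,
                     quantile: float = 0.9) -> float:
    """Assumed rule: wait as long as the given quantile of past downtimes
    before declaring the node failed and triggering a repair."""
    if not downtimes:
        return default  # no history yet: fall back to a system-wide timeout
    ordered = sorted(downtimes)
    index = min(len(ordered) - 1, int(quantile * len(ordered)))
    return ordered[index]


if __name__ == "__main__":
    hist = {
        "a": [True, True, False, False],
        "b": [True, True, True, False],
        "c": [False, False, False, True],
    }
    picked = place_replicas(hist, 2)
    print(picked, coverage(hist, picked))
    print(adaptive_timeout([10, 1, 3, 2], default=60.0, quantile=0.5))
```

In this toy history, the greedy choice pairs node "b" with node "c" and covers every slot, whereas the two individually most available nodes ("a" and "b") would cover only three of the four slots. A node that usually comes back after short outages gets a short timeout, while a node with long habitual outages gets a longer one, which is the intuition behind avoiding unnecessary repairs.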

Keywords

Distributed storage systems Availability timeout

Domains

Distributed, Parallel, and Cluster Computing [cs.DC]; Data Structures and Algorithms [cs.DS]

Main file

Availability.pdf (202.75 KB)

Origin: Files produced by the author(s)


https://hal.science/hal-00521034

Submitted on: Friday, 24 September 2010, 17:48:42

Last modified on: Friday, 24 March 2023, 14:52:53

Long-term archiving on: Tuesday, 18 September 2012, 13:05:35

Dates and versions

hal-00521034 , version 1 (24-09-2010)

hal-00521034 , version 2 (07-03-2011)

Identifiers

HAL Id : hal-00521034 , version 1

Cite

Anne-Marie Kermarrec, Erwan Le Merrer, Gilles Straub, Alexandre van Kempen. Availability-based methods for distributed storage systems. 2010. ⟨hal-00521034v1⟩


