Conference paper. Proceedings of the 38th International Conference on Machine Learning. Year: 2021

Fast Stochastic Bregman Gradient Methods: Sharp Analysis and Variance Reduction

Abstract

We study the problem of minimizing a relatively-smooth convex function using stochastic Bregman gradient methods. We first prove the convergence of Bregman Stochastic Gradient Descent (BSGD) to a region that depends on the noise (magnitude of the gradients) at the optimum. In particular, BSGD with a constant step-size converges to the exact minimizer when this noise is zero (interpolation setting, in which the data is fit perfectly). Otherwise, when the objective has a finite sum structure, we show that variance reduction can be used to counter the effect of noise. In particular, fast convergence to the exact minimizer can be obtained under additional regularity assumptions on the Bregman reference function. We illustrate the effectiveness of our approach on two key applications of relative smoothness: tomographic reconstruction with Poisson noise and statistical preconditioning for distributed optimization.
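
As an illustration only (not code from the paper), the following is a minimal Python sketch of a generic Bregman stochastic gradient step, assuming a differentiable reference function h whose gradient is invertible. The Burg-entropy choice of h shown below is just one common example, relevant to Poisson-type problems such as the tomographic reconstruction application mentioned in the abstract; the names bsgd_step, grad_h, and grad_h_inv are hypothetical.

    import numpy as np

    def bsgd_step(x, g, grad_h, grad_h_inv, step_size):
        # One Bregman stochastic gradient step:
        #   x_next = argmin_u <g, u> + (1/step_size) * D_h(u, x),
        # which has the closed form below when grad_h is invertible.
        return grad_h_inv(grad_h(x) - step_size * g)

    # Example reference function (an assumption, not the paper's prescribed choice):
    # Burg entropy h(x) = -sum(log x), whose Bregman divergence is the
    # Itakura-Saito distance, often used for Poisson-type inverse problems.
    grad_h = lambda x: -1.0 / x          # gradient of h, componentwise
    grad_h_inv = lambda y: -1.0 / y      # inverse mirror map (requires y < 0)

    # Tiny illustration on toy data (iterates stay positive here).
    x = np.array([1.0, 2.0, 0.5])
    g = np.array([0.3, -0.1, 0.2])       # a stochastic gradient estimate
    x_next = bsgd_step(x, g, grad_h, grad_h_inv, step_size=0.1)

With h(x) = ||x||^2 / 2 the same step reduces to plain SGD; the choice of h is what adapts the method to the geometry of relatively-smooth objectives.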

Dates and versions

hal-03383164, version 1 (18-10-2021)


Cite

Radu-Alexandru Dragomir, Hadrien Hendrikx, Mathieu Even. Fast Stochastic Bregman Gradient Methods: Sharp Analysis and Variance Reduction. ICML 2021 - 38th International Conference on Machine Learning, Jul 2021, virtual, United States. pp. 2815-2825. ⟨hal-03383164⟩