Conference paper - Year: 2017

Mirror descent learning in continuous games

Abstract

Online Mirror Descent (OMD) is an important and widely used class of adaptive learning algorithms that enjoys good regret guarantees. It is therefore natural to study the evolution of the joint action in a multi-agent decision process (typically modeled as a repeated game) where every agent employs an OMD algorithm. This well-motivated question has received much attention in the literature at the intersection of learning and games. However, most existing work has focused on the time average of the joint iterates. In this paper, we tackle a harder problem of practical utility, particularly in online decision making: the convergence of the last iterate when all agents make decisions according to OMD. We introduce an equilibrium stability notion called variational stability (VS) and show that in variationally stable games, the last iterate of OMD converges to the set of Nash equilibria. We also extend the OMD learning dynamics to a more general setting where exact gradients are not available, and show that the (now random) last iterate of OMD converges to the set of Nash equilibria almost surely.
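For intuition, below is a minimal numerical sketch (not from the paper) of OMD with the Euclidean regularizer, i.e. projected gradient ascent, in a two-player quadratic game whose gradient field is strongly monotone and hence variationally stable. In this line of work, variational stability of the Nash set X* roughly requires ⟨v(x), x − x*⟩ ≤ 0 for every joint action x and every x* ∈ X*, where v is the profile of the players' individual payoff gradients. The game, parameter values, and noise model here are illustrative assumptions, not the paper's experiments.

```python
import numpy as np

# Hypothetical Cournot-style duopoly (illustrative, not from the paper):
# player i chooses x_i >= 0 and earns u_i(x) = x_i * (a - x_1 - x_2) - c * x_i.
# The profile of individual payoff gradients,
#   v(x) = (a - c - 2*x_1 - x_2,  a - c - x_1 - 2*x_2),
# is strongly monotone (its Jacobian [[-2, -1], [-1, -2]] is negative
# definite), so the game is variationally stable with a unique Nash
# equilibrium at x_1* = x_2* = (a - c) / 3.
a, c = 2.0, 0.5
x_star = (a - c) / 3.0

def v(x):
    """Each player's gradient of their own payoff at the joint action x."""
    return np.array([a - c - 2.0 * x[0] - x[1],
                     a - c - x[0] - 2.0 * x[1]])

def mirror_step(x, g, eta):
    """OMD step with the Euclidean regularizer: projected gradient ascent."""
    return np.clip(x + eta * g, 0.0, a)  # project back onto [0, a]^2

rng = np.random.default_rng(0)
x = np.array([0.1, 1.4])                 # arbitrary initial joint action
for t in range(1, 5001):
    eta = 1.0 / t**0.6                   # Robbins-Monro: sum eta = inf, sum eta^2 < inf
    g_hat = v(x) + 0.1 * rng.standard_normal(2)  # noisy (unbiased) gradient feedback
    x = mirror_step(x, g_hat, eta)

print(f"last iterate: {x}, Nash equilibrium: ({x_star:.3f}, {x_star:.3f})")
```

With these vanishing step sizes the last iterate settles near the unique Nash equilibrium despite the gradient noise, illustrating the almost-sure last-iterate convergence established in the paper; a non-Euclidean mirror map (e.g., the entropic regularizer on a simplex) would follow the same template with a different mirror_step.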
No file deposited

Dates and versions

hal-01643341, version 1 (21-11-2017)

Identifiers

  • HAL Id: hal-01643341, version 1

Cite

Zhengyuan Zhou, Panayotis Mertikopoulos, Aris L. Moustakas, Nicholas Bambos, Peter W. Glynn. Mirror descent learning in continuous games. CDC '17: Proceedings of the 56th IEEE Annual Conference on Decision and Control, Dec 2017, Melbourne, Australia. ⟨hal-01643341⟩