Refined Lower Bounds for Adversarial Bandits

Sébastien Gerchinovitz; Tor Lattimore

Communication Dans Un Congrès Année : 2016

Refined Lower Bounds for Adversarial Bandits

(1) , (2)

1
2

Sébastien Gerchinovitz

Fonction : Auteur
PersonId : 12754
IdHAL : sebastien-gerchinovitz
IdRef : 156515776

Institut de Mathématiques de Toulouse UMR5219

Tor Lattimore

Fonction : Auteur

University of Alberta

Résumé

We provide new lower bounds on the regret that must be suffered by adversarial bandit algorithms. The new results show that recent upper bounds that either (a) hold with high-probability or (b) depend on the total loss of the best arm or (c) depend on the quadratic variation of the losses, are close to tight. Besides this we prove two impossibility results. First, the existence of a single arm that is optimal in every round cannot improve the regret in the worst case. Second, the regret cannot scale with the effective range of the losses. In contrast, both results are possible in the full-information setting.

Mots clés

bandits lower bounds online learning

Domaines

Statistiques [math.ST] Machine Learning [stat.ML] Apprentissage [cs.LG]

Fichier principal

GL16-banditlowerbounds.pdf (223.47 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Sébastien Gerchinovitz : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01319572

Soumis le : samedi 25 février 2017-16:52:30

Dernière modification le : mercredi 24 avril 2024-09:58:56

Dates et versions

hal-01319572 , version 1 (20-05-2016)

hal-01319572 , version 2 (25-02-2017)

Identifiants

HAL Id : hal-01319572 , version 2
ARXIV : 1605.07416

Citer

Sébastien Gerchinovitz, Tor Lattimore. Refined Lower Bounds for Adversarial Bandits. NIPS 2016, Dec 2016, Barcelona, Spain. pp.1198--1206. ⟨hal-01319572v2⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-TLSE2 CNRS INSA-TOULOUSE IMT UT1-CAPITOLE INSA-GROUPE ANR UNIV-UT3 UT3-TOULOUSEINP

176 Consultations

184 Téléchargements

Refined Lower Bounds for Adversarial Bandits

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager