Diversity-Preserving K-Armed Bandits, Revisited - INRIA - Institut National de Recherche en Informatique et en Automatique
Preprint / Working Paper, Year: 2024


Abstract

We consider the bandit-based framework for diversity-preserving recommendations introduced by Celis et al. (2019), who approached it in the case of a polytope mainly by a reduction to the setting of linear bandits. We design a UCB algorithm that exploits the specific structure of the setting and show that it enjoys bounded distribution-dependent regret in the natural case where the optimal mixed actions put some probability mass on all arms (i.e., when diversity is desirable). The regret lower bounds we provide show that otherwise, at least when the model is mean-unbounded, regret must be suffered. We also discuss an example beyond the special case of polytopes.
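To make the setting concrete, the following is a minimal sketch of a UCB-style rule over mixed actions, under the assumption of a particularly simple polytope of the form {p : p[a] ≥ lower_bounds[a], sum(p) = 1}: the mandatory mass is always placed on every arm, and the remaining free mass goes to the arm with the highest UCB index. This is an illustrative toy, not the algorithm analyzed in the paper; the function name, signature, and polytope choice are assumptions made here.

```python
import math
import random

def diversity_ucb(K, lower_bounds, reward_fns, T, seed=0):
    """Toy UCB-style rule over mixed actions constrained to the simple
    polytope {p : p[a] >= lower_bounds[a], sum(p) = 1}.

    Illustrative sketch only -- NOT the algorithm of the paper. Each
    round, every arm keeps its mandatory mass lower_bounds[a], and the
    free mass goes to the arm with the largest UCB index.
    """
    rng = random.Random(seed)
    counts = [0] * K       # number of pulls of each arm
    sums = [0.0] * K       # cumulative reward of each arm
    free_mass = 1.0 - sum(lower_bounds)
    assert free_mass >= 0.0, "lower bounds must be feasible"
    history = []
    for t in range(1, T + 1):
        # UCB indices; unplayed arms get +inf to force initial exploration
        ucb = [
            sums[a] / counts[a] + math.sqrt(2 * math.log(t) / counts[a])
            if counts[a] > 0 else float("inf")
            for a in range(K)
        ]
        best = max(range(K), key=lambda a: ucb[a])
        # Mixed action respecting the diversity constraints
        p = [lower_bounds[a] for a in range(K)]
        p[best] += free_mass
        # Sample an arm from the mixed action, observe a reward
        arm = rng.choices(range(K), weights=p)[0]
        r = reward_fns[arm](rng)
        counts[arm] += 1
        sums[arm] += r
        history.append((arm, r))
    return p, history
```

For instance, with three Bernoulli arms and a floor of 0.1 on each arm, every returned mixed action keeps at least 10% mass on every arm, which is the sense in which diversity is preserved.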
Main file: GHLS24--Diversity-preserving.pdf (399.38 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-02957485, version 1 (05-10-2020)
hal-02957485, version 2 (05-04-2024)

License

Attribution

Identifiers

Cite

Hédi Hadiji, Sébastien Gerchinovitz, Jean-Michel Loubes, Gilles Stoltz. Diversity-Preserving K-Armed Bandits, Revisited. 2024. ⟨hal-02957485v2⟩
279 views
99 downloads
