Calibrated Fairness in Bandits

Yang Liu; Goran Radanovic; Christos Dimitrakakis; Debmalya Mandal; David C Parkes

Preprint, Working Paper. Year: 2018



Yang Liu

Role: Author

Harvard John A. Paulson School of Engineering and Applied Sciences

Goran Radanovic

Role: Author

Harvard John A. Paulson School of Engineering and Applied Sciences

Christos Dimitrakakis

Role: Author
PersonId : 6538
IdHAL : christos-dimitrakakis
ORCID : 0000-0002-5367-5189

Sequential Learning

Debmalya Mandal

Role: Author
PersonId : 1040441

Harvard John A. Paulson School of Engineering and Applied Sciences

David C Parkes

Role: Author

Harvard John A. Paulson School of Engineering and Applied Sciences

Abstract

We study fairness within the stochastic multi-armed bandit (MAB) decision-making framework. We adapt the fairness framework of "treating similar individuals similarly" [5] to this setting. Here, an 'individual' corresponds to an arm, and two arms are 'similar' if they have a similar quality distribution. First, we adopt a smoothness constraint: if two arms have a similar quality distribution, then the probability of selecting each arm should be similar. In addition, we define the fairness regret, which corresponds to the degree to which an algorithm is not calibrated, where perfect calibration requires that the probability of selecting an arm is equal to the probability with which the arm has the best quality realization. We show that a variation on Thompson sampling satisfies smooth fairness for total variation distance, and give an Õ((kT)^{2/3}) bound on fairness regret. This complements prior work [12], which protects an on-average better arm from being less favored. We also explain how to extend our algorithm to the dueling bandit setting.
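The calibration notion in the abstract can be made concrete with a small numerical sketch. The code below is an illustration only, not the authors' algorithm: it estimates by Monte Carlo the probability that each arm has the best quality realization, then measures how far a selection rule falls short of those probabilities. The function names, the two Gaussian arms, and the "sum of positive shortfalls" gap are assumptions chosen for this example; the paper's exact definition of fairness regret may differ.

```python
import random


def best_realization_probs(samplers, n_draws=20000, seed=0):
    """Monte Carlo estimate of the probability that each arm's realized
    quality is the highest among all arms (hypothetical helper)."""
    rng = random.Random(seed)
    wins = [0] * len(samplers)
    for _ in range(n_draws):
        draws = [sample(rng) for sample in samplers]
        top = max(draws)
        # Break ties uniformly at random among the arms achieving the maximum.
        winners = [i for i, d in enumerate(draws) if d == top]
        wins[rng.choice(winners)] += 1
    return [w / n_draws for w in wins]


def calibration_gap(select_probs, target_probs):
    """Assumed per-round gap: total amount by which arms are selected
    less often than their probability of being the best realization."""
    return sum(max(t - p, 0.0) for p, t in zip(select_probs, target_probs))


# Two hypothetical arms with Gaussian quality: means 0.0 and 0.5, unit variance.
arms = [
    lambda rng: rng.gauss(0.0, 1.0),
    lambda rng: rng.gauss(0.5, 1.0),
]

target = best_realization_probs(arms)

# A rule that always picks the arm with the higher mean is not calibrated:
# the weaker arm still has the best realization a sizeable fraction of the time.
greedy = [0.0, 1.0]

print("calibrated selection probabilities:", target)
print("gap of greedy rule:", calibration_gap(greedy, target))
print("gap of calibrated rule:", calibration_gap(target, target))
```

Under these assumptions, a rule that always plays the arm with the higher mean leaves a positive gap, while selecting arms with exactly the estimated probabilities leaves none. This matches the abstract's statement that perfect calibration equates selection probability with the probability of having the best quality realization.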

Domains

Machine Learning [stat.ML]; Other [stat.ML]

Main file

1707.01875.pdf (178.64 KB)

Origin: Files produced by the author(s)


https://inria.hal.science/hal-01953314

Submitted on: Wednesday, December 12, 2018, 18:12:24

Last modified on: Friday, April 19, 2024, 09:47:16

Long-term archiving on: Wednesday, March 13, 2019, 15:31:12

Dates and versions

hal-01953314 , version 1 (12-12-2018)

Identifiers

HAL Id: hal-01953314, version 1

Cite

Yang Liu, Goran Radanovic, Christos Dimitrakakis, Debmalya Mandal, David C Parkes. Calibrated Fairness in Bandits. 2018. ⟨hal-01953314⟩


Collections

CNRS INRIA CRISTAL INRIA2 CRISTAL-SEQUEL UNIV-LILLE

167 Views

64 Downloads
