Finite Continuum-Armed Bandits

Solenne Gaucher

Communication Dans Un Congrès Année : 2020

Finite Continuum-Armed Bandits

(1, 2)

1
2

Solenne Gaucher

Fonction : Auteur
PersonId : 1079870

Laboratoire de Mathématiques d'Orsay

Statistique mathématique et apprentissage

Résumé

We consider a situation where an agent has $T$ ressources to be allocated to a larger number $N$ of actions. Each action can be completed at most once and results in a stochastic reward with unknown mean. The goal of the agent is to maximize her cumulative reward. Non trivial strategies are possible when side information on the actions is available, for example in the form of covariates. Focusing on a nonparametric setting, where the mean reward is an unknown function of a one-dimensional covariate, we propose an optimal strategy for this problem. Under natural assumptions on the reward function, we prove that the optimal regret scales as $O(T^{1/3})$ up to poly-logarithmic factors when the budget $T$ is proportional to the number of actions $N$. When $T$ becomes small compared to $N$, a smooth transition occurs. When the ratio $T/N$ decreases from a constant to $N^{-1/3}$, the regret increases progressively up to the $O(T^{1/2})$ rate encountered in continuum-armed bandits.

Domaines

Statistiques [math.ST] Autres [stat.ML]

Fichier principal

main.pdf (465.11 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Solenne Gaucher : Connectez-vous pour contacter le contributeur

https://hal.science/hal-02975304

Soumis le : lundi 2 novembre 2020-09:07:16

Dernière modification le : jeudi 14 mars 2024-03:14:08

Dates et versions

hal-02975304 , version 1 (22-10-2020)

hal-02975304 , version 2 (02-11-2020)

Identifiants

HAL Id : hal-02975304 , version 2
ARXIV : 2010.12236

Citer

Solenne Gaucher. Finite Continuum-Armed Bandits. Advances in Neural Information Processing Systems 33 (NeurIPS 2020), 2020, Online, Canada. pp.3186--3196. ⟨hal-02975304v2⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA LM-ORSAY INRIA2 UNIV-PARIS-SACLAY GS-MATHEMATIQUES GS-COMPUTER-SCIENCE

55 Consultations

246 Téléchargements

Finite Continuum-Armed Bandits

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager