Bandit Theory meets Compressed Sensing for high dimensional Stochastic Linear Bandit

Alexandra Carpentier; Rémi Munos

Report (Technical Report), Year: 2012

Alexandra Carpentier

Role: Author
PersonId : 910455

Sequential Learning

Rémi Munos

Role: Author
PersonId : 836863

Sequential Learning

Abstract

We consider a linear stochastic bandit problem where the dimension $K$ of the unknown parameter $\theta$ is larger than the sampling budget $n$. In such cases, it is in general impossible to derive sub-linear regret bounds, since usual linear bandit algorithms have a regret in $O(K\sqrt{n})$. In this paper we assume that $\theta$ is $S$-sparse, i.e., has at most $S$ nonzero components, and that the space of arms is the unit ball for the $\|\cdot\|_2$ norm. We combine ideas from Compressed Sensing and Bandit Theory and derive algorithms with regret bounds in $O(S\sqrt{n})$.
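To make the setting in the abstract concrete, the following is a minimal simulation sketch of the problem only, not the authors' algorithm. The function names, the Gaussian noise model and its level, the specific values of $K$, $n$, and $S$, and the uniformly random baseline policy are all assumptions chosen for illustration. The final comparison of $K\sqrt{n}$ and $S\sqrt{n}$ against $n$ ignores constants and logarithmic factors.

```python
import math
import random


def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def make_sparse_theta(K, S, rng):
    """Hypothetical helper: K-dimensional vector with S nonzero entries, unit l2 norm."""
    theta = [0.0] * K
    for i in rng.sample(range(K), S):
        theta[i] = rng.gauss(0.0, 1.0)
    norm = math.sqrt(dot(theta, theta))
    return [v / norm for v in theta]


def random_unit_arm(K, rng):
    """Hypothetical baseline policy: a uniformly random direction on the unit sphere."""
    x = [rng.gauss(0.0, 1.0) for _ in range(K)]
    norm = math.sqrt(dot(x, x))
    return [v / norm for v in x]


def pull(arm, theta, rng, noise_std=0.1):
    """Noisy linear reward <arm, theta> + noise (Gaussian noise is an assumption)."""
    return dot(arm, theta) + rng.gauss(0.0, noise_std)


def pseudo_regret(arms, theta):
    """Cumulative gap to the best arm of the unit ball, theta / ||theta||_2."""
    best = math.sqrt(dot(theta, theta))
    return sum(best - dot(arm, theta) for arm in arms)


rng = random.Random(0)
K, n, S = 200, 50, 5  # dimension larger than budget, few nonzero components

theta = make_sparse_theta(K, S, rng)
arms = [random_unit_arm(K, rng) for _ in range(n)]
rewards = [pull(arm, theta, rng) for arm in arms]

baseline_regret = pseudo_regret(arms, theta)      # random play
oracle_regret = pseudo_regret([theta] * n, theta)  # always playing the best arm

# Rates from the abstract, constants and log factors ignored.
dense_rate = K * math.sqrt(n)
sparse_rate = S * math.sqrt(n)

print("regret of random play:", baseline_regret)
print("K*sqrt(n) vs n:", dense_rate, n)
print("S*sqrt(n) vs n:", sparse_rate, n)
```

With these illustrative values, $K\sqrt{n}$ exceeds the budget $n$, so a bound of that order says nothing beyond the trivial linear one, while $S\sqrt{n}$ falls below $n$. This is the regime in which the sparsity assumption matters.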

Domains

Statistics [math.ST] Theory [stat.TH] Machine Learning [cs.LG]

Main file

SparseBanditsAISTATS.pdf (1.22 MB)

Origin: Files produced by the author(s)

https://inria.hal.science/hal-00659731

Submitted on: Wednesday, 16 May 2012, 17:01:28

Last modified on: Friday, 24 March 2023, 14:52:55

Long-term archiving on: Friday, 17 August 2012, 02:36:53

Dates and versions

hal-00659731 , version 1 (13-01-2012)

hal-00659731 , version 2 (16-05-2012)

Identifiers

HAL Id : hal-00659731 , version 2
ARXIV : 1205.4094

Cite

Alexandra Carpentier, Rémi Munos. Bandit Theory meets Compressed Sensing for high dimensional Stochastic Linear Bandit. [Technical Report] 2012. ⟨hal-00659731v2⟩

Collections

UNIV-LILLE3 CNRS INRIA INSMI LAGIS INRIA2 LARA

342 views

304 downloads