A multiple-play bandit algorithm applied to recommender systems - Université Toulouse III - Paul Sabatier - Toulouse INP Accéder directement au contenu
Communication Dans Un Congrès Année : 2015

A multiple-play bandit algorithm applied to recommender systems

Résumé

For several web tasks such as ad placement or e-commerce, recommender systems must recommend multiple items to their users-such problems can be modeled as bandits with multiple plays. State-of-the-art methods require running as many single-play bandit algorithms as there are items to recommend. On the contrary, some recent theoretical work in the machine learning literature designed new algorithms to address the multiple-play case directly. These algorithms were proved to have strong theoretical guarantees. In this paper we compare one such multiple-play algorithm with previous methods. We show on two real-world datasets that the multiple-play algorithm we use converges to equivalent values but learns about three times faster than state-of-the-art methods. We also show that carefully adapting these earlier methods can improve their performance.
Fichier principal
Vignette du fichier
louedec_18744.pdf (314.65 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-04077707 , version 1 (21-04-2023)

Identifiants

  • HAL Id : hal-04077707 , version 1
  • OATAO : 18744

Citer

Jonathan Louëdec, Max Chevalier, Josiane Mothe, Aurélien Garivier, Sébastien Gerchinovitz. A multiple-play bandit algorithm applied to recommender systems. 28th International Florida Artificial Intelligence Research Society (FLAIRS 2015), May 2015, Hollywood, United States. pp.67-72. ⟨hal-04077707⟩
158 Consultations
50 Téléchargements

Partager

Gmail Facebook X LinkedIn More