Computing Whittle (and Gittins) Index in Subcubic Time - INRIA - Institut National de Recherche en Informatique et en Automatique Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2022

Computing Whittle (and Gittins) Index in Subcubic Time

Résumé

Whittle index is a generalization of Gittins index that provides very efficient allocation rules for restless multiarmed bandits. In this paper, we develop an algorithm to test the indexability and compute the Whittle indices of any finite-state Markovian bandit problem. This algorithm works in the discounted and non-discounted cases. As a byproduct, it can also be used to compute Gittins index. Our algorithm builds on three tools: (1) a careful characterization of Whittle index that allows one to compute recursively the th smallest index from the (− 1)th smallest, and to test indexability, (2) the use of Sherman-Morrison formula to make this recursive computation efficient, and (3) a sporadic use of fast matrix inversion and multiplication to obtain a subcubic complexity. We show that an efficient use of the Sherman-Morrison formula leads to an algorithm that computes Whittle index in (2⇑3) 3 + (3) arithmetic operations, where is the number of states of the arm. The careful use of fast matrix multiplication leads to the first subcubic algorithm to compute Whittle (or Gittins) index. By using the current fastest matrix multiplications, our algorithm runs in (2.5286). We also conduct a series of experiments that demonstrate that our algorithm is very efficient in practice and can compute indices of Markov chains with several thousands of states in a few seconds.
Fichier principal
Vignette du fichier
compute_whittle_idx.pdf (729.39 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03602458 , version 1 (09-03-2022)
hal-03602458 , version 2 (29-04-2022)
hal-03602458 , version 3 (08-12-2022)
hal-03602458 , version 4 (17-04-2023)
hal-03602458 , version 5 (20-06-2023)

Identifiants

Citer

Nicolas Gast, Bruno Gaujal, Kimang Khun. Computing Whittle (and Gittins) Index in Subcubic Time. 2022. ⟨hal-03602458v2⟩
188 Consultations
330 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More