hal-00826056v1  Conference papers
Sertan Girgin, Philippe Preux. Feature discovery in reinforcement learning using genetic programming
11th European Conference on Genetic Programming (EUROGP), 2008, Naples, Italy. pp.218-229
inria-00124685v1  Journal articles
Rémi Munos. Performance Bounds in Lp norm for Approximate Value Iteration
SIAM Journal on Control and Optimization, Society for Industrial and Applied Mathematics, 2007
hal-00826055v1  Conference papers
Sertan Girgin, Philippe Preux. Basis Expansion in Natural Actor Critic Methods
European Workshop on Reinforcement Learning, Jun 2008, Villeneuve d'Ascq, France. pp.110-123
inria-00136198v2  Reports
Pierre-Arnaud Coquelin, Rémi Munos. Bandit Algorithms for Tree Search
[Research Report] RR-6141, INRIA. 2007, pp.20
tel-01297386v1  Theses
Victor Gabillon. Budgeted Classification-based Policy Iteration
Machine Learning [stat.ML]. Université Lille 1, 2014. English
hal-00772060v1  Journal articles
Alessandro Lazaric, Mohammad Ghavamzadeh, Rémi Munos. Finite-Sample Analysis of Least-Squares Policy Iteration
Journal of Machine Learning Research, Microtome Publishing, 2012, 13, pp.3041-3074
hal-03123999v1  Conference papers
Mathieu Seurin, Florian Strub, Philippe Preux, Olivier Pietquin. A Machine of Few Words: Interactive Speaker Recognition with Reinforcement Learning
Conference of the International Speech Communication Association (INTERSPEECH), Oct 2020, Shanghai, China. ⟨10.21437/Interspeech.2020-2892⟩
hal-01146187v1  Conference papers
Julien Audiffren, Michal Valko, Alessandro Lazaric, Mohammad Ghavamzadeh. Maximum Entropy Semi-Supervised Inverse Reinforcement Learning
International Joint Conference on Artificial Intelligence, Jul 2015, Buenos Aires, Argentina
hal-01629733v2  Conference papers
Lilian Besson, Emilie Kaufmann. Multi-Player Bandits Revisited
Algorithmic Learning Theory, Mehryar Mohri; Karthik Sridharan, Apr 2018, Lanzarote, Spain
hal-00759822v1  Conference papers
Adrien Couetoux, Olivier Teytaud, Hassen Doghmen. Learning a Move-Generator for Upper Confidence Trees
International Computer Symposium 2012, Dec 2012, Hualien, Taiwan
inria-00384970v1  Conference papers
Julien Perez, Cécile Germain, Balázs Kégl, Charles Loomis. Responsive Elastic Computing
2009 ACM/IEEE Conference on International Conference on Autonomic Computing, Jun 2009, Barcelona, Spain. pp.55-64, ⟨10.1145/1555301.1555311⟩
tel-01816069v1  Theses
Marc Abeille. Exploration-Exploitation with Thompson Sampling in Linear Systems
Mathematics [math]. Université de Lille 1, 2017. English
hal-00776608v2  Journal articles
Mohammad Ghavamzadeh, Yaakov Engel, Michal Valko. Bayesian Policy Gradient and Actor-Critic Algorithms
Journal of Machine Learning Research, Microtome Publishing, 2016, 17 (66), pp.1-53
hal-01234427v1  Conference papers
Aristide Tossou, Christos Dimitrakakis. Algorithms for Differentially Private Multi-Armed Bandits
AAAI 2016, Feb 2016, Phoenix, Arizona, United States
hal-02305105v3  Conference papers
Yannis Flet-Berliac, Philippe Preux. MERL: Multi-Head Reinforcement Learning
Deep Reinforcement Learning Workshop, NeurIPS, Dec 2019, Vancouver, Canada
hal-01401513v1  Journal articles
Alessandro Lazaric, Mohammad Ghavamzadeh, Rémi Munos. Analysis of Classification-based Policy Iteration Algorithms
Journal of Machine Learning Research, Microtome Publishing, 2016, 17, pp.1-30
hal-01569447v1  Conference papers
Julian Krebs, Tommaso Mansi, Hervé Delingette, Li Zhang, Florin Ghesu et al. Robust non-rigid registration through agent-based action learning
Medical Image Computing and Computer Assisted Interventions (MICCAI), Sep 2017, Quebec, Canada. pp.344-352, ⟨10.1007/978-3-319-66182-7_40⟩
hal-01252744v1  Conference papers
Topalidou Meropi, Daisuke Kase, Thomas Boraud, Nicolas P. Rougier. The formation of habits: a computational model mixing reinforcement and Hebbian learning
The Multi-disciplinary Conference on Reinforcement Learning and Decision Making (RLDM 2015), Jun 2015, Edmonton, Canada
hal-01350651v1  Conference papers
Matthieu Zimmer, Yann Boniface, Alain Dutech. Neural Fitted Actor-Critic
ESANN 2016 - Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning, Apr 2016, Bruges, Belgium
hal-00793610v1  Journal articles
Pierre-Yves Oudeyer, Frédéric Kaplan, Véréna Hafner. Intrinsic Motivation for Autonomous Mental Development
IEEE Transactions on Evolutionary Computation, Institute of Electrical and Electronics Engineers, 2007, 11 (2), pp.265-286. ⟨10.1109/TEVC.2006.890271⟩