|
||
---|---|---|
hal-00826056v1
Conference papers
Feature discovery in reinforcement learning using genetic programming 11th European Conference on Genetic Programming (EUROGP), 2008, Naples, Italy. pp.218-229 |
||
inria-00124685v1
Journal articles
Performance Bounds in Lp norm for Approximate Value Iteration SIAM Journal on Control and Optimization, Society for Industrial and Applied Mathematics, 2007 |
||
hal-00826055v1
Conference papers
Basis Expansion in Natural Actor Critic Methods European Workshop on Reinforcement Learning, Jun 2008, Villeneuve d'Ascq, France. pp.110-123 |
||
inria-00120882v4
Reports
Finite Time Bounds for Sampling-Based Fitted Value Iteration [Research Report] 2007, pp.46 |
||
inria-00136198v2
Reports
Bandit Algorithms for Tree Search [Research Report] RR-6141, INRIA. 2007, pp.20 |
||
inria-00272368v2
Reports
Incremental Basis Function Expansion in Reinforcement Learning using Cascade-Correlation Networks [Research Report] RR-6505, INRIA. 2008 |
||
tel-01297386v1
Theses
Budgeted Classification-based Policy Iteration Machine Learning [stat.ML]. Universite Lille 1, 2014. English |
||
inria-00187997v2
Reports
Feature Discovery in Reinforcement Learning using Genetic Programming [Research Report] INRIA. 2007 |
||
hal-00772060v1
Journal articles
Finite-Sample Analysis of Least-Squares Policy Iteration Journal of Machine Learning Research, Microtome Publishing, 2012, 13, pp.3041-3074 |
||
hal-01337332v2
Preprints, Working Papers, ...
One critic, two actors: evidence for covert learning in the basal ganglia 2017 |
||
hal-03123999v1
Conference papers
A Machine of Few Words Interactive Speaker Recognition with Reinforcement Learning Conference of the International Speech Communication Association (INTERSPEECH), Oct 2020, Shanghai, China. ⟨10.21437/Interspeech.2020-2892⟩ |
||
hal-01146187v1
Conference papers
Maximum Entropy Semi-Supervised Inverse Reinforcement Learning International Joint Conference on Artificial Intelligence, Jul 2015, Bueons Aires, Argentina |
||
hal-01629733v2
Conference papers
Multi-Player Bandits Revisited Algorithmic Learning Theory, Mehryar Mohri; Karthik Sridharan, Apr 2018, Lanzarote, Spain |
||
hal-01840022v1
Preprints, Working Papers, ...
SMPyBandits: an Experimental Framework for Single and Multi-Players Multi-Arms Bandits Algorithms in Python 2018 |
||
tel-01749537v1
Theses
Paradigme de pot de miel adaptatif permettant d'étudier et d'évaluer le comportement et compétences des pirates informatiques Other [cs.OH]. Institut National Polytechnique de Lorraine, 2011. English. ⟨NNT : 2011INPL037N⟩ |
||
hal-00759822v1
Conference papers
Learning a Move-Generator for Upper Con dence Trees International Computer Symposium 2012, Dec 2012, Hualien, Taiwan |
||
inria-00384970v1
Conference papers
Responsive Elastic Computing 2009 ACM/IEEE Conference on International Conference on Autonomic Computing, Jun 2009, Barcelone, Spain. pp.55-64, ⟨10.1145/1555301.1555311⟩ |
||
tel-01816069v1
Theses
Exploration-Exploitation with Thompson Sampling in Linear Systems Mathematics [math]. Université de Lille 1, 2017. English |
||
hal-00776608v2
Journal articles
Bayesian Policy Gradient and Actor-Critic Algorithms Journal of Machine Learning Research, Microtome Publishing, 2016, 17 (66), pp.1-53 |
||
hal-01234427v1
Conference papers
Algorithms for Differentially Private Multi-Armed Bandits AAAI 2016, Feb 2016, Phoenix, Arizona, United States |
||
hal-02305105v3
Conference papers
MERL: Multi-Head Reinforcement Learning Deep Reinforcement Learning Workshop, NeurIPS, Dec 2019, Vancouver, Canada |
||
hal-02177808v1
Preprints, Working Papers, ...
Active Roll-outs in MDP with Irreversible Dynamics 2019 |
||
hal-01401513v1
Journal articles
Analysis of Classification-based Policy Iteration Algorithms Journal of Machine Learning Research, Microtome Publishing, 2016, 17, pp.1 - 30 |
||
hal-02295705v3
Preprints, Working Papers, ...
High-Dimensional Control Using Generalized Auxiliary Tasks 2019 |
||
hal-01404304v1
Journal articles
Curiosity and Intrinsic Motivation for Autonomous Machine Learning ERCIM News, ERCIM, 2016, 107, pp.2 |
||
hal-01569447v1
Conference papers
Robust non-rigid registration through agent-based action learning Medical Image Computing and Computer Assisted Interventions (MICCAI), Sep 2017, Quebec, Canada. pp.344-352, ⟨10.1007/978-3-319-66182-7_40⟩ |
||
hal-01252744v1
Conference papers
The formation of habits: a computational model mixing reinforcement and Hebbian learning The Multi-disciplinary Conference on Reinforcement Learning and Decision Making (RLDM 2015), Jun 2015, Edmonton, Canada |
||
hal-01350651v1
Conference papers
Neural Fitted Actor-Critic ESANN 2016 - Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning, Apr 2016, Bruges, Belgium |
||
|
||
hal-00793610v1
Journal articles
Intrinsic Motivation for Autonomous Mental Development IEEE Transactions on Evolutionary Computation, Institute of Electrical and Electronics Engineers, 2007, 11 (2), pp.265-286. ⟨10.1109/TEVC.2006.890271⟩ |
||
|