Learning Aquatic Locomotion with Animats - Université Toulouse III - Paul Sabatier - Toulouse INP
Conference paper, Year: 2017

Learning Aquatic Locomotion with Animats

Abstract

One of the challenges of researching spiking neural networks (SNNs) is translating temporal spiking behavior into classic controller output. While many encoding schemes exist to facilitate this translation, there are few benchmarks for neural networks that inherently use a temporal controller. In this work, we consider the common reinforcement learning problem of animat locomotion in an environment suited for evaluating SNNs. Using this problem, we explore novel methods of reward distribution and how they impact learning. Hebbian learning, in the form of spike-timing-dependent plasticity (STDP), is modulated by a dopamine signal and affected by reward-induced neural activity. Different reward strategies are parameterized, and the Covariance Matrix Adaptation Evolution Strategy (CMA-ES) is used to find the best strategies for fixed animat morphologies. The contribution of this work is twofold: to cast the problem of animat locomotion in a form directly applicable to simple temporal controllers, and to demonstrate novel methods for reward-modulated Hebbian learning.
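To make the idea of dopamine-modulated STDP concrete, the sketch below shows one common textbook formulation: spike pairings feed a decaying eligibility trace, and the synaptic weight changes only when a reward (dopamine) signal arrives while the trace is nonzero. This is an illustration under stated assumptions, not the rule used in the paper. The function names (`stdp_window`, `run_synapse`), the exponential kernel, the 1 ms time step, and all parameter values are hypothetical choices made for this example.

```python
import math


def stdp_window(dt, a_plus=0.01, a_minus=0.012, tau=20.0):
    """Pairwise STDP kernel (assumed exponential form).

    dt is t_post - t_pre in milliseconds. A presynaptic spike that
    precedes the postsynaptic spike (dt > 0) gives a positive value;
    the reverse order (dt < 0) gives a negative value.
    """
    if dt > 0:
        return a_plus * math.exp(-dt / tau)
    if dt < 0:
        return -a_minus * math.exp(dt / tau)
    return 0.0


def run_synapse(pairings, rewards, steps, w0=0.5, tau_e=200.0, lr=1.0):
    """Simulate a single synapse in 1 ms steps.

    pairings: dict mapping a time step to the dt of a spike pair seen then.
    rewards:  dict mapping a time step to the dopamine level at that step.
    Returns the final weight, clipped to [0, 1].
    """
    w, trace = w0, 0.0
    decay = math.exp(-1.0 / tau_e)  # per-step decay of the eligibility trace
    for t in range(steps):
        trace *= decay
        if t in pairings:
            trace += stdp_window(pairings[t])  # STDP writes to the trace only
        # The weight moves only when dopamine is present.
        w += lr * rewards.get(t, 0.0) * trace
        w = min(1.0, max(0.0, w))
    return w


if __name__ == "__main__":
    # Causal pairing at step 10, reward delivered at step 50.
    print(run_synapse({10: 5.0}, {50: 1.0}, steps=100))
    # Same pairing, no reward: the weight stays at its initial value.
    print(run_synapse({10: 5.0}, {}, steps=100))
```

In a setup like the one the abstract describes, values such as `tau_e`, `lr`, and the timing of reward delivery would be candidates for the parameters that an optimizer such as CMA-ES tunes; which parameters the authors actually optimize is not stated on this page.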
Main file
wilson_22084.pdf (1.47 MB)
Origin: Files produced by the author(s)

Dates and versions

hal-02860849, version 1 (08-06-2020)

Identifiers

  • HAL Id: hal-02860849, version 1
  • OATAO: 22084

Cite

Dennis G. Wilson, Jean Disset, Sylvain Cussat-Blanc, Yves Duthen, Hervé Luga. Learning Aquatic Locomotion with Animats. ECAL 2017: the 14th European Conference on Artificial Life, Sep 2017, Lyon, France. pp.585-592. ⟨hal-02860849⟩
