A bag-of-features framework for incremental learning of speech invariants in unsegmented audio streams - INRIA - Institut National de Recherche en Informatique et en Automatique Accéder directement au contenu
Communication Dans Un Congrès Année : 2010

A bag-of-features framework for incremental learning of speech invariants in unsegmented audio streams

Résumé

We introduce a computational framework that allows a machine to bootstrap flexible autonomous learning of speech recognition skills. Technically, this framework shall en- able a robot to incrementally learn to recog- nize speech invariants from unsegmented au- dio streams and with no prior knowledge of phonetics. To achieve this, we import the bag-of-words/bag-of-features approach from recent research in computer vision, and adapt it to incremental developmental speech pro- cessing. We evaluate an implementation of this framework on a complex speech database.
Fichier principal
Vignette du fichier
mangin.2010.eprirob.pdf (499.42 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

inria-00541802 , version 1 (01-12-2010)

Identifiants

  • HAL Id : inria-00541802 , version 1

Citer

Olivier Mangin, Pierre-Yves Oudeyer, David Filliat. A bag-of-features framework for incremental learning of speech invariants in unsegmented audio streams. Tenth International Conference on Epigenetic Robotics, 2010, Örenäs Slott, Sweden. ⟨inria-00541802⟩
205 Consultations
79 Téléchargements

Partager

Gmail Facebook X LinkedIn More