A bag-of-features framework for incremental learning of speech invariants in unsegmented audio streams

Olivier Mangin; Pierre-Yves Oudeyer; David Filliat

Communication Dans Un Congrès Année : 2010

A bag-of-features framework for incremental learning of speech invariants in unsegmented audio streams

(1) , (1) , (1, 2)

1
2

Olivier Mangin

Fonction : Auteur correspondant
PersonId : 884109

Connectez-vous pour contacter l'auteur

Flowing Epigenetic Robots and Systems

Pierre-Yves Oudeyer

Fonction : Auteur
PersonId : 6675
IdHAL : pyoudeyer
ORCID : 0000-0002-9404-7613
IdRef : 081674481

Flowing Epigenetic Robots and Systems

David Filliat

Fonction : Auteur
PersonId : 45
IdHAL : david-filliat
ORCID : 0000-0002-5739-1618
IdRef : 070072337

Flowing Epigenetic Robots and Systems

Unité d'Électronique et d'informatique

Résumé

We introduce a computational framework that allows a machine to bootstrap flexible autonomous learning of speech recognition skills. Technically, this framework shall en- able a robot to incrementally learn to recog- nize speech invariants from unsegmented au- dio streams and with no prior knowledge of phonetics. To achieve this, we import the bag-of-words/bag-of-features approach from recent research in computer vision, and adapt it to incremental developmental speech pro- cessing. We evaluate an implementation of this framework on a complex speech database.

Domaines

Robotique [cs.RO]

Fichier principal

mangin.2010.eprirob.pdf (499.42 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Pierre Rouanet : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00541802

Soumis le : mercredi 1 décembre 2010-11:41:07

Dernière modification le : mercredi 15 mars 2023-08:50:07

Archivage à long terme le : mercredi 2 mars 2011-03:00:30

Dates et versions

inria-00541802 , version 1 (01-12-2010)

Identifiants

HAL Id : inria-00541802 , version 1

Citer

Olivier Mangin, Pierre-Yves Oudeyer, David Filliat. A bag-of-features framework for incremental learning of speech invariants in unsegmented audio streams. Tenth International Conference on Epigenetic Robotics, 2010, Örenäs Slott, Sweden. ⟨inria-00541802⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

ENSTA INRIA ENSTA_U2IS INRIA2

205 Consultations

79 Téléchargements

A bag-of-features framework for incremental learning of speech invariants in unsegmented audio streams

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager