Dynamic Bayesian Networks for multi-band automatic speech recognition

Khalid Daoudi; Dominique Fohr; Christophe Antoine

doi:10.1016/S0885-2308(03)00011-1

Article Dans Une Revue Computer Speech and Language Année : 2003

Dynamic Bayesian Networks for multi-band automatic speech recognition

(1) , (1) , (1)

Khalid Daoudi

Fonction : Auteur
PersonId : 1329075
ORCID : 0000-0003-3536-1060
IdRef : 115483500

Analysis, perception and recognition of speech

Dominique Fohr

Fonction : Auteur
PersonId : 15652
IdHAL : dominique-fohr
IdRef : 031092942

Analysis, perception and recognition of speech

Christophe Antoine

Fonction : Auteur
PersonId : 1035714
IdHAL : christophe-antoine

Analysis, perception and recognition of speech

Résumé

This paper presents a new approach to multi-band automatic speech recognition which has the advantage to overcome many limitations of classical muti-band systems. The principle of this new approach is to build a speech model in the time-frequency domain using the formalism of dynamic Bayesian networks. In contrast to classical multi-band modeling, this formalism leads to a probabilistic speech model which allows communications between the different sub-bands and, consequently, no recombination step is required in recognition. We develop efficient learning and decoding algorithms both for isolated and continuous speech recognition. We present illustrative experiments on isolated and connected digit recognition tasks. These experiments show that the this new approach is very promising in the field of noisy speech recognition.

Mots clés

Bayesian networks Speech recognition

Reconnaissance de la parole Réseaux bayésiens

Domaines

Autre [cs.OH]

Fichier principal

00099530.pdf (19.82 Mo)

Origine : Fichiers produits par l'(les) auteur(s)

Publications Loria : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00099530

Soumis le : vendredi 13 novembre 2020-13:16:29

Dernière modification le : jeudi 1 février 2024-10:05:51

Archivage à long terme le : dimanche 14 février 2021-18:54:18

Dates et versions

inria-00099530 , version 1 (13-11-2020)

Identifiants

HAL Id : inria-00099530 , version 1
DOI : 10.1016/S0885-2308(03)00011-1

Citer

Khalid Daoudi, Dominique Fohr, Christophe Antoine. Dynamic Bayesian Networks for multi-band automatic speech recognition. Computer Speech and Language, 2003, 17 (2-3), pp.263-285. ⟨10.1016/S0885-2308(03)00011-1⟩. ⟨inria-00099530⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-RENNES1 CNRS INRIA IRISA UNIV-LORRAINE INRIA2 LORIA UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM

71 Consultations

38 Téléchargements

Dynamic Bayesian Networks for multi-band automatic speech recognition

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager