Speech recognition with speech density estimation by the Dirichlet process mixture

Kenko Ota; Emmanuel Duflos; Philippe Vanheeghe; Masuzo Yanagida

doi:10.1109/ICASSP.2008.4517919

Conference paper. Year: 2008



Kenko Ota

Role: Author
PersonId : 848160

Laboratoire d'Automatique, Génie Informatique et Signal

Emmanuel Duflos

Role: Author

Sequential Learning

LAGIS-SI

Philippe Vanheeghe

Role: Author

Sequential Learning

LAGIS-SI

Masuzo Yanagida

Role: Author

Abstract

This paper presents a method for modeling speech signal distributions with Dirichlet process mixtures (DPM) and for estimating noise sequences with particle filtering. In real environments, the speech recognition rate degrades severely because of environmental noise, reflected waves, and similar effects. Improving the recognition rate therefore requires a technique for estimating noise sequences. In this paper, the distribution of clean speech is modeled with a DPM instead of the traditional Gaussian mixture model (GMM). Speech signal sequences are generated according to the means and covariances drawn from the DPM, and noise signal sequences are then estimated with a particle filter. The proposed method with an extended Kalman filter (EKF) significantly improves the speech recognition rate in the low-SNR region. With an unscented Kalman filter (UKF), better results are also obtained in the high-SNR region.
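As a rough illustration of what replacing a fixed-size GMM with a DPM means, the sketch below draws mixture weights from a truncated stick-breaking construction of a Dirichlet process and then samples scalar values from the resulting Gaussian mixture. It is not the authors' implementation: the scalar (one-dimensional) features, the truncation level, the base measure for the means and standard deviations, and all function names are assumptions made only for this example.

```python
import random


def stick_breaking_weights(alpha, truncation, rng):
    """Truncated stick-breaking weights for a Dirichlet process.

    alpha is the concentration parameter; smaller values concentrate
    the mass on fewer mixture components.
    """
    weights = []
    remaining = 1.0
    for _ in range(truncation - 1):
        fraction = rng.betavariate(1.0, alpha)  # share of the remaining stick
        weights.append(remaining * fraction)
        remaining *= 1.0 - fraction
    weights.append(remaining)  # leftover mass, so the weights sum to 1
    return weights


def sample_dpm(n_samples, alpha=1.0, truncation=20, seed=0):
    """Draw samples from a truncated Dirichlet process mixture of Gaussians."""
    rng = random.Random(seed)
    weights = stick_breaking_weights(alpha, truncation, rng)
    # Hypothetical base measure: mean ~ N(0, 3^2), std ~ Uniform(0.2, 1.0).
    means = [rng.gauss(0.0, 3.0) for _ in range(truncation)]
    stds = [rng.uniform(0.2, 1.0) for _ in range(truncation)]
    # Pick a component for every sample, then draw from that Gaussian.
    labels = rng.choices(range(truncation), weights=weights, k=n_samples)
    samples = [rng.gauss(means[k], stds[k]) for k in labels]
    return weights, labels, samples


if __name__ == "__main__":
    weights, labels, samples = sample_dpm(1000, alpha=2.0)
    print("components actually used:", len(set(labels)))
```

Unlike a GMM, the number of components that end up being used is not fixed in advance; it depends on the data size and on the concentration parameter `alpha`. The truncation level only bounds it for computational convenience.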

Keywords

Kalman filtering; Signal processing; Speech enhancement; Speech recognition; Stochastic processes

Domains

Automatic control

Contributor: Philippe Vanheeghe

https://hal.science/hal-00782333

Submitted on: Tuesday, 29 January 2013, 15:23:29

Last modified on: Friday, 24 March 2023, 14:52:56

Dates and versions

hal-00782333 , version 1 (29-01-2013)

Identifiers

HAL Id : hal-00782333 , version 1
DOI : 10.1109/ICASSP.2008.4517919

Cite

Kenko Ota, Emmanuel Duflos, Philippe Vanheeghe, Masuzo Yanagida. Speech recognition with speech density estimation by the Dirichlet process mixture. IEEE International Conference on Acoustics, Speech and Signal Processing, 2008. ICASSP 2008., Mar 2008, Las Vegas, United States. pp.1553 - 1556, ⟨10.1109/ICASSP.2008.4517919⟩. ⟨hal-00782333⟩


Collections

UNIV-LILLE3 CNRS INRIA LAGIS LAGIS-SI INRIA2

