Conference paper, Year: 2015

Uncertainty propagation for noise robust speaker recognition: the case of NIST-SRE

Abstract

Uncertainty propagation is an established approach to handling noisy and reverberant conditions in automatic speech recognition (ASR), but it has received little study for speaker recognition so far. Yu et al. recently proposed to propagate uncertainty to the Baum-Welch (BW) statistics without changing the posterior probability of each mixture component. They obtained good results on a small dataset (YOHO) but little improvement on the NIST-SRE dataset, despite the use of oracle uncertainty estimates. In this paper, we propose to modify the computation of the posterior probability of each mixture component in order to obtain unbiased BW statistics. We show that our approach improves the accuracy of BW statistics on the Wall Street Journal (WSJ) corpus, but again yields little or no improvement on NIST-SRE. We provide a theoretical explanation for this result, which opens the way to more efficient exploitation of uncertainty on NIST-SRE and other large datasets in the future.
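As a rough illustration of what propagating uncertainty to the BW statistics can look like, the sketch below computes zeroth- and first-order statistics for a diagonal-covariance GMM when each enhanced feature vector comes with a diagonal uncertainty variance. It follows the generic uncertainty-decoding recipe (add the feature variance to the model variance, then shrink each feature toward the component mean); it is not taken from the paper and is not claimed to reproduce either Yu et al.'s formulas or the authors'. The function name, argument names, and the `uncertain_posteriors` flag are all hypothetical; the flag only contrasts the two ideas named in the abstract (posteriors left unchanged versus posteriors that account for uncertainty).

```python
import numpy as np


def bw_stats_with_uncertainty(x_hat, var_x, weights, means, variances,
                              uncertain_posteriors=True):
    """Zeroth/first-order BW statistics under Gaussian feature uncertainty.

    Hypothetical helper for illustration only.
    x_hat:     (T, D) enhanced (point-estimate) features
    var_x:     (T, D) diagonal uncertainty variance of each feature
    weights:   (C,)   GMM mixture weights
    means:     (C, D) GMM component means
    variances: (C, D) GMM diagonal component variances
    """
    diff = x_hat[:, None, :] - means[None, :, :]              # (T, C, D)
    total_var = variances[None, :, :] + var_x[:, None, :]     # (T, C, D)

    # Variance used when scoring each frame against each component:
    # either model + uncertainty, or the model variance alone.
    if uncertain_posteriors:
        score_var = total_var
    else:
        score_var = np.broadcast_to(variances[None, :, :], total_var.shape)

    log_lik = -0.5 * np.sum(
        np.log(2.0 * np.pi * score_var) + diff ** 2 / score_var, axis=2)
    log_post = np.log(weights)[None, :] + log_lik             # (T, C)
    log_post -= np.logaddexp.reduce(log_post, axis=1, keepdims=True)
    post = np.exp(log_post)                                   # rows sum to 1

    # Expected clean feature given the observation and the component:
    # a per-dimension Wiener-like shrinkage toward the component mean.
    gain = variances[None, :, :] / total_var
    x_clean = means[None, :, :] + gain * diff                 # (T, C, D)

    n = post.sum(axis=0)                                      # (C,)
    f = np.einsum("tc,tcd->cd", post, x_clean)                # (C, D)
    return n, f


# Toy usage with made-up numbers.
rng = np.random.default_rng(0)
T, C, D = 50, 3, 4
gmm_weights = np.array([0.5, 0.3, 0.2])
gmm_means = rng.normal(size=(C, D))
gmm_vars = np.full((C, D), 0.5)
feats = rng.normal(size=(T, D))
feat_vars = np.full((T, D), 0.1)
n_stat, f_stat = bw_stats_with_uncertainty(
    feats, feat_vars, gmm_weights, gmm_means, gmm_vars)
```

With zero uncertainty the sketch reduces to the usual BW statistics, and with very large uncertainty the posteriors fall back to the mixture weights while the expected features fall back to the component means.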
Main file
UPivector_2015.pdf (379.43 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-01158775, version 1 (02-06-2015)
hal-01158775, version 2 (05-06-2015)
hal-01158775, version 3 (05-08-2015)

Identifiers

  • HAL Id: hal-01158775, version 3

Cite

Dayana Ribas, Emmanuel Vincent, José Ramon Calvo. Uncertainty propagation for noise robust speaker recognition: the case of NIST-SRE. Interspeech 2015, Sep 2015, Dresden, Germany. pp.5. ⟨hal-01158775v3⟩