Probabilistic scoring using decision trees for fast and scalable speaker recognition - INRIA - Institut National de Recherche en Informatique et en Automatique Accéder directement au contenu
Article Dans Une Revue Speech Communication Année : 2009

Probabilistic scoring using decision trees for fast and scalable speaker recognition

Résumé

In the context of fast and low cost speaker recognition, this article investigates several techniques based on decision trees. A new approach is introduced where the trees are used to estimate a score function rather than returning a decision among classes. This technique is developed to approximate the GMM log-likelihood ratio (LLR) score function. On top of this approach, different solutions are derived to improve the accuracy of the proposed trees. The first one studies the quantization of the LLR function to create classification trees on the LLR values. The second one makes use of knowledge on the GMM distribution of the acoustic features in order to build oblique trees. A third extension consists in using a low-complexity score function in each of the tree leaves. Series of comparative experiments are performed on the NIST 2005 speaker recognition evaluation data in order to evaluate the impact of the proposed improvements in terms of efficiency, execution time and algorithmic complexity. Considering a baseline system with an Equal Error Rate (EER) of 9.6% on the NIST 2005 evaluation, the best tree-based configuration achieves an EER of 12.9%, with a computational cost adapted to embedded devices and an execution time suitable for real-time speaker identification.
Fichier principal
Vignette du fichier
article_specom_gonon_decision_trees_ASR_final.pdf (1.3 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

inria-00544959 , version 1 (06-02-2011)

Identifiants

Citer

Gilles Gonon, Frédéric Bimbot, Rémi Gribonval. Probabilistic scoring using decision trees for fast and scalable speaker recognition. Speech Communication, 2009, 51 (11), pp.1065 - 1081. ⟨10.1016/j.specom.2009.02.007⟩. ⟨inria-00544959⟩
207 Consultations
279 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More