Tree-Structured Maximum a Posteriori Adaptation for a Segment-Based Speech Recognition System

Irina Illina

Communication Dans Un Congrès Année : 2002

Tree-Structured Maximum a Posteriori Adaptation for a Segment-Based Speech Recognition System

(1)

Irina Illina

Fonction : Auteur
PersonId : 15663
IdHAL : irina-illina
IdRef : 120731746

Analysis, perception and recognition of speech

Résumé

In this paper, the problem of the adaptation of a speech recognition system to a new environment is addressed. Recently, a Structural Maximum a Posteriori adaptation (SMAP) for a frame-based HMM model adaptation has been developed. In this method, acoustic model pdfs are organised in a tree and the means and variances of the pdfs are adapted using the linear transformations estimated under MAP criteria. In this paper, we extend the SMAP adaptation to a segment-based model: the Mixture Stochastic Trajectory Model (MSTM). SMAP approach is completed by the tree construction driven by adaptation data, a Minimum Description Length (MDL) structure definition of this tree and trajectory and state adaptations. On the Resource Management task, the speaker adaptation and noise adaptation experiments show that the proposed SMAP approach gives a significant improvement compared to unadapted system.

Mots clés

continuous speech recognition model adaptation adaptation du modèle acoustique modèle fondé sur les segments segment-based system reconnaissance de la parole continue

Domaines

Autre [cs.OH]

Publications Loria : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00100756

Soumis le : mardi 26 septembre 2006-14:50:17

Dernière modification le : vendredi 24 mars 2023-14:52:48

Dates et versions

inria-00100756 , version 1 (26-09-2006)

Identifiants

HAL Id : inria-00100756 , version 1

Citer

Irina Illina. Tree-Structured Maximum a Posteriori Adaptation for a Segment-Based Speech Recognition System. 7th International Conference on Spoken Language Processing - ICSLP'02, 2002, Denver, Colorado, USA, 4 p. ⟨inria-00100756⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA UNIV-LORRAINE INRIA2 LORIA

83 Consultations

0 Téléchargements

Tree-Structured Maximum a Posteriori Adaptation for a Segment-Based Speech Recognition System

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager