Tree-Structured Maximum a Posteriori Adaptation for a Segment-Based Speech Recognition System - INRIA - Institut National de Recherche en Informatique et en Automatique Accéder directement au contenu
Communication Dans Un Congrès Année : 2002

Tree-Structured Maximum a Posteriori Adaptation for a Segment-Based Speech Recognition System

Irina Illina

Résumé

In this paper, the problem of the adaptation of a speech recognition system to a new environment is addressed. Recently, a Structural Maximum a Posteriori adaptation (SMAP) for a frame-based HMM model adaptation has been developed. In this method, acoustic model pdfs are organised in a tree and the means and variances of the pdfs are adapted using the linear transformations estimated under MAP criteria. In this paper, we extend the SMAP adaptation to a segment-based model: the Mixture Stochastic Trajectory Model (MSTM). SMAP approach is completed by the tree construction driven by adaptation data, a Minimum Description Length (MDL) structure definition of this tree and trajectory and state adaptations. On the Resource Management task, the speaker adaptation and noise adaptation experiments show that the proposed SMAP approach gives a significant improvement compared to unadapted system.
Fichier non déposé

Dates et versions

inria-00100756 , version 1 (26-09-2006)

Identifiants

  • HAL Id : inria-00100756 , version 1

Citer

Irina Illina. Tree-Structured Maximum a Posteriori Adaptation for a Segment-Based Speech Recognition System. 7th International Conference on Spoken Language Processing - ICSLP'02, 2002, Denver, Colorado, USA, 4 p. ⟨inria-00100756⟩
83 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More