Detailed pronunciation variant modeling for speech transcription - INRIA - Institut National de Recherche en Informatique et en Automatique Accéder directement au contenu
Communication Dans Un Congrès Année : 2010

Detailed pronunciation variant modeling for speech transcription

Denis Jouvet
Dominique Fohr
Irina Illina

Résumé

Modeling pronunciation variants is an important topic for automatic speech recognition. This paper investigates the pronunciation modeling at the lexical level, and presents a detailed modeling of the probabilities of the pronunciation variants. The approach is evaluated on the French ESTER2 corpus, and a significant word error rate reduction is achieved through the use of context and speaking rate dependent modeling of these pronunciation probabilities. A rule-based approach makes it possible to derive a priori probabilities for the pronunciation of words that are not present in the training corpus, and a MAP estimation process yields reliable estimates of the pronunciation variant probabilities.
Fichier non déposé

Dates et versions

inria-00528225 , version 1 (21-10-2010)

Identifiants

  • HAL Id : inria-00528225 , version 1

Citer

Denis Jouvet, Dominique Fohr, Irina Illina. Detailed pronunciation variant modeling for speech transcription. INTERSPEECH, ISCA, Sep 2010, Makuhari, Japan. ⟨inria-00528225⟩
136 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More