On semi-supervised LF-MMI training of acoustic models with limited data

Imran Sheikh; Emmanuel Vincent; Irina Illina

Conference paper, Year: 2020

Imran Sheikh

Role: Author
PersonId : 968903

Speech Modeling for Facilitating Oral-Based Communication

Emmanuel Vincent

Role: Author
PersonId : 1256
IdHAL : emmanuelv
ORCID : 0000-0002-0183-7289
IdRef : 089360176

Speech Modeling for Facilitating Oral-Based Communication

Irina Illina

Role: Author
PersonId : 15663
IdHAL : irina-illina
IdRef : 120731746

Speech Modeling for Facilitating Oral-Based Communication

Abstract

This work investigates semi-supervised training of acoustic models (AM) with the lattice-free maximum mutual information (LF-MMI) objective in practically relevant scenarios with a limited amount of labeled in-domain data. An error detection driven semi-supervised AM training approach is proposed, in which an error detector controls the hypothesized transcriptions or lattices used as LF-MMI training targets on additional unlabeled data. Under this approach, our first method uses a single error-tagged hypothesis whereas our second method uses a modified supervision lattice. These methods are evaluated and compared with existing semi-supervised AM training methods in three different matched or mismatched, limited data setups. Word error recovery rates of 28 to 89% are reported.

Keywords

lattice-free MMI; semi-supervised training; speech recognition; error detection

Domains

Computation and Language [cs.CL]; Machine Learning [cs.LG]

Main file

is20_wsl_310720.pdf (165.03 KB)

Origin: Files produced by the author(s)

Contributor: Emmanuel Vincent

https://inria.hal.science/hal-02907924

Submitted on: Friday, 31 July 2020, 15:19:23

Last modified on: Thursday, 1 February 2024, 10:06:10

Dates and versions

hal-02907924, version 1 (31-07-2020)

Identifiers

HAL Id: hal-02907924, version 1

Cite

Imran Sheikh, Emmanuel Vincent, Irina Illina. On semi-supervised LF-MMI training of acoustic models with limited data. INTERSPEECH 2020, Oct 2020, Shanghai, China. ⟨hal-02907924⟩

Collections

UNIV-RENNES1 CNRS INRIA IRISA GRID5000 UNIV-LORRAINE INRIA2 LORIA LORIA-NLPKD UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES SILECS UR1-MATH-NUM

235 views

619 downloads
