Multilingual Recognition of Non-Native Speech using Acoustic Model Transformation and Pronunciation Modeling

Ghazi Bouselmi; Dominique Fohr; Irina Illina

Article Dans Une Revue International Journal of Speech Technology Année : 2012

Multilingual Recognition of Non-Native Speech using Acoustic Model Transformation and Pronunciation Modeling

(1) , (1) , (1)

Ghazi Bouselmi

Fonction : Auteur
PersonId : 836336

Analysis, perception and recognition of speech

Dominique Fohr

Fonction : Auteur
PersonId : 15652
IdHAL : dominique-fohr
IdRef : 031092942

Analysis, perception and recognition of speech

Irina Illina

Fonction : Auteur
PersonId : 15663
IdHAL : irina-illina
IdRef : 120731746

Analysis, perception and recognition of speech

Résumé

This article presents an approach for the automatic recognition of non-native speech. Some non-native speakers tend to pronounce phonemes as they would in their native language. Model adaptation can improve the recognition rate for non-native speakers, but has difficulties dealing with pronunciation errors like phoneme insertions or substitutions. For these pronunciation mismatches, pronunciation modeling can make the recognition system more robust. Our approach is based on acoustic model transformation and pronunciation modeling for multiple non-native accents. For acoustic model transformation, two approaches are evaluated: MAP and model re-estimation. For pronunciation modeling, confusion rules (alternate pronunciations) are automatically extracted from a small non-native speech corpus. This paper presents a novel approach to introduce confusion rules in the recognition system which are automatically learned through pronunciation modelling. The modified HMM of a foreign spoken language phoneme includes its canonical pronunciation along with all the alternate non-native pronunciations, so that spoken language phonemes pronounced correctly by a non-native speaker could be recognized. We evaluate our approaches on the European project HIWIRE non-native corpus which contains English sentences pronounced by French, Italian, Greek and Spanish speakers. Two cases are studied: the native language of the test speaker is either known or unknown. Our approach gives better recognition results than the classical acoustic adaptation of HMM when the foreign origin of the speaker is known. We obtain 22% WER reduction compared to the reference system. Furthermore, we take into account the written form of the spoken words: non-native speakers may rely on the writing of the words in order to pronounce them. This approach does not provide any further improvements.

Domaines

Interface homme-machine [cs.HC]

Dominique Fohr : Connectez-vous pour contacter le contributeur

https://hal.science/hal-00764626

Soumis le : jeudi 13 décembre 2012-11:34:57

Dernière modification le : vendredi 24 mars 2023-14:52:56

Dates et versions

hal-00764626 , version 1 (13-12-2012)

Identifiants

HAL Id : hal-00764626 , version 1

Citer

Ghazi Bouselmi, Dominique Fohr, Irina Illina. Multilingual Recognition of Non-Native Speech using Acoustic Model Transformation and Pronunciation Modeling. International Journal of Speech Technology, 2012, 15 (2), pp.203 - 213. ⟨hal-00764626⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA UNIV-LORRAINE INRIA2 LORIA

114 Consultations

0 Téléchargements

Multilingual Recognition of Non-Native Speech using Acoustic Model Transformation and Pronunciation Modeling

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager