Articulatory Speech Synthesis from Static Context-Aware Articulatory Targets - INRIA - Institut National de Recherche en Informatique et en Automatique Accéder directement au contenu
Chapitre D'ouvrage Année : 2018

Articulatory Speech Synthesis from Static Context-Aware Articulatory Targets

Résumé

The aim of this work is to develop an algorithm for controlling the articulators (the jaw, the tongue, the lips, the velum, the larynx and the epiglottis) to produce given speech sounds, syllables and phrases. This control has to take into account coarticulation and be flexible enough to be able to vary strategies for speech production. The data for the algorithm are 97 static MRI images capturing the articulation of French vowels and blocked consonant-vowel syllables. The results of this synthesis are evaluated visually, acoustically and perceptually, and the problems encountered are broken down by their origin: the dataset, its modeling, the algorithm for managing the vocal tract shapes, their translation to the area functions, and the acoustic simulation. We conclude that, among our test examples, the articulatory strategies for vowels and stops are most correct, followed by those of nasals and fricatives. Improving timing strategies with dynamic data is suggested as an avenue for future work.
Fichier principal
Vignette du fichier
issp25-tsukanova.pdf (3.01 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01937950 , version 1 (07-12-2018)

Identifiants

Citer

Anastasiia Tsukanova, Benjamin Elie, Yves Laprie. Articulatory Speech Synthesis from Static Context-Aware Articulatory Targets. Qiang Fang; Jianwu Dang; Pascal Perrier; Jianguo Wei; Longbiao Wang; Nan Yan. Studies on Speech Production, 10733, Springer, pp.37-47, 2018, Lecture Notes in Computer Science, 978-3-030-00125-4. ⟨10.1007/978-3-030-00126-1_4⟩. ⟨hal-01937950⟩
108 Consultations
229 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More