Articulatory Speech Synthesis from Static Context-Aware Articulatory Targets

Anastasiia Tsukanova; Benjamin Elie; Yves Laprie

doi:10.1007/978-3-030-00126-1_4

Chapitre D'ouvrage Année : 2018

Articulatory Speech Synthesis from Static Context-Aware Articulatory Targets

(1) , (2, 1) , (1)

1
2

Anastasiia Tsukanova

Fonction : Auteur

Speech Modeling for Facilitating Oral-Based Communication

Benjamin Elie

Fonction : Auteur
PersonId : 959483

Laboratoire des signaux et systèmes

Speech Modeling for Facilitating Oral-Based Communication

Yves Laprie

Fonction : Auteur
PersonId : 6696
IdHAL : yves-laprie
ORCID : 0000-0002-2379-6481
IdRef : 060274387

Speech Modeling for Facilitating Oral-Based Communication

Résumé

The aim of this work is to develop an algorithm for controlling the articulators (the jaw, the tongue, the lips, the velum, the larynx and the epiglottis) to produce given speech sounds, syllables and phrases. This control has to take into account coarticulation and be flexible enough to be able to vary strategies for speech production. The data for the algorithm are 97 static MRI images capturing the articulation of French vowels and blocked consonant-vowel syllables. The results of this synthesis are evaluated visually, acoustically and perceptually, and the problems encountered are broken down by their origin: the dataset, its modeling, the algorithm for managing the vocal tract shapes, their translation to the area functions, and the acoustic simulation. We conclude that, among our test examples, the articulatory strategies for vowels and stops are most correct, followed by those of nasals and fricatives. Improving timing strategies with dynamic data is suggested as an avenue for future work.

Mots clés

Articulatory synthesis Articulatory gestures

Synthèse articulatoire Gestes articulatoires Coarticulation

Domaines

Modélisation et simulation Intelligence artificielle [cs.AI] Informatique et langage [cs.CL]

Fichier principal

issp25-tsukanova.pdf (3.01 Mo)

Origine : Fichiers produits par l'(les) auteur(s)

Anastasiia Tsukanova : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01937950

Soumis le : vendredi 7 décembre 2018-12:07:39

Dernière modification le : dimanche 17 mars 2024-11:46:04

Archivage à long terme le : vendredi 8 mars 2019-13:09:54

Dates et versions

hal-01937950 , version 1 (07-12-2018)

Identifiants

HAL Id : hal-01937950 , version 1
DOI : 10.1007/978-3-030-00126-1_4

Citer

Anastasiia Tsukanova, Benjamin Elie, Yves Laprie. Articulatory Speech Synthesis from Static Context-Aware Articulatory Targets. Qiang Fang; Jianwu Dang; Pascal Perrier; Jianguo Wei; Longbiao Wang; Nan Yan. Studies on Speech Production, 10733, Springer, pp.37-47, 2018, Lecture Notes in Computer Science, 978-3-030-00125-4. ⟨10.1007/978-3-030-00126-1_4⟩. ⟨hal-01937950⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA SUP_LSS SUP_SIGNAUX CENTRALESUPELEC UNIV-LORRAINE INRIA2 TDS-MACS LORIA LORIA-NLPKD UNIV-PARIS-SACLAY ANR GS-ENGINEERING GS-COMPUTER-SCIENCE

108 Consultations

229 Téléchargements

Articulatory Speech Synthesis from Static Context-Aware Articulatory Targets

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager