Can we Generate Emotional Pronunciations for Expressive Speech Synthesis? - INRIA - Institut National de Recherche en Informatique et en Automatique Accéder directement au contenu
Article Dans Une Revue IEEE Transactions on Affective Computing Année : 2020

Can we Generate Emotional Pronunciations for Expressive Speech Synthesis?

Résumé

In the field of expressive speech synthesis, a lot of work has been conducted on suprasegmental prosodic features while few has been done on pronunciation variants. However, prosody is highly related to the sequence of phonemes to be expressed. This article raises two issues in the generation of emotional pronunciations for TTS systems. The first issue consists in designing an automatic pronunciation generation method from text, while the second issue addresses the very existence of emotional pronunciations through experiments conducted on emotional speech. To do so, an innovative pronunciation adaptation method which automatically adapts canonical phonemes first to those labeled in the corpus used to create a synthetic voice, then to those labeled in an expressive corpus, is presented. This method consists in training conditional random fields pronunciation models with prosodic, linguistic, phonological and articulatory features. The analysis of emotional pronunciations reveals strong dependencies between prosody and phoneme assimilation or elisions. According to perception tests, the double adaptation allows to synthesize expressive speech samples of good quality, but emotion-specific pronunciations are too subtle to be perceived by testers.
Fichier principal
Vignette du fichier
TAC2017.pdf (1.16 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01802463 , version 1 (10-09-2018)

Identifiants

Citer

Marie Tahon, Gwénolé Lecorvé, Damien Lolive. Can we Generate Emotional Pronunciations for Expressive Speech Synthesis?. IEEE Transactions on Affective Computing, 2020, 11 (4), pp.684-695. ⟨10.1109/TAFFC.2018.2828429⟩. ⟨hal-01802463⟩
390 Consultations
1170 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More