Layer adaptation for transfer of expressivity in speech synthesis

Ajinkya Kulkarni; Vincent Colotte; Denis Jouvet

Communication Dans Un Congrès Année : 2019

Layer adaptation for transfer of expressivity in speech synthesis

(1) , (1) , (1)

Ajinkya Kulkarni

Fonction : Auteur

Speech Modeling for Facilitating Oral-Based Communication

Vincent Colotte

Fonction : Auteur
PersonId : 16268
IdHAL : vincent-colotte
IdRef : 070401683

Speech Modeling for Facilitating Oral-Based Communication

Denis Jouvet

Fonction : Auteur
PersonId : 15904
IdHAL : denis-jouvet
IdRef : 029418666

Speech Modeling for Facilitating Oral-Based Communication

Résumé

Expressive speech synthesis using parametric approaches is constrained by the style of the speech corpus used. In this paper, we present the development of an expressive speech synthesis for a new speaker voice without requiring a specific recording of expressive speech by new speaker. We propose deep neural network based layer adaptation framework for transferring the expressive characteristics of speech to a new speaker's voice for which only neutral speech data is available. The focus of the work is on investigating transfer learning mechanism, which will accelerate the efforts towards exploiting existing expressive speech corpus. Experiments using expressive Caroline speech corpus and neutral Lisa speech corpus shows layer adaptation technique is able to transfer expressive characteristics while keeping the speaker's style characteristics.

Mots clés

expressive speech synthesis deep learning transfer learning domain adaptation emotion

Domaines

Intelligence artificielle [cs.AI] Traitement du signal et de l'image [eess.SP]

Fichier principal

LTC19.pdf (372.33 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Vincent Colotte : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-02177945

Soumis le : mardi 9 juillet 2019-14:15:08

Dernière modification le : lundi 11 septembre 2023-17:41:19

Dates et versions

hal-02177945 , version 1 (09-07-2019)

Identifiants

HAL Id : hal-02177945 , version 1

Citer

Ajinkya Kulkarni, Vincent Colotte, Denis Jouvet. Layer adaptation for transfer of expressivity in speech synthesis. LTC'19 - 9th Language & Technology Conference, May 2019, Poznan, Poland. ⟨hal-02177945⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA UNIV-LORRAINE INRIA2 LORIA LORIA-NLPKD

139 Consultations

256 Téléchargements

Layer adaptation for transfer of expressivity in speech synthesis

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager