Journal article. Journal of the Acoustical Society of America. Year: 2013

An episodic memory-based solution for the acoustic-to-articulatory inversion problem

Abstract

This paper presents an acoustic-to-articulatory inversion method based on an episodic memory. An episodic memory is an attractive model for two reasons. First, it makes no assumptions about the mapping function; instead, it relies on real, synchronized acoustic and articulatory data streams. Second, the memory inherently represents the real articulatory dynamics as observed. It is argued that computational models of episodic memory, as they are usually designed, cannot provide a satisfactory solution to the acoustic-to-articulatory inversion problem because of the insufficient quantity of training data. Therefore, an episodic memory called generative episodic memory (G-Mem) is proposed, which is able to produce articulatory trajectories that do not belong to the set of episodes the memory is based on. The generative episodic memory is evaluated using two electromagnetic articulography corpora: one for English and one for French. Comparisons with a codebook-based method and with a classical episodic memory (termed concatenative episodic memory) are presented in order to evaluate the proposed generative episodic memory in terms of both its modeling of articulatory dynamics and its generalization capabilities. The results show the effectiveness of the method: an overall root-mean-square error of 1.65 mm and a correlation of 0.71 are obtained for the G-Mem method. These results are comparable to those of recently proposed methods.
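The two figures of merit quoted in the abstract, root-mean-square error (in mm) and correlation, can be illustrated with a minimal sketch. It assumes each trajectory is a list of positions of one articulator coordinate sampled at the same frames, and that "correlation" means the Pearson coefficient. The abstract does not say how the scores are averaged over coils, utterances, or speakers, so that step is left out. The function names and the toy data are hypothetical and are not taken from the paper.

```python
import math


def rmse(measured, predicted):
    """Root-mean-square error between two equal-length trajectories.

    The result is in the same unit as the inputs (e.g. mm for EMA coils).
    """
    if len(measured) != len(predicted) or not measured:
        raise ValueError("trajectories must be non-empty and of equal length")
    squared = sum((m - p) ** 2 for m, p in zip(measured, predicted))
    return math.sqrt(squared / len(measured))


def pearson(measured, predicted):
    """Pearson correlation coefficient between two equal-length trajectories."""
    n = len(measured)
    if n != len(predicted) or n < 2:
        raise ValueError("need two trajectories of equal length >= 2")
    mean_m = sum(measured) / n
    mean_p = sum(predicted) / n
    cov = sum((m - mean_m) * (p - mean_p) for m, p in zip(measured, predicted))
    var_m = sum((m - mean_m) ** 2 for m in measured)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    if var_m == 0 or var_p == 0:
        raise ValueError("correlation is undefined for a constant trajectory")
    return cov / math.sqrt(var_m * var_p)


# Hypothetical toy data: one coil coordinate (mm) over four frames.
measured = [1.0, 2.0, 3.0, 4.0]
predicted = [1.5, 2.5, 2.5, 4.5]

error_mm = rmse(measured, predicted)
correlation = pearson(measured, predicted)
print(f"RMSE = {error_mm:.2f} mm, correlation = {correlation:.2f}")
```

The two measures are complementary: RMSE reports how far the estimated positions are from the measured ones, while correlation reports how well the shape of the estimated trajectory follows the measured one, independently of a constant offset or scale.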
Main file: JASA_Demange_Ouni.pdf (662.61 KB)
Origin: files produced by the author(s)

Dates and versions

hal-00834556, version 1 (06-10-2015)

Cite

Sébastien Demange, Slim Ouni. An episodic memory-based solution for the acoustic-to-articulatory inversion problem. Journal of the Acoustical Society of America, 2013, 133 (5), pp.2921-2930. ⟨10.1121/1.4798665⟩. ⟨hal-00834556⟩