Conference Paper, Year: 2022

A Sparsity-promoting Dictionary Model for Variational Autoencoders

Abstract

Structuring the latent space in probabilistic deep generative models, e.g., variational autoencoders (VAEs), is important to yield more expressive models and interpretable representations, and to avoid overfitting. One way to achieve this objective is to impose a sparsity constraint on the latent variables, e.g., via a Laplace prior. However, such approaches usually complicate training and sacrifice reconstruction quality to promote sparsity. In this paper, we propose a simple yet effective methodology to structure the latent space via a sparsity-promoting dictionary model, which assumes that each latent code can be written as a sparse linear combination of a dictionary's columns. In particular, we leverage a computationally efficient and tuning-free method, which relies on a zero-mean Gaussian latent prior with learnable variances. We derive a variational inference scheme to train the model. Experiments on speech generative modeling demonstrate the advantage of the proposed approach over competing techniques, since it promotes sparsity without deteriorating the output speech quality.
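
To make the modeling assumption concrete, below is a minimal PyTorch sketch of a VAE whose latent code is a linear combination of dictionary atoms, z = D w, with a zero-mean Gaussian prior on w whose per-atom variances are learnable. This is our reading of the abstract only: the class name, network sizes, and dimensions (x_dim, z_dim, n_atoms) are illustrative assumptions, not the authors' implementation.

```python
import torch
import torch.nn as nn

class DictionaryVAE(nn.Module):
    """Hypothetical sketch: latent code z = D @ w, where w has a zero-mean
    Gaussian prior with learnable per-component variances gamma. Driving a
    gamma toward zero deactivates the corresponding dictionary atom, which
    is how such priors can promote sparsity (cf. sparse Bayesian learning)."""

    def __init__(self, x_dim=513, z_dim=16, n_atoms=64):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(x_dim, 128), nn.Tanh())
        self.mu = nn.Linear(128, n_atoms)       # mean of q(w | x)
        self.logvar = nn.Linear(128, n_atoms)   # log-variance of q(w | x)
        self.D = nn.Parameter(torch.randn(z_dim, n_atoms) / n_atoms**0.5)
        self.log_gamma = nn.Parameter(torch.zeros(n_atoms))  # learnable prior variances
        self.decoder = nn.Sequential(nn.Linear(z_dim, 128), nn.Tanh(),
                                     nn.Linear(128, x_dim))

    def forward(self, x):
        h = self.encoder(x)
        mu, logvar = self.mu(h), self.logvar(h)
        # Reparameterization trick: sample w ~ q(w | x).
        w = mu + torch.randn_like(mu) * (0.5 * logvar).exp()
        z = w @ self.D.t()                      # combination of dictionary atoms
        x_hat = self.decoder(z)
        # KL( N(mu, sigma^2) || N(0, gamma) ), summed over atoms.
        kl = 0.5 * (self.log_gamma - logvar
                    + (logvar.exp() + mu**2) / self.log_gamma.exp() - 1).sum(-1)
        return x_hat, kl.mean()
```

In this sketch, the ELBO would be a reconstruction loss on x_hat plus the returned KL term; atoms whose learned prior variance collapses contribute (near-)zero weights, yielding a sparse combination without a Laplace prior's optimization difficulties.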
Main file

main.pdf (247.28 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-03623769, version 1 (29-03-2022)
hal-03623769, version 2 (17-06-2022)

Identifiers

  • HAL Id: hal-03623769, version 2

Cite

Mostafa Sadeghi, Paul Magron. A Sparsity-promoting Dictionary Model for Variational Autoencoders. INTERSPEECH 2022, Sep 2022, Incheon, South Korea. ⟨hal-03623769v2⟩