MULTICHANNEL SPEECH ENHANCEMENT FOR SPEAKER VERIFICATION IN NOISY AND REVERBERANT ENVIRONMENTS

Sandipana Dowerah; Romain Serizel; Denis Jouvet; Mohammad Mohammadamini; Driss Matrouf

Pré-Publication, Document De Travail Année : 2021

MULTICHANNEL SPEECH ENHANCEMENT FOR SPEAKER VERIFICATION IN NOISY AND REVERBERANT ENVIRONMENTS

(1) , (1) , (1) , (2) , (2)

1
2

Sandipana Dowerah

Fonction : Auteur

Speech Modeling for Facilitating Oral-Based Communication

Romain Serizel

Fonction : Auteur
PersonId : 10320
IdHAL : romain-serizel
IdRef : 223797391

Speech Modeling for Facilitating Oral-Based Communication

Denis Jouvet

Fonction : Auteur
PersonId : 15904
IdHAL : denis-jouvet
IdRef : 029418666

Speech Modeling for Facilitating Oral-Based Communication

Mohammad Mohammadamini

Fonction : Auteur

Laboratoire Informatique d'Avignon

Driss Matrouf

Fonction : Auteur

Laboratoire Informatique d'Avignon

Résumé

Speech signals can be corrupted by environmental noise as well as room reverberation which severely affects the speaker verification performance. In this paper, we propose to combine a multichannel pre-processing pipeline including filter-and-sum network (FaSnet), Rank-1 multichannel Wiener filter, and weighted prediction error as a front-end to speaker verification. Experimental evaluation shows that the pre-processing can improve the speaker verification performance as long as the enrollment files are processed similarly to the test data and that test and enrollment occur within similar SNR ranges. Our proposed pipeline is trained on synthetic data but generalizes to unseen, real recorded clips included in the VOiCES eval dataset and improves the speaker verification performance on all the noise conditions.

Domaines

Informatique [cs]

Sandipana Dowerah : Connectez-vous pour contacter le contributeur

https://hal.science/hal-03487420

Soumis le : vendredi 17 décembre 2021-15:44:51

Dernière modification le : lundi 11 septembre 2023-17:41:19

Dates et versions

hal-03487420 , version 1 (17-12-2021)

Identifiants

HAL Id : hal-03487420 , version 1

Citer

Sandipana Dowerah, Romain Serizel, Denis Jouvet, Mohammad Mohammadamini, Driss Matrouf. MULTICHANNEL SPEECH ENHANCEMENT FOR SPEAKER VERIFICATION IN NOISY AND REVERBERANT ENVIRONMENTS. 2021. ⟨hal-03487420⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-AVIGNON CNRS INRIA UNIV-LORRAINE INRIA2 LORIA LORIA-NLPKD LIA ANR

236 Consultations

52 Téléchargements

MULTICHANNEL SPEECH ENHANCEMENT FOR SPEAKER VERIFICATION IN NOISY AND REVERBERANT ENVIRONMENTS

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager