Improved Signal-to-Noise Ratio Estimation for Speech Enhancement

Cyril Plapous; Claude Marro; Pascal Scalart

Article Dans Une Revue IEEE Transactions on Audio, Speech and Language Processing Année : 2006

Improved Signal-to-Noise Ratio Estimation for Speech Enhancement

(1) , (1) , (2)

1
2

Cyril Plapous

Fonction : Auteur

Orange Labs [Lannion]

Claude Marro

Fonction : Auteur

Orange Labs [Lannion]

Pascal Scalart

Fonction : Auteur
PersonId : 866537

Reconfigurable and Retargetable Digital Devices

Résumé

This paper addresses the problem of single microphone speech enhancement in noisy environments. State-of-the-art short-time noise reduction techniques are most often expressed as a spectral gain depending on the Signal-to-Noise Ratio (SNR). The well-known decision-directed (DD) approach drastically limits the level of musical noise but the estimated a priori SNR is biased since it depends on the speech spectrum estimation in the previous frame. Therefore the gain function matches the previous frame rather than the current one which degrades the noise reduction performance. The consequence of this bias is an annoying reverberation effect. We propose a method called Two-Step Noise Reduction (TSNR) technique which solves this problem while maintaining the benefits of the decision-directed approach. The estimation of the a priori SNR is refined by a second step to remove the bias of the DD approach, thus removing the reverberation effect. However, classic short-time noise reduction techniques, including TSNR, introduce harmonic distortion in enhanced speech because of the unreliability of estimators for small signal-to-noise ratios. This is mainly due to the difficult task of noise PSD estimation in single microphone schemes. To overcome this problem, we propose a method called Harmonic Regeneration Noise Reduction (HRNR). A non-linearity is used to regenerate the degraded harmonics of the distorted signal in an efficient way. The resulting artificial signal is produced in order to refine the a priori SNR used to compute a spectral gain able to preserve the speech harmonics. These methods are analyzed and objective and formal subjective test results between HRNR and TSNR techniques are provided. A significant improvement is brought by HRNR compared to TSNR thanks to the preservation of harmonics.

Domaines

Traitement du signal et de l'image [eess.SP] Traitement du signal et de l'image [eess.SP]

Fichier principal

2005_plapous.pdf (6.43 Mo)

Origine : Fichiers produits par l'(les) auteur(s)

Pascal Scalart : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00450766

Soumis le : mercredi 27 janvier 2010-10:09:46

Dernière modification le : vendredi 24 mars 2023-14:52:52

Archivage à long terme le : vendredi 18 juin 2010-01:29:29

Dates et versions

inria-00450766 , version 1 (27-01-2010)

Identifiants

HAL Id : inria-00450766 , version 1

Citer

Cyril Plapous, Claude Marro, Pascal Scalart. Improved Signal-to-Noise Ratio Estimation for Speech Enhancement. IEEE Transactions on Audio, Speech and Language Processing, 2006. ⟨inria-00450766⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

EC-PARIS UNIV-RENNES1 CNRS INRIA INSA-RENNES IRISA INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES INSA-GROUPE UR1-MATH-NUM

439 Consultations

6251 Téléchargements

Improved Signal-to-Noise Ratio Estimation for Speech Enhancement

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager