Limitations of weak labels for embedding and tagging

Nicolas Turpault; Romain Serizel; Emmanuel Vincent

Communication Dans Un Congrès Année : 2020

Limitations of weak labels for embedding and tagging

(1) , (1) , (1)

Nicolas Turpault

Fonction : Auteur
PersonId : 1042968

Speech Modeling for Facilitating Oral-Based Communication

Romain Serizel

Fonction : Auteur
PersonId : 10320
IdHAL : romain-serizel
IdRef : 223797391

Speech Modeling for Facilitating Oral-Based Communication

Emmanuel Vincent

Fonction : Auteur
PersonId : 1256
IdHAL : emmanuelv
ORCID : 0000-0002-0183-7289
IdRef : 089360176

Speech Modeling for Facilitating Oral-Based Communication

Résumé

While many datasets and approaches in ambient sound analysis use weakly labeled data, the impact of weak labels on the performance in comparison to strong labels remains unclear. Indeed, weakly labeled data is usually used because it is too expensive to annotate every data with a strong label and for some use cases strong labels are not sure to give better results. Moreover, weak labels are usually mixed with various other challenges like multilabels, unbalanced classes, overlapping events. In this paper, we formulate a supervised problem which involves weak labels. We create a dataset that focuses on difference between strong and weak labels. We investigate the impact of weak labels when training an embedding or an end-to-end classi-fier. Different experimental scenarios are discussed to give insights into which type of applications are most sensitive to weakly labeled data.

Mots clés

Index Terms-weak labels triplet loss prototypical network audio tagging audio embedding

Domaines

Son [cs.SD] Apprentissage [cs.LG] Intelligence artificielle [cs.AI] Traitement du signal et de l'image [eess.SP]

Fichier principal

icassp2020.pdf (285.73 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Nicolas Turpault : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-02467401

Soumis le : vendredi 7 février 2020-10:08:58

Dernière modification le : lundi 11 septembre 2023-17:41:19

Dates et versions

hal-02467401 , version 1 (04-02-2020)

hal-02467401 , version 2 (07-02-2020)

hal-02467401 , version 3 (30-04-2020)

hal-02467401 , version 4 (07-12-2020)

Identifiants

HAL Id : hal-02467401 , version 2
ARXIV : 2002.01687

Citer

Nicolas Turpault, Romain Serizel, Emmanuel Vincent. Limitations of weak labels for embedding and tagging. ICASSP 2020 - 45th International Conference on Acoustics, Speech, and Signal Processing, May 2020, Barcelona, Spain. ⟨hal-02467401v2⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

280 Consultations

343 Téléchargements

Limitations of weak labels for embedding and tagging

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Altmetric

Partager