Limitations of weak labels for embedding and tagging - Department of Natural Language Processing & Knowledge Discovery Accéder directement au contenu
Communication Dans Un Congrès Année : 2020

Limitations of weak labels for embedding and tagging

Résumé

While many datasets and approaches in ambient sound analysis use weakly labeled data, the impact of weak labels on the performance in comparison to strong labels remains unclear. Indeed, weakly labeled data is usually used because it is too expensive to annotate every data with a strong label and for some use cases strong labels are not sure to give better results. Moreover, weak labels are usually mixed with various other challenges like multilabels, unbalanced classes, overlapping events. In this paper, we formulate a supervised problem which involves weak labels. We create a dataset that focuses on difference between strong and weak labels. We investigate the impact of weak labels when training an embedding or an end-to-end classi-fier. Different experimental scenarios are discussed to give insights into which type of applications are most sensitive to weakly labeled data.
Fichier principal
Vignette du fichier
icassp2020.pdf (285.73 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-02467401 , version 1 (04-02-2020)
hal-02467401 , version 2 (07-02-2020)
hal-02467401 , version 3 (30-04-2020)
hal-02467401 , version 4 (07-12-2020)

Identifiants

Citer

Nicolas Turpault, Romain Serizel, Emmanuel Vincent. Limitations of weak labels for embedding and tagging. ICASSP 2020 - 45th International Conference on Acoustics, Speech, and Signal Processing, May 2020, Barcelona, Spain. ⟨hal-02467401v2⟩
280 Consultations
343 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More