Conference paper, 2023

Don't fear the unlabelled: safe deep semi-supervised learning via simple debiasing

Abstract

Semi-supervised learning (SSL) provides an effective means of leveraging unlabelled data to improve a model's performance. Even though the domain has received a considerable amount of attention in recent years, most methods share a common drawback: they are unsafe. By safeness we mean the property of not degrading a fully supervised model when unlabelled data are included. Our starting point is the observation that the risk estimate minimised by most discriminative SSL methods is biased, even asymptotically. This bias makes these techniques untrustworthy without a proper validation set, but we propose a simple way of removing it. Our debiasing approach is straightforward to implement and applicable to most deep SSL methods. We provide simple theoretical guarantees on the safeness of these modified methods, without relying on the strong assumptions about the data distribution that SSL theory usually requires. We evaluate debiased versions of several existing SSL methods and show that debiasing can compete with classic deep SSL techniques in various standard settings and even performs well when traditional SSL fails.
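
To make the debiasing idea concrete, here is a minimal PyTorch sketch of what such a debiased objective could look like, assuming entropy minimisation as the unlabelled surrogate term. The function name `debiased_ssl_loss`, the weight `lam`, and the choice of surrogate are illustrative assumptions for this sketch, not the paper's exact estimator.

```python
import torch
import torch.nn.functional as F

def debiased_ssl_loss(model, x_lab, y_lab, x_unlab, lam=0.5):
    """Sketch of a debiased semi-supervised objective.

    A standard SSL objective adds a surrogate term (here: the entropy of the
    predictions) computed on unlabelled data to the supervised risk.  The
    debiasing idea sketched here subtracts the same surrogate term evaluated
    on the labelled inputs, so the added terms cancel in expectation and the
    risk estimate is not biased by the unlabelled part of the objective.
    """
    logits_lab = model(x_lab)
    logits_unlab = model(x_unlab)

    # Standard supervised risk on the labelled batch.
    supervised = F.cross_entropy(logits_lab, y_lab)

    def entropy(logits):
        # Mean entropy of the predicted class distributions.
        p = F.softmax(logits, dim=-1)
        return -(p * F.log_softmax(logits, dim=-1)).sum(dim=-1).mean()

    # Surrogate SSL term on unlabelled data, minus the same term on
    # labelled inputs (labels ignored) to remove the bias.
    debiased_surrogate = entropy(logits_unlab) - entropy(logits_lab)

    return supervised + lam * debiased_surrogate


# Illustrative usage with a toy linear classifier and random data.
model = torch.nn.Linear(10, 3)
x_lab, y_lab = torch.randn(8, 10), torch.randint(0, 3, (8,))
x_unlab = torch.randn(32, 10)
loss = debiased_ssl_loss(model, x_lab, y_lab, x_unlab, lam=0.5)
loss.backward()
```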

Dates and versions

hal-03610272, version 1 (16-03-2022)

Licence

Attribution (CC BY)

Identifiers

Cite

Hugo Schmutz, Olivier Humbert, Pierre-Alexandre Mattei. Don't fear the unlabelled: safe deep semi-supervised learning via simple debiasing. International Conference on Learning Representations, 2023, Kigali, Rwanda. ⟨hal-03610272⟩