Selection bias in the reported performances of AD classification pipelines

Alex F. Mendelson; Maria A. Zuluaga; Marco Lorenzi; Brian F. Hutton; Sébastien Ourselin

doi:10.1016/j.nicl.2016.12.018

Article Dans Une Revue Neuroimage-Clinical Année : 2017

Selection bias in the reported performances of AD classification pipelines

(1) , (1) , (2) , (3) , (1)

1
2
3

Alex F. Mendelson

Fonction : Auteur

Centre for Medical Image Computing

Maria A. Zuluaga

Fonction : Auteur
PersonId : 748500
IdHAL : maria-a-zuluaga
ORCID : 0000-0002-1147-766X
IdRef : 171363035

Centre for Medical Image Computing

Marco Lorenzi

Fonction : Auteur
PersonId : 178572
IdHAL : marco-lorenzi
ORCID : 0000-0003-0521-2881
IdRef : 168153335

Analysis and Simulation of Biomedical Images

Brian F. Hutton

Fonction : Auteur

University College of London [London]

Sébastien Ourselin

Fonction : Auteur

Centre for Medical Image Computing

Résumé

The last decade has seen a great proliferation of supervised learning pipelines for individual diagnosis and prognosis in Alzheimer's disease. As more pipelines are developed and evaluated in the search for greater performance, only those results that are relatively impressive will be selected for publication. We present an empirical study to evaluate the potential for optimistic bias in classification performance results as a result of this selection. This is achieved using a novel, resampling-based experiment design that effectively simulates the optimisation of pipeline specifications by individuals or collectives of researchers using cross validation with limited data. Our findings indicate that bias can plausibly account for an appreciable fraction (often greater than half) of the apparent performance improvement associated with the pipeline optimisation, particularly in small samples. We discuss the consistency of our findings with patterns observed in the literature and consider strategies for bias reduction and mitigation.

Mots clés

Alzheimer's disease Classification Cross validation Selection bias Overfitting ADNI

Domaines

Imagerie médicale Intelligence artificielle [cs.AI] Psychiatrie et santé mentale Neurobiologie Bio-informatique [q-bio.QM] Traitement du signal et de l'image [eess.SP] Applications [stat.AP] Machine Learning [stat.ML] Bio-Informatique, Biologie Systémique [q-bio.QM] Imagerie Anatomie, Histologie, Anatomopathologie [q-bio.TO]

Fichier principal

1-s2.0-S221315821630256X-main.pdf (1.59 Mo)

Origine : Publication financée par une institution

Marco Lorenzi : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01843390

Soumis le : vendredi 27 juillet 2018-14:11:57

Dernière modification le : mercredi 15 mars 2023-08:58:09

Archivage à long terme le : dimanche 28 octobre 2018-13:11:08

Dates et versions

hal-01843390 , version 1 (27-07-2018)

Licence

Paternité

Identifiants

HAL Id : hal-01843390 , version 1
DOI : 10.1016/j.nicl.2016.12.018

Citer

Alex F. Mendelson, Maria A. Zuluaga, Marco Lorenzi, Brian F. Hutton, Sébastien Ourselin. Selection bias in the reported performances of AD classification pipelines. Neuroimage-Clinical, 2017, 14, pp.400 - 416. ⟨10.1016/j.nicl.2016.12.018⟩. ⟨hal-01843390⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INRIA INRIA2

158 Consultations

119 Téléchargements

Selection bias in the reported performances of AD classification pipelines

Résumé

Mots clés

Domaines

Dates et versions

Licence

Identifiants

Citer

Exporter

Collections

Altmetric

Partager