Training Set Class Distribution Analysis for Deep Learning Model - Application to Cancer Detection - Université Toulouse III - Paul Sabatier - Toulouse INP Accéder directement au contenu
Communication Dans Un Congrès Année : 2019

Training Set Class Distribution Analysis for Deep Learning Model - Application to Cancer Detection

Résumé

Deep learning models specifically CNNs have been used successfully in many tasks including medical image classification. CNN effectiveness depends on the availability of large training data set to train which is generally costly to obtain for new applications or new cases. However, there is a little concrete recommendation about training set creation. In this research, we analyze the impact of different class distributions in the training data to a CNN model. We consider the case of cancer detection task from histopathological images for cancer diagnosis and derive some useful hypotheses about the distribution of classes in the training data. We found that using all the training data leads to the best recall-precision trade-off, while training with a reduced number of examples from some classes, it is possible to inflect the model toward a desired accuracy on a given class.
Fichier principal
Vignette du fichier
reshma_26163.pdf (407.71 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-02891748 , version 1 (07-07-2020)

Identifiants

  • HAL Id : hal-02891748 , version 1
  • OATAO : 26163

Citer

Ismat Ara Reshma, Margot Gaspard, Camille Franchet, Pierre Brousset, Emmanuel Faure, et al.. Training Set Class Distribution Analysis for Deep Learning Model - Application to Cancer Detection. 1st International Conference on Advances in Signal Processing and Artificial Intelligence (ASPAI 2019), Mar 2019, Barcelona, Spain. pp.123-127. ⟨hal-02891748⟩
109 Consultations
49 Téléchargements

Partager

Gmail Facebook X LinkedIn More