Book Chapter, 2020

Pre-Training a Neural Language Model Improves the Sample Efficiency of an Emergency Room Classification Model

Abstract

To build a French national electronic injury surveillance system based on emergency room visits, we aim to develop a coding system that classifies the causes of these visits from free-text clinical notes. Supervised learning techniques have shown good results in this area, but they require large expert-annotated datasets, which are time-consuming and costly to obtain. We hypothesize that a Transformer-based natural language processing model incorporating a generative self-supervised pre-training step can significantly reduce the number of annotated samples required for supervised fine-tuning. In this preliminary study, we test our hypothesis on the simplified problem of predicting, from free-text clinical notes, whether a visit is the consequence of a traumatic event. Using fully retrained GPT-2 models (without OpenAI's pre-trained weights), we assess the gain from applying a self-supervised pre-training phase on unlabeled notes prior to the supervised learning task. Results show that the amount of labeled data required to achieve a given level of performance (AUC > 0.95) was reduced by a factor of 10 when pre-training was applied. Conversely, with 16 times more data, the fully supervised model achieved an improvement of less than 1% in AUC. To conclude, it is possible to adapt a multipurpose neural language model such as GPT-2 to create a powerful tool for classifying free-text notes with only a small number of labeled samples.
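The two-stage recipe the abstract describes (generative pre-training on unlabeled notes, then supervised fine-tuning on a small labeled subset) can be illustrated with a short sketch. This is not the authors' code: the library (Hugging Face transformers), the toy model size, the token IDs, the output directory name, and the training details are all illustrative assumptions; only the pre-train-then-fine-tune structure follows the abstract.

    # Minimal sketch of the two-stage approach; NOT the authors' implementation.
    import torch
    from transformers import (
        GPT2Config,
        GPT2LMHeadModel,
        GPT2ForSequenceClassification,
    )

    # Stage 1: generative self-supervised pre-training from random
    # initialization (no OpenAI weights): next-token prediction on
    # unlabeled clinical notes.
    config = GPT2Config(
        vocab_size=8000,                   # assumed custom tokenizer for French notes
        n_layer=4, n_head=4, n_embd=256,   # toy size, for illustration only
        bos_token_id=0, eos_token_id=1,
    )
    lm = GPT2LMHeadModel(config)
    optimizer = torch.optim.AdamW(lm.parameters(), lr=5e-5)
    notes = torch.randint(0, config.vocab_size, (2, 64))  # stand-in for tokenized notes
    loss = lm(input_ids=notes, labels=notes).loss         # causal language-model loss
    loss.backward()
    optimizer.step()
    lm.save_pretrained("gpt2-er-notes")                   # hypothetical output directory

    # Stage 2: supervised fine-tuning of the pre-trained weights on a small
    # labeled set for the binary task (traumatic vs. non-traumatic visit).
    clf = GPT2ForSequenceClassification.from_pretrained("gpt2-er-notes", num_labels=2)
    clf.config.pad_token_id = clf.config.eos_token_id     # GPT-2 defines no pad token
    labels = torch.tensor([1, 0])                         # 1 = trauma-related, 0 = other cause
    loss = clf(input_ids=notes, labels=labels).loss       # cross-entropy classification loss
    loss.backward()
    # The fully supervised baseline in the abstract skips Stage 1 and trains
    # the classifier from random initialization on labeled notes only.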
Main file

18444-79384-1-PB.pdf (553.75 KB)
Origin: Publisher files authorized on an open archive

Dates and versions

hal-02611917, version 1 (19-05-2020)

Identifiers

  • HAL Id: hal-02611917, version 1

Cite

Binbin Xu, Cédric Gil-Jardiné, Frantz Thiessard, Éric Tellier, Marta Avalos, et al. Pre-Training a Neural Language Model Improves the Sample Efficiency of an Emergency Room Classification Model. In Roman Barták, Eric Bell (eds.), Proceedings of the 33rd International Florida Artificial Intelligence Research Society Conference, The AAAI Press, 2020. ISBN 978-1-57735-821-3. ⟨hal-02611917⟩
127 Views
310 Downloads
