Predicting the encoding of secondary diagnoses. An experience based on decision trees - Université Toulouse III - Paul Sabatier - Toulouse INP
Journal article. Revue des Sciences et Technologies de l'Information - Série ISI : Ingénierie des Systèmes d'Information. Year: 2017

Predicting the encoding of secondary diagnoses. An experience based on decision trees

Abstract

To measure medical activity, hospitals are required to manually encode the diagnoses of each inpatient episode using the International Classification of Diseases (ICD-10). This task is time-consuming and requires substantial staff training. In this paper, we propose an approach that speeds up and facilitates the tedious manual coding of patient information, especially for secondary diagnoses that are not well described in medical resources such as discharge letters and medical records. Our approach leverages data mining techniques, specifically decision trees, to explore medical databases that encode knowledge about such diagnoses. It uses the stored structured information (age, gender, diagnoses count, medical procedures, etc.) to build a decision tree that assigns the appropriate secondary diagnosis code to the corresponding inpatient episode. We evaluated our approach on the PMSI database using fine and coarse levels of diagnosis granularity. Three types of experiments were performed using different techniques to balance the datasets. The results show a significant variation in evaluation scores between the different techniques for the same studied diagnoses. We highlight the efficiency of the random sampling techniques regardless of the type of diagnosis and the type of measure (F1-measure, recall and precision).
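The abstract mentions balancing the datasets with random sampling and scoring with precision, recall and F1-measure. The sketch below illustrates those two ideas only, using the Python standard library. It is not the authors' pipeline: random under-sampling is assumed here as one possible random sampling technique, the episode tuples are made up, the function names `random_undersample` and `precision_recall_f1` are hypothetical, and no decision tree is trained.

```python
import random
from collections import Counter


def random_undersample(rows, labels, seed=0):
    """Randomly keep as many rows per class as the rarest class has."""
    rng = random.Random(seed)
    by_class = {}
    for row, label in zip(rows, labels):
        by_class.setdefault(label, []).append(row)
    target = min(len(group) for group in by_class.values())
    balanced = []
    for label, group in by_class.items():
        for row in rng.sample(group, target):
            balanced.append((row, label))
    rng.shuffle(balanced)
    return [r for r, _ in balanced], [l for _, l in balanced]


def precision_recall_f1(y_true, y_pred, positive=1):
    """Compute precision, recall and F1 for one positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Made-up episodes: (age, gender, diagnoses_count).
# Label 1 means the studied secondary diagnosis was coded (assumed rare).
rows = [(70 + i % 20, i % 2, 1 + i % 5) for i in range(100)]
labels = [1 if i < 10 else 0 for i in range(100)]

balanced_rows, balanced_labels = random_undersample(rows, labels, seed=42)
class_counts = Counter(balanced_labels)

# Made-up predictions, only to exercise the metric function.
y_true = [1, 1, 1, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 1, 0, 0, 0, 0]
precision, recall, f1 = precision_recall_f1(y_true, y_pred)
```

In a fuller experiment, the balanced rows would be used to fit a decision tree, and the metrics would be computed on held-out episodes that keep their original class distribution.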
Main file
chahbadarian_22261.pdf (1.09 MB)
Origin: Files produced by the author(s)

Dates and versions

hal-02864391 , version 1 (11-06-2020)

Identifiers

HAL Id: hal-02864391
DOI: 10.3166/ISI.22.2.69-94

Cite

Ghazar Chahbandarian, Nathalie Bricon-Souf, Imen Megdiche, Rémi Bastide, Jean-Christophe Steinbach. Predicting the encoding of secondary diagnoses. An experience based on decision trees. Revue des Sciences et Technologies de l'Information - Série ISI : Ingénierie des Systèmes d'Information, 2017, 22 (2), pp.69-94. ⟨10.3166/ISI.22.2.69-94⟩. ⟨hal-02864391⟩
