Breast cancer and quality of life: medical information extraction from health forums - INRIA - Institut National de Recherche en Informatique et en Automatique Accéder directement au contenu
Communication Dans Un Congrès Année : 2014

Breast cancer and quality of life: medical information extraction from health forums

Résumé

Internet health forums are a rich textual resource with content generated through free exchanges among patients and, in certain cases, health professionals. We tackle the problem of retrieving clinically relevant information from such forums, with relevant topics being defined from clinical auto-questionnaires. Texts in forums are largely unstructured and noisy, calling for adapted preprocessing and query methods. We minimize the number of false negatives in queries by using a synonym tool to achieve query expansion of initial topic keywords. To avoid false positives, we propose a new measure based on a statistical comparison of frequent co-occurrences in a large reference corpus (Web) to keep only relevant expansions. Our work is motivated by a study of breast cancer patients' health-related quality of life (QoL). We consider topics defined from a breast-cancer specific QoL-questionnaire. We quantify and structure occurrences in posts of a specialized French forum and outline important future developments.
Fichier principal
Vignette du fichier
MIERevision.pdf (179.78 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01061891 , version 1 (08-09-2014)

Licence

Paternité - Pas d'utilisation commerciale

Identifiants

Citer

Thomas Opitz, Jérôme Azé, Sandra Bringay, Cyrille Joutard, Christian Lavergne, et al.. Breast cancer and quality of life: medical information extraction from health forums. 25th European Medical Informatics Conference (MIE), Aug 2014, Istanbul, Turkey. pp.1070-1074, ⟨10.3233/978-1-61499-432-9-1070⟩. ⟨hal-01061891⟩
1371 Consultations
539 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More