Seek&Hide. Anonymising a French SMS corpus using natural language processing techniques. - INRIA - Institut National de Recherche en Informatique et en Automatique Accéder directement au contenu
Chapitre D'ouvrage Année : 2014

Seek&Hide. Anonymising a French SMS corpus using natural language processing techniques.

Résumé

This article presents the system Seek&Hide, a text message processing tool developed for the sud4science LR (http://www.sud4science.org/) project. It performs the anonymisation/de-iden- ti cation of a corpus. At present, it has been used to anonymise the sud4science LR corpus of French text messages collected during the project. is is done in two phases. In the rst phase, it automatically processes over 70% of the corpus. e rest of the corpus is processed in the second phase, aided by an expert annotator via a web interface speci cally designed to simplify the task.

Mots clés

Fichier non déposé

Dates et versions

hal-01485615 , version 1 (09-03-2017)

Identifiants

Citer

Pierre Accorsi, Namrata Patel, Cédric Lopez, Rachel Panckhurst, Mathieu Roche. Seek&Hide. Anonymising a French SMS corpus using natural language processing techniques.. Louise-Amélie Cougnon; Cédrick Fairon. SMS Communication. A linguistic approach, John Benjamins, pp.11-28, 2014, 978 90 272 0280 2/9789027270306. ⟨10.1075/bct.61⟩. ⟨hal-01485615⟩
560 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More