Building A Corporate Corpus For Threads Constitution - INRIA - Institut National de Recherche en Informatique et en Automatique Accéder directement au contenu
Communication Dans Un Congrès Année : 2021

Building A Corporate Corpus For Threads Constitution

Résumé

In this paper we describe the process of building a corporate corpus that will be used as a reference for modelling and computing threads from conversations generated using communication and collaboration tools. The overall goal of the reconstruction of threads is to be able to provide value to the collorator in various use cases, such as higlighting the important parts of a running discussion, reviewing the upcoming commitments or deadlines, etc. Since, to our knowledge, there is no available corporate corpus for the French language which could allow us to address this problem of thread constitution, we present here a method for building such corpora including different aspects and steps which allowed the creation of a pipeline to pseudo-anonymise data. Such a pipeline is a response to the constraints induced by the General Data Protection Regulation GDPR in Europe and the compliance to the secrecy of correspondence.
Fichier principal
Vignette du fichier
Building a corporate corpus for threads constitution.pdf (526.43 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03351533 , version 1 (22-09-2021)

Identifiants

  • HAL Id : hal-03351533 , version 1

Citer

Lionel Tadonfouet Tadjou, Fabrice Bourge, Tiphaine Marie, Laurent Romary, Eric Villemonte de La Clergerie. Building A Corporate Corpus For Threads Constitution. Student Research Workshop associated with the International Conference on Recent Advances in Natural Language Processing (RANLP’2021), Sep 2021, Online, Bulgaria. ⟨hal-03351533⟩
134 Consultations
155 Téléchargements

Partager

Gmail Facebook X LinkedIn More