Resources for Named Entity Recognition and Resolution in News Wires - INRIA - Institut National de Recherche en Informatique et en Automatique Accéder directement au contenu
Communication Dans Un Congrès Année : 2010

Resources for Named Entity Recognition and Resolution in News Wires

Résumé

In the applicative context of news wire enrichment with metadata, named entity recognition plays an important role, but requires to be followed by a resolution module that maps named entity mentions to entries in a reference database. In this paper, we describe NP, the named entity module embedded in the SXPipe shallow processing chain, that we used for extracting information from French news wires from the Agence France-Presse. We describe the construction of our reference database from freely available external resources, as well as our named entity detection, disambiguation and resolution modules. We also introduce a freely available and manually developped annotated corpus designed for the evaluation of named entity recognition and resolution tools, and provide evaluation figures for NP.
Fichier principal
Vignette du fichier
entity10np.pdf (78.91 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

inria-00521240 , version 1 (26-09-2010)

Identifiants

  • HAL Id : inria-00521240 , version 1

Citer

Rosa Stern, Benoît Sagot. Resources for Named Entity Recognition and Resolution in News Wires. Entity 2010 Workshop at LREC 2010, May 2010, Valletta, Malta. ⟨inria-00521240⟩
176 Consultations
355 Téléchargements

Partager

Gmail Facebook X LinkedIn More