How Far Association Rules and Statistical Indices help Structure Terminology? - INRIA - Institut National de Recherche en Informatique et en Automatique Accéder directement au contenu
Communication Dans Un Congrès Année : 2002

How Far Association Rules and Statistical Indices help Structure Terminology?

Résumé

Automatic or semi-automatic structuring of terminology extracted from large corpora still remain a bottleneck issue for managing the fast growing textual sources. This paper aims at defining a methodology to tackle this point using a text mining process for association rules extraction. We show the ability of the rules to enhance the quality of the terminology by filtering the ambiguous, noisy terms of a domain of speciality. However, the mining process often generates a huge number of rules. This issue leads us to raise the question of how can we find a subset of rules that constitutes a valid relational structure according to the knowledge domain. We use statistical indices to rank the rules that are more capable of reflecting the complex semantic relations between terms. We also study how far some rules can help the expert with identifying synonymy/hypernymy relations or with filtering terms.
Fichier non déposé

Dates et versions

inria-00100767 , version 1 (26-09-2006)

Identifiants

  • HAL Id : inria-00100767 , version 1

Citer

Hacène Cherfi, Yannick Toussaint. How Far Association Rules and Statistical Indices help Structure Terminology?. Workshop of ECAI2002: Natural Language Processing and Machine Learning for Ontology Engineering OLT'02, In conjunction with ECAI 2002: 15th European Conference on Artificial Intelligence, Jul 2002, Lyon, France, pp.5-9. ⟨inria-00100767⟩
73 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More