A Conformity Measure using Background Knowledge for Association Rules: Application to Text Mining - INRIA - Institut National de Recherche en Informatique et en Automatique
Book chapter, Year: 2009

A Conformity Measure using Background Knowledge for Association Rules: Application to Text Mining

Abstract

Text mining with association rules generates a very large number of rules. According to domain experts, most of these rules merely convey common knowledge, i.e. they associate terms that experts would readily relate to each other. To focus interpretation on the results and discover new knowledge units, criteria are needed for classifying the extracted rules. Most rule classification methods are based on numerical quality measures. In this chapter, we introduce two classification methods: the first is based on a classical numerical approach, i.e. using quality measures, and the second is based on domain knowledge. The second approach is our original proposal: it classifies association rules according to qualitative criteria, using a domain model as background knowledge. We thus extend the classical numerical approach in an effort to combine data mining and semantic techniques for the post-mining and selection of association rules. We mined a corpus of molecular biology texts, present and compare the results of both approaches, and discuss the benefits of taking a domain knowledge model of the data into account.
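As an illustration of the classical numerical approach mentioned in the abstract, the following minimal sketch computes three widely used quality measures (support, confidence, lift) for a rule over a set of documents. The abstract does not say which measures the chapter uses, so this choice is an assumption; the function names and the toy documents and terms are hypothetical and not taken from the chapter's corpus.

```python
from typing import FrozenSet, List

Doc = FrozenSet[str]  # a document reduced to its set of index terms


def support(itemset: FrozenSet[str], docs: List[Doc]) -> float:
    """Fraction of documents that contain every term of `itemset`."""
    hits = sum(1 for d in docs if itemset <= d)
    return hits / len(docs)


def confidence(body: FrozenSet[str], head: FrozenSet[str], docs: List[Doc]) -> float:
    """Estimate of P(head | body) for the rule body -> head."""
    s_body = support(body, docs)
    return support(body | head, docs) / s_body if s_body else 0.0


def lift(body: FrozenSet[str], head: FrozenSet[str], docs: List[Doc]) -> float:
    """Confidence divided by the support of the head (1.0 means independence)."""
    s_head = support(head, docs)
    return confidence(body, head, docs) / s_head if s_head else 0.0


# Hypothetical toy corpus: four documents described by their terms.
docs = [
    frozenset({"mutation", "resistance", "gene"}),
    frozenset({"mutation", "resistance"}),
    frozenset({"mutation", "protein"}),
    frozenset({"gene", "protein"}),
]

body = frozenset({"mutation"})
head = frozenset({"resistance"})

rule_support = support(body | head, docs)
rule_confidence = confidence(body, head, docs)
rule_lift = lift(body, head, docs)
```

Such measures rank rules purely on term co-occurrence counts. They say nothing about whether a rule restates what experts already know, which is the gap the knowledge-based classification described in the abstract is meant to address.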
Main file
PM06Cherfi-final.pdf (307.48 KB)
Origin: Files produced by the author(s)

Dates and versions

inria-00437237, version 1 (30-11-2009)

Identifiers

  • HAL Id: inria-00437237, version 1

Cite

Hacène Cherfi, Amedeo Napoli, Yannick Toussaint. A Conformity Measure using Background Knowledge for Association Rules: Application to Text Mining. In Yanchang Zhao, Chengqi Zhang, Longbing Cao (eds.), Post-Mining of Association Rules: Techniques for Effective Knowledge Extraction, IGI Global, 2009, ISBN 978-1605664040. ⟨inria-00437237⟩
