A Novel Approach to Feature Selection Based on Quality Estimation Metrics - INRIA - Institut National de Recherche en Informatique et en Automatique Accéder directement au contenu
Chapitre D'ouvrage Année : 2016

A Novel Approach to Feature Selection Based on Quality Estimation Metrics

Résumé

Feature maximization (F-max) is an unbiased quality estimation metric of unsupervised classification (clustering) that favours clusters with a maximal feature F-measure value. In this article we show that an adaptation of this metric within the framework of supervised classification allows efficient feature selection and feature contrasting to be performed. We experiment the method on different types of textual data. In this context, we demonstrate that this technique significantly improves the performance of classification methods as compared with the use of state-of-the art feature selection techniques, notably in the case of the classification of unbalanced, highly multidimensional and noisy textual data gathered in similar classes.
Fichier non déposé

Dates et versions

hal-03181624 , version 1 (25-03-2021)

Licence

Paternité

Identifiants

Citer

Jean-Charles Lamirel, Pascal Cuxac, Kafil Hajlaoui. A Novel Approach to Feature Selection Based on Quality Estimation Metrics. Fabrice Guillet; Bruno Pinaud; Gilles Venturini; Djamel Abdelkader Zighed. Advances in Knowledge Discovery and Management, 615, Springer, pp.121-140, 2016, Studies in Computational Intelligence, 978-3-319-23751-0. ⟨10.1007/978-3-319-45763-5_7⟩. ⟨hal-03181624⟩
33 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More