Bicocca Open Archive

In an era where accumulating data is easy and storing it inexpensive, feature selection plays a central role in helping to reduce the high-dimensionality of huge amounts of otherwise meaningless data. In this paper, we propose a graph-based method for feature selection that ranks features by identifying the most important ones into arbitrary set of cues. Mapping the problem on an affinity graph - where features are the nodes - the solution is given by assessing the importance of nodes through some indicators of centrality, in particular, the Eigenvector Centrality (EC). The gist of EC is to estimate the importance of a feature as a function of the importance of its neighbors. Ranking central nodes individuates candidate features, which turn out to be effective from a classification point of view, as proved by a thoroughly experimental section. Our approach has been tested on 7 diverse datasets from recent literature (e.g., biological data, object recognition, among others), and compared against filter, embedded, and wrappers methods. The results are remarkable in terms of accuracy, stability and low execution time.

Roffo, G., Melzi, S. (2017). Ranking to Learn: Feature Ranking and Selection via Eigenvector Centrality. In New Frontiers in Mining Complex Patterns. NFMCP 2016 (pp.19-35). Springer [10.1007/978-3-319-61461-8_2].

Ranking to Learn: Feature Ranking and Selection via Eigenvector Centrality

Roffo Giorgio;Melzi Simone

2017

Abstract

In an era where accumulating data is easy and storing it inexpensive, feature selection plays a central role in helping to reduce the high-dimensionality of huge amounts of otherwise meaningless data. In this paper, we propose a graph-based method for feature selection that ranks features by identifying the most important ones into arbitrary set of cues. Mapping the problem on an affinity graph - where features are the nodes - the solution is given by assessing the importance of nodes through some indicators of centrality, in particular, the Eigenvector Centrality (EC). The gist of EC is to estimate the importance of a feature as a function of the importance of its neighbors. Ranking central nodes individuates candidate features, which turn out to be effective from a classification point of view, as proved by a thoroughly experimental section. Our approach has been tested on 7 diverse datasets from recent literature (e.g., biological data, object recognition, among others), and compared against filter, embedded, and wrappers methods. The results are remarkable in terms of accuracy, stability and low execution time.

Scheda breve

Scheda completa

Scheda completa (DC)

	Tipo di intervento
	
				paper
			
	Parole chiave
	
				Data mining; Feature selection; High dimensionality; Ranking;
			
	Lingua del contenuto
	
				English
			
	Nome del convegno
	
				5th International Workshop on New Frontiers in Mining Complex Patterns, NFMCP 2016 was held in conjunction with the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML-PKDD 2016 - 19 September 2016 through 19 September 2016
			
	Anno del convegno
	
				2016
			
	Curatori della monografia
	
				Appice, A; Ceci, M; Loglisci, C; Masciari, E; Raś, Z
			
	Titolo degli atti
	
				New Frontiers in Mining Complex Patterns. NFMCP 2016
			
	ISBN del volume degli atti
	
				978-3-319-61460-1
			
	Collana o serie
	
				LECTURE NOTES IN COMPUTER SCIENCE
			
	Data ahead of print o Data prima pubblicazione Online
	
				2-lug-2017
			
	Data di pubblicazione
	
				2017
			
	Numero del volume
	
				10312
			
	Pagina iniziale
	
				19
			
	Pagina finale
	
				35
			
	DOI dell'intervento
	
				https://dx.doi.org/10.1007/978-3-319-61461-8_2
			
	URL alternativo
	
				https://link.springer.com/book/10.1007/978-3-319-61461-8?page=1#toc
			
	Fulltext
	
				reserved
			
	Citazione
	
				Roffo, G., Melzi, S. (2017). Ranking to Learn: Feature Ranking and Selection via Eigenvector Centrality. In New Frontiers in Mining Complex Patterns. NFMCP 2016 (pp.19-35). Springer [10.1007/978-3-319-61461-8_2].
			
	Appare nelle tipologie:
	
				02 - Intervento a convegno

File in questo prodotto:

File	Dimensione	Formato
NFmcp2016_paper_13.pdf Solo gestori archivio Dimensione 630.25 kB Formato Adobe PDF Visualizza/Apri Richiedi una copia	630.25 kB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/350422

Citazioni

84

ND

Social impact