Feature selection, defined as the automatic selection of the most relevant features from a machine learning dataset, has three objectives: to improve the prediction performance of the predictors, to provide faster and more cost-effective predictors, and to offer a better understanding of the underlying process that generated the data. In the case of dimensional (e.g., temporal or spatial) data, feature selection is often approached using standard methods that neglect or denature the dimensional component. This paper provides a first step towards systematic and general dimensional feature selection, with a portfolio of supervised and unsupervised, filter-based selectors that can be naturally combined into an end-to-end methodology. In a hypothesis-testing setting, our experiments show that our approach can extract provably relevant features in both the temporal and spatial cases.

Cavina, P., Manzella, F., Pagliarini, G., Sciavicco, G., Stan, I. (2023). (Un)supervised Univariate Feature Extraction and Selection for Dimensional Data. In Proceedings of the 2nd Italian Conference on Big Data and Data Science (ITADATA 2023). CEUR-WS.

(Un)supervised Univariate Feature Extraction and Selection for Dimensional Data

Stan, IE
2023

Abstract

Feature selection, defined as the automatic selection of the most relevant features from a machine learning dataset, has three objectives: to improve the prediction performance of the predictors, to provide faster and more cost-effective predictors, and to offer a better understanding of the underlying process that generated the data. In the case of dimensional (e.g., temporal or spatial) data, feature selection is often approached using standard methods that neglect or denature the dimensional component. This paper provides a first step towards systematic and general dimensional feature selection, with a portfolio of supervised and unsupervised, filter-based selectors that can be naturally combined into an end-to-end methodology. In a hypothesis-testing setting, our experiments show that our approach can extract provably relevant features in both the temporal and spatial cases.
paper
Dimensional data; Feature selection; Filter-based selection; Non-tabular data;
English
2nd Italian Conference on Big Data and Data Science (ITADATA 2023) - September 11-13, 2023
2023
Bena, N; Di Martino, B; Maratea, A; Sperduti, A; Di Nardo, E; Ciaramella, A; Montella, R; Ardagna, CA
Proceedings of the 2nd Italian Conference on Big Data and Data Science (ITADATA 2023)
2023
3606
https://ceur-ws.org/Vol-3606/
none
Cavina, P., Manzella, F., Pagliarini, G., Sciavicco, G., Stan, I. (2023). (Un)supervised Univariate Feature Extraction and Selection for Dimensional Data. In Proceedings of the 2nd Italian Conference on Big Data and Data Science (ITADATA 2023). CEUR-WS.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/524126
Citazioni
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
Social impact