Bicocca Open Archive

In this work we present a large-scale comparison of 21 learning and aggregation methods proposed in the ensemble learning, social choice theory (SCT), information fusion and uncertainty management (IF-UM) and collective intelligence (CI) fields, based on a large collection of 40 benchmark datasets. The results of this comparison show that Bagging-based approaches reported performances comparable with XGBoost, and significantly outperformed other Boosting methods. In particular, ExtraTree-based approaches were as accurate as both XGBoost and Decision Tree-based ones while also being more computationally efficient. We also show how standard Bagging-based and IF-UM-inspired approaches outperformed the approaches based on CI and SCT. IF-UM-inspired approaches, in particular, reported the best performance (together with standard ExtraTrees), as well as the strongest resistance to label noise (together with XGBoost). Based on our results, we provide useful indications on the practical effectiveness of different state-of-the-art ensemble and aggregation methods in general settings.

Campagner, A., Ciucci, D., Cabitza, F. (2023). Aggregation models in ensemble learning: A large-scale comparison. INFORMATION FUSION, 90(February 2023), 241-252 [10.1016/j.inffus.2022.09.015].

Aggregation models in ensemble learning: A large-scale comparison

Campagner A.;Ciucci D.;Cabitza F.

2023

Abstract

In this work we present a large-scale comparison of 21 learning and aggregation methods proposed in the ensemble learning, social choice theory (SCT), information fusion and uncertainty management (IF-UM) and collective intelligence (CI) fields, based on a large collection of 40 benchmark datasets. The results of this comparison show that Bagging-based approaches reported performances comparable with XGBoost, and significantly outperformed other Boosting methods. In particular, ExtraTree-based approaches were as accurate as both XGBoost and Decision Tree-based ones while also being more computationally efficient. We also show how standard Bagging-based and IF-UM-inspired approaches outperformed the approaches based on CI and SCT. IF-UM-inspired approaches, in particular, reported the best performance (together with standard ExtraTrees), as well as the strongest resistance to label noise (together with XGBoost). Based on our results, we provide useful indications on the practical effectiveness of different state-of-the-art ensemble and aggregation methods in general settings.

Scheda breve

Scheda completa

Scheda completa (DC)

	Sottotipologia
	
				Articolo in rivista - Articolo scientifico
			
	Parole chiave
	
				Aggregation methods; Collective intelligence; Ensemble learning; Information fusion; Social choice theory; Uncertainty management;
			
	Lingua del contenuto
	
				English
			
	Data ahead of print o Data prima pubblicazione Online
	
				23-set-2022
			
	Data di pubblicazione
	
				2023
			
	Rivista
	
				INFORMATION FUSION
			
	Numero del volume
	
				90
			
	Fascicolo
	
				February 2023
			
	Pagina iniziale
	
				241
			
	Pagina finale
	
				252
			
	DOI dell'articolo
	
				https://dx.doi.org/10.1016/j.inffus.2022.09.015
			
	Fulltext
	
				reserved
			
	Citazione
	
				Campagner, A., Ciucci, D., Cabitza, F. (2023). Aggregation models in ensemble learning: A large-scale comparison. INFORMATION FUSION, 90(February 2023), 241-252 [10.1016/j.inffus.2022.09.015].
			
	Appare nelle tipologie:
	
				01 - Articolo su rivista

File in questo prodotto:

File	Dimensione	Formato
1-s2.0-S1566253522001476-main.pdf Solo gestori archivio Tipologia di allegato: Publisher’s Version (Version of Record, VoR) Licenza: Tutti i diritti riservati Dimensione 2.74 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	2.74 MB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/394398

Citazioni

49

44

Social impact