Bicocca Open Archive

Semantic segmentation consists of classifying each pixel of an image and constitutes an essential step towards scene recognition and understanding. Deep convolutional encoder–decoder neural networks now constitute state-of-the-art methods in the field of semantic segmentation. The problem of street scenes’ segmentation for automotive applications constitutes an important application field of such networks and introduces a set of imperative exigencies. Since the models need to be executed on self-driving vehicles to make fast decisions in response to a constantly changing environment, they are not only expected to operate reliably but also to process the input images rapidly. In this paper, we explore genetic programming (GP) as a meta-model that combines four different efficiency-oriented networks for the analysis of urban scenes. Notably, we present and examine two approaches. In the first approach, we represent solutions as GP trees that combine networks’ outputs such that each output class’s prediction is obtained through the same meta-model. In the second approach, we propose representing solutions as lists of GP trees, each designed to provide a unique meta-model for a given target class. The main objective is to develop efficient and accurate combination models that could be easily interpreted, therefore allowing gathering some hints on how to improve the existing networks. The experiments performed on the Cityscapes dataset of urban scene images with semantic pixel-wise annotations confirm the effectiveness of the proposed approach. Specifically, our best-performing models improve systems’ generalization ability by approximately 5% compared to traditional ensembles, 30% for the less performing state-of-the-art CNN and show competitive results with respect to state-of-the-art ensembles. Additionally, they are small in size, allow interpretability, and use fewer features due to GP’s automatic feature selection.

Bakurov, I., Buzzelli, M., Schettini, R., Castelli, M., Vanneschi, L. (2023). Semantic segmentation network stacking with genetic programming. GENETIC PROGRAMMING AND EVOLVABLE MACHINES, 24(2), 1-37 [10.1007/s10710-023-09464-0].

Semantic segmentation network stacking with genetic programming

Bakurov I.;Buzzelli M.;Schettini R.;Castelli M.;Vanneschi L.

2023

Abstract

Semantic segmentation consists of classifying each pixel of an image and constitutes an essential step towards scene recognition and understanding. Deep convolutional encoder–decoder neural networks now constitute state-of-the-art methods in the field of semantic segmentation. The problem of street scenes’ segmentation for automotive applications constitutes an important application field of such networks and introduces a set of imperative exigencies. Since the models need to be executed on self-driving vehicles to make fast decisions in response to a constantly changing environment, they are not only expected to operate reliably but also to process the input images rapidly. In this paper, we explore genetic programming (GP) as a meta-model that combines four different efficiency-oriented networks for the analysis of urban scenes. Notably, we present and examine two approaches. In the first approach, we represent solutions as GP trees that combine networks’ outputs such that each output class’s prediction is obtained through the same meta-model. In the second approach, we propose representing solutions as lists of GP trees, each designed to provide a unique meta-model for a given target class. The main objective is to develop efficient and accurate combination models that could be easily interpreted, therefore allowing gathering some hints on how to improve the existing networks. The experiments performed on the Cityscapes dataset of urban scene images with semantic pixel-wise annotations confirm the effectiveness of the proposed approach. Specifically, our best-performing models improve systems’ generalization ability by approximately 5% compared to traditional ensembles, 30% for the less performing state-of-the-art CNN and show competitive results with respect to state-of-the-art ensembles. Additionally, they are small in size, allow interpretability, and use fewer features due to GP’s automatic feature selection.

Scheda breve

Scheda completa

Scheda completa (DC)

	Sottotipologia
	
				Articolo in rivista - Articolo scientifico
			
	Parole chiave
	
				Deep learning; Ensemble learning; Genetic programming; Semantic segmentation; Stacking;
			
	Lingua del contenuto
	
				English
			
	Data ahead of print o Data prima pubblicazione Online
	
				26-ott-2023
			
	Data di pubblicazione
	
				2023
			
	Rivista
	
				GENETIC PROGRAMMING AND EVOLVABLE MACHINES
			
	Numero del volume
	
				24
			
	Fascicolo
	
				2
			
	Pagina iniziale
	
				1
			
	Pagina finale
	
				37
			
	Article number
	
				15
			
	DOI dell'articolo
	
				https://dx.doi.org/10.1007/s10710-023-09464-0
			
	URL alternativo
	
				https://link.springer.com/article/10.1007/s10710-023-09464-0
			
	Fulltext
	
				open
			
	Citazione
	
				Bakurov, I., Buzzelli, M., Schettini, R., Castelli, M., Vanneschi, L. (2023). Semantic segmentation network stacking with genetic programming. GENETIC PROGRAMMING AND EVOLVABLE MACHINES, 24(2), 1-37 [10.1007/s10710-023-09464-0].
			
	Appare nelle tipologie:
	
				01 - Articolo su rivista

File in questo prodotto:

File	Dimensione	Formato
Bakurov-2023-Gen Programm Evol Mach-VoR.pdf accesso aperto Tipologia di allegato: Publisher’s Version (Version of Record, VoR) Licenza: Creative Commons Dimensione 3.58 MB Formato Adobe PDF Visualizza/Apri	3.58 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/456869

Citazioni

1

1

Social impact