Bicocca Open Archive

Semantic segmentation architectures are mainly built upon an encoder-decoder structure. These models perform successive downsampling operations in the encoder. Since operations on high-resolution activation maps are computationally expensive, the decoder usually produces output segmentation maps by upsampling with parameter-free operators such as bilinear or nearest-neighbor interpolation. We propose a neural network named Guided Upsampling Network, which consists of a multiresolution architecture that jointly exploits high-resolution and large-context information. We then introduce a new module named Guided Upsampling Module (GUM) that enriches upsampling operators by introducing a learnable transformation for semantic maps. It can be plugged into any existing encoder-decoder architecture with minor modifications and low additional computational cost. We show with quantitative and qualitative experiments how our network benefits from the use of GUM. A comprehensive set of experiments on the publicly available Cityscapes dataset demonstrates that Guided Upsampling Network can efficiently process high-resolution images in real time while attaining state-of-the-art performance.
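To make the contrast in the abstract concrete, the following is a minimal sketch, not the paper's implementation. It compares parameter-free nearest-neighbor upsampling with a variant whose sampling position is shifted per output pixel. The function names and the `offsets` table are hypothetical and introduced only for illustration; the abstract says only that GUM adds a learnable transformation, so the assumption here is that the transformation takes the form of per-pixel offsets, which are passed in by hand rather than learned.

```python
import math


def upsample_nearest(low_res, factor):
    """Parameter-free nearest-neighbor upsampling of a 2-D label map."""
    h, w = len(low_res), len(low_res[0])
    return [
        [low_res[y // factor][x // factor] for x in range(w * factor)]
        for y in range(h * factor)
    ]


def upsample_guided(low_res, factor, offsets):
    """Nearest-neighbor upsampling with a per-pixel shift of the sampling point.

    offsets[y][x] is a hypothetical (dy, dx) pair expressed in low-resolution
    pixel units. In a trained model such values would be learned; here they
    are supplied by the caller (an assumption made for this sketch).
    """
    h, w = len(low_res), len(low_res[0])
    out = []
    for y in range(h * factor):
        row = []
        for x in range(w * factor):
            dy, dx = offsets[y][x]
            # Shift the sampling coordinate, then clamp it to the map bounds.
            sy = min(max(math.floor(y / factor + dy), 0), h - 1)
            sx = min(max(math.floor(x / factor + dx), 0), w - 1)
            row.append(low_res[sy][sx])
        out.append(row)
    return out


# A 2x2 map of class labels, upsampled by a factor of 2.
low = [[1, 2], [3, 4]]
zero_offsets = [[(0.0, 0.0)] * 4 for _ in range(4)]
plain = upsample_nearest(low, 2)
guided = upsample_guided(low, 2, zero_offsets)
```

With all offsets set to zero the guided variant reduces to plain nearest-neighbor upsampling; non-zero offsets let an output pixel take its label from a neighboring low-resolution cell, which is one way a learned component could sharpen object boundaries.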

Mazzini, D. (2018). Guided Upsampling Network for Real-Time Semantic Segmentation. In British Machine Vision Conference 2018, BMVC 2018. BMVA Press.

Guided Upsampling Network for Real-Time Semantic Segmentation

Mazzini, D

2018



Contribution type: paper
Keywords: CNN, Deep Learning, Computer Vision, Pattern Recognition, Semantic Segmentation
Content language: English
Conference name: 29th British Machine Vision Conference, BMVC 2018, 3-6 September 2018
Conference year: 2018
Proceedings title: British Machine Vision Conference 2018, BMVC 2018
Publication date: 2018
Alternative URL: http://bmvc2018.org/contents/papers/0423.pdf
Full text: none
Citation: Mazzini, D. (2018). Guided Upsampling Network for Real-Time Semantic Segmentation. In British Machine Vision Conference 2018, BMVC 2018. BMVA Press.
Appears in types: 02 - Conference contribution


Use this identifier to cite or link to this document: https://hdl.handle.net/10281/231637
