Bicocca Open Archive

Accurate semantic segmentation ground-truths are difficult and expensive to obtain. On the other hand, the most promising approaches to automatically tackle this task, i.e. Deep Convolutional Neural Networks (CNNs), require high volumes of labeled data. We propose a new method based on deep learning for data augmentation in the context of semantic segmentation of highly-textured images. The method exploits a Generative Adversarial Network (GAN) to produce a semantic layout, then a texture synthesizer, based on a CNN, generates a new image according to the generated semantic layout and a reference real image taken from the training set. Even though our method is general and it can be utilized on a broad set of problems, we employed it on the real-world problem of detecting and localizing defects and cracks in road asphalts. We show how, starting from few labeled images, it is possible to augment small and long-tail datasets by producing new images with the associated semantic layouts. We prove the effectiveness of our approach by evaluating the performance of three different CNNs for semantic segmentation on the German Pavement Distress dataset and on a novel asphalt dataset collected by us. Results show a remarkable increase in performance, especially with low cardinality classes, when CNNs are trained on the augmented datasets with respect to original datasets.

Mazzini, D., Napoletano, P., Piccoli, F., Schettini, R. (2020). A Novel Approach to Data Augmentation for Pavement Distress Segmentation. COMPUTERS IN INDUSTRY, 121 [10.1016/j.compind.2020.103225].

A Novel Approach to Data Augmentation for Pavement Distress Segmentation

Mazzini D.;Napoletano P.;Piccoli F.;Schettini R.

2020

Abstract

Accurate semantic segmentation ground-truths are difficult and expensive to obtain. On the other hand, the most promising approaches to automatically tackle this task, i.e. Deep Convolutional Neural Networks (CNNs), require high volumes of labeled data. We propose a new method based on deep learning for data augmentation in the context of semantic segmentation of highly-textured images. The method exploits a Generative Adversarial Network (GAN) to produce a semantic layout, then a texture synthesizer, based on a CNN, generates a new image according to the generated semantic layout and a reference real image taken from the training set. Even though our method is general and it can be utilized on a broad set of problems, we employed it on the real-world problem of detecting and localizing defects and cracks in road asphalts. We show how, starting from few labeled images, it is possible to augment small and long-tail datasets by producing new images with the associated semantic layouts. We prove the effectiveness of our approach by evaluating the performance of three different CNNs for semantic segmentation on the German Pavement Distress dataset and on a novel asphalt dataset collected by us. Results show a remarkable increase in performance, especially with low cardinality classes, when CNNs are trained on the augmented datasets with respect to original datasets.

Scheda breve

Scheda completa

Scheda completa (DC)

	Sottotipologia
	
				Articolo in rivista - Articolo scientifico
			
	Parole chiave
	
				Data augmentation; Image generation;
			
	Lingua del contenuto
	
				English
			
	Data ahead of print o Data prima pubblicazione Online
	
				5-giu-2020
			
	Data di pubblicazione
	
				2020
			
	Rivista
	
				COMPUTERS IN INDUSTRY
			
	Numero del volume
	
				121
			
	Article number
	
				103225
			
	DOI dell'articolo
	
				https://dx.doi.org/10.1016/j.compind.2020.103225
			
	Fulltext
	
				reserved
			
	Citazione
	
				Mazzini, D., Napoletano, P., Piccoli, F., Schettini, R. (2020). A Novel Approach to Data Augmentation for Pavement Distress Segmentation. COMPUTERS IN INDUSTRY, 121 [10.1016/j.compind.2020.103225].
			
	Appare nelle tipologie:
	
				01 - Articolo su rivista

File in questo prodotto:

File	Dimensione	Formato
Mazzini-2020-Computers in Industry-VoR.pdf Solo gestori archivio Tipologia di allegato: Publisher’s Version (Version of Record, VoR) Licenza: Tutti i diritti riservati Dimensione 5.2 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	5.2 MB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/278481

Citazioni

46

42

Social impact