Accurate ground truth for semantic segmentation is difficult and expensive to obtain. At the same time, the most promising approaches to this task, Deep Convolutional Neural Networks (CNNs), require large volumes of labeled data. We propose a new deep-learning-based method for data augmentation in the context of semantic segmentation of highly textured images. The method uses a Generative Adversarial Network (GAN) to produce a semantic layout; a CNN-based texture synthesizer then generates a new image from the generated layout and a reference real image taken from the training set. Although the method is general and applicable to a broad set of problems, we apply it to the real-world problem of detecting and localizing defects and cracks in road asphalt. We show that, starting from a few labeled images, it is possible to augment small and long-tailed datasets by producing new images with their associated semantic layouts. We demonstrate the effectiveness of our approach by evaluating three different CNNs for semantic segmentation on the German Pavement Distress dataset and on a novel asphalt dataset that we collected. Results show a remarkable increase in performance, especially for low-cardinality classes, when the CNNs are trained on the augmented datasets rather than on the original ones.

Mazzini, D., Napoletano, P., Piccoli, F., Schettini, R. (2020). A Novel Approach to Data Augmentation for Pavement Distress Segmentation. COMPUTERS IN INDUSTRY, 121 [10.1016/j.compind.2020.103225].

A Novel Approach to Data Augmentation for Pavement Distress Segmentation

Mazzini D.; Napoletano P.; Piccoli F.; Schettini R.
2020

Type: Journal article - Scientific article
Keywords: Data augmentation; Image generation
Language: English
Date: 5 Jun 2020
Year: 2020
Volume: 121
Article number: 103225
Access: reserved
Files in this item:
File: Mazzini-2020-Computers in Industry-VoR.pdf
Access: Archive managers only
Attachment type: Publisher’s Version (Version of Record, VoR)
License: All rights reserved
Size: 5.2 MB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/10281/278481
Citations
  • Scopus 40
  • Web of Science 37