Bicocca Open Archive

Data augmentation is a fundamental technique in machine learning that plays a crucial role in expanding the size of training datasets. By applying various transformations or modifications to existing data, data augmentation enhances the generalization and robustness of machine learning models. In recent years, the development of several libraries has simplified the utilization of diverse data augmentation strategies across different tasks. This paper focuses on the exploration of the most widely adopted libraries specifically designed for data augmentation in computer vision tasks. Here, we aim to provide a comprehensive survey of publicly available data augmentation libraries, facilitating practitioners to navigate these resources effectively. Through a curated taxonomy, we present an organized classification of the different approaches employed by these libraries, along with accompanying application examples. By examining the techniques of each library, practitioners can make informed decisions in selecting the most suitable augmentation techniques for their computer vision projects. To ensure the accessibility of this valuable information, a dedicated public website named DALib has been created. This website serves as a centralized repository where the taxonomy, methods, and examples associated with the surveyed data augmentation libraries can be explored. By offering this comprehensive resource, we aim to empower practitioners and contribute to the advancement of computer vision research and applications through effective utilization of data augmentation techniques.

Amarù, S., Marelli, D., Ciocca, G., Schettini, R. (2023). DALib: A Curated Repository of Libraries for Data Augmentation in Computer Vision. JOURNAL OF IMAGING, 9(10) [10.3390/jimaging9100232].

DALib: A Curated Repository of Libraries for Data Augmentation in Computer Vision

Amarù, Sofia^Co-primo;Marelli, Davide^Co-primo;Ciocca, Gianluigi^Co-ultimo;Schettini, Raimondo^Co-ultimo

2023

Abstract

Data augmentation is a fundamental technique in machine learning that plays a crucial role in expanding the size of training datasets. By applying various transformations or modifications to existing data, data augmentation enhances the generalization and robustness of machine learning models. In recent years, the development of several libraries has simplified the utilization of diverse data augmentation strategies across different tasks. This paper focuses on the exploration of the most widely adopted libraries specifically designed for data augmentation in computer vision tasks. Here, we aim to provide a comprehensive survey of publicly available data augmentation libraries, facilitating practitioners to navigate these resources effectively. Through a curated taxonomy, we present an organized classification of the different approaches employed by these libraries, along with accompanying application examples. By examining the techniques of each library, practitioners can make informed decisions in selecting the most suitable augmentation techniques for their computer vision projects. To ensure the accessibility of this valuable information, a dedicated public website named DALib has been created. This website serves as a centralized repository where the taxonomy, methods, and examples associated with the surveyed data augmentation libraries can be explored. By offering this comprehensive resource, we aim to empower practitioners and contribute to the advancement of computer vision research and applications through effective utilization of data augmentation techniques.

Scheda breve

Scheda completa

Scheda completa (DC)

	Sottotipologia
	
				Articolo in rivista - Articolo scientifico
			
	Parole chiave
	
				computer vision; data augmentation; deep learning; libraries;
			
	Lingua del contenuto
	
				English
			
	Data ahead of print o Data prima pubblicazione Online
	
				17-ott-2023
			
	Data di pubblicazione
	
				2023
			
	Rivista
	
				JOURNAL OF IMAGING
			
	Numero del volume
	
				9
			
	Fascicolo
	
				10
			
	Article number
	
				232
			
	DOI dell'articolo
	
				https://dx.doi.org/10.3390/jimaging9100232
			
	Fulltext
	
				open
			
	Citazione
	
				Amarù, S., Marelli, D., Ciocca, G., Schettini, R. (2023). DALib: A Curated Repository of Libraries for Data Augmentation in Computer Vision. JOURNAL OF IMAGING, 9(10) [10.3390/jimaging9100232].
			
	Appare nelle tipologie:
	
				01 - Articolo su rivista

File in questo prodotto:

File	Dimensione	Formato
Amarù-2023-J Imag-VoR.pdf accesso aperto Descrizione: Review Tipologia di allegato: Publisher’s Version (Version of Record, VoR) Licenza: Creative Commons Dimensione 7.51 MB Formato Adobe PDF Visualizza/Apri	7.51 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/444538

Citazioni

5

1

Social impact