Data augmentation is a fundamental technique in machine learning that plays a crucial role in expanding the size of training datasets. By applying various transformations or modifications to existing data, data augmentation enhances the generalization and robustness of machine learning models. In recent years, the development of several libraries has simplified the utilization of diverse data augmentation strategies across different tasks. This paper focuses on the exploration of the most widely adopted libraries specifically designed for data augmentation in computer vision tasks. Here, we aim to provide a comprehensive survey of publicly available data augmentation libraries, facilitating practitioners to navigate these resources effectively. Through a curated taxonomy, we present an organized classification of the different approaches employed by these libraries, along with accompanying application examples. By examining the techniques of each library, practitioners can make informed decisions in selecting the most suitable augmentation techniques for their computer vision projects. To ensure the accessibility of this valuable information, a dedicated public website named DALib has been created. This website serves as a centralized repository where the taxonomy, methods, and examples associated with the surveyed data augmentation libraries can be explored. By offering this comprehensive resource, we aim to empower practitioners and contribute to the advancement of computer vision research and applications through effective utilization of data augmentation techniques.

Amarù, S., Marelli, D., Ciocca, G., Schettini, R. (2023). DALib: A Curated Repository of Libraries for Data Augmentation in Computer Vision. JOURNAL OF IMAGING, 9(10) [10.3390/jimaging9100232].

DALib: A Curated Repository of Libraries for Data Augmentation in Computer Vision

Marelli, Davide
Co-primo
;
Ciocca, Gianluigi
Co-ultimo
;
Schettini, Raimondo
Co-ultimo
2023

Abstract

Data augmentation is a fundamental technique in machine learning that plays a crucial role in expanding the size of training datasets. By applying various transformations or modifications to existing data, data augmentation enhances the generalization and robustness of machine learning models. In recent years, the development of several libraries has simplified the utilization of diverse data augmentation strategies across different tasks. This paper focuses on the exploration of the most widely adopted libraries specifically designed for data augmentation in computer vision tasks. Here, we aim to provide a comprehensive survey of publicly available data augmentation libraries, facilitating practitioners to navigate these resources effectively. Through a curated taxonomy, we present an organized classification of the different approaches employed by these libraries, along with accompanying application examples. By examining the techniques of each library, practitioners can make informed decisions in selecting the most suitable augmentation techniques for their computer vision projects. To ensure the accessibility of this valuable information, a dedicated public website named DALib has been created. This website serves as a centralized repository where the taxonomy, methods, and examples associated with the surveyed data augmentation libraries can be explored. By offering this comprehensive resource, we aim to empower practitioners and contribute to the advancement of computer vision research and applications through effective utilization of data augmentation techniques.
Articolo in rivista - Articolo scientifico
computer vision; data augmentation; deep learning; libraries;
English
17-ott-2023
2023
9
10
232
open
Amarù, S., Marelli, D., Ciocca, G., Schettini, R. (2023). DALib: A Curated Repository of Libraries for Data Augmentation in Computer Vision. JOURNAL OF IMAGING, 9(10) [10.3390/jimaging9100232].
File in questo prodotto:
File Dimensione Formato  
Amarù-2023-J Imag-VoR.pdf

accesso aperto

Descrizione: Review
Tipologia di allegato: Publisher’s Version (Version of Record, VoR)
Licenza: Creative Commons
Dimensione 7.51 MB
Formato Adobe PDF
7.51 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/444538
Citazioni
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
Social impact