Data preparation has an important role in data analysis, and it is time and resource-consuming, both in terms of human and computational resources. The "Discount quality for responsible data science" project aims to focus on data-quality-based data preparation, analyzing the main characteristics of related tasks, and proposing methods for improving the sustainability of the data preparation tasks, considering also new emerging techniques based on generative AI. The paper discusses the main challenges that emerged in the initial research work in the project, as well as possible strategies for developing more sustainable data preparation frameworks.

Pernici, B., Cappiello, C., Ramalli, E., Palmonari, M., Belotti, F., De Paoli, F., et al. (2024). The Future of Sustainable Data Preparation. In Proceedings of the 32nd Symposium on Advanced Database Systems (pp.486-497). CEUR-WS.

The Future of Sustainable Data Preparation

Palmonari M.;Belotti F.;De Paoli F.;
2024

Abstract

Data preparation has an important role in data analysis, and it is time and resource-consuming, both in terms of human and computational resources. The "Discount quality for responsible data science" project aims to focus on data-quality-based data preparation, analyzing the main characteristics of related tasks, and proposing methods for improving the sustainability of the data preparation tasks, considering also new emerging techniques based on generative AI. The paper discusses the main challenges that emerged in the initial research work in the project, as well as possible strategies for developing more sustainable data preparation frameworks.
paper
data preparation; sustainability; data space; data ecosystem; data pipelines; large language model (LLM); user experinece
English
32nd Symposium on Advanced Database Systems - June 23rd to 26th, 2024
2024
Proceedings of the 32nd Symposium on Advanced Database Systems
2024
3741
486
497
https://ceur-ws.org/Vol-3741/
none
Pernici, B., Cappiello, C., Ramalli, E., Palmonari, M., Belotti, F., De Paoli, F., et al. (2024). The Future of Sustainable Data Preparation. In Proceedings of the 32nd Symposium on Advanced Database Systems (pp.486-497). CEUR-WS.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/525379
Citazioni
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
Social impact