The last years witnessed a steep rise in data generation worldwide and, consequently, the widespread adoption of software solutions able to support data-intensive applications. Competitiveness and innovation have strongly benefited from these new platforms and methodologies, and there is a great deal of interest around the new possibilities that Big Data analytics promise to make reality. Many companies currently engage in data-intensive processes as part of their core businesses; however, fully embracing the data-driven paradigm is still cumbersome, and establishing a production-ready, fine-tuned deployment is time-consuming, expensive, and resource-intensive. This situation calls for innovative models and techniques to streamline the process of deployment configuration for Big Data applications. In particular, the focus in this paper is on the rightsizing of Cloud deployed clusters, which represent a cost-eective alternative to installation on premises. This paper proposes a novel tool, integrated in a wider DevOps-inspired approach, implementing a parallel and distributed simulation-optimization technique that eciently and eectively explores the space of alternative Cloud configurations, seeking the minimum cost deployment that satisfies quality of service constraints. The soundness of the proposed solution has been thoroughly validated in a vast experimental campaign encompassing dierent applications and Big Data platforms.

Gianniti, E., Ciavotta, M., Ardagna, D. (2021). Optimizing Quality-Aware Big Data Applications in the Cloud. IEEE TRANSACTIONS ON CLOUD COMPUTING, 9(2), 737-752 [10.1109/TCC.2018.2874944].

Optimizing Quality-Aware Big Data Applications in the Cloud

Ciavotta, M;
2021

Abstract

The last years witnessed a steep rise in data generation worldwide and, consequently, the widespread adoption of software solutions able to support data-intensive applications. Competitiveness and innovation have strongly benefited from these new platforms and methodologies, and there is a great deal of interest around the new possibilities that Big Data analytics promise to make reality. Many companies currently engage in data-intensive processes as part of their core businesses; however, fully embracing the data-driven paradigm is still cumbersome, and establishing a production-ready, fine-tuned deployment is time-consuming, expensive, and resource-intensive. This situation calls for innovative models and techniques to streamline the process of deployment configuration for Big Data applications. In particular, the focus in this paper is on the rightsizing of Cloud deployed clusters, which represent a cost-eective alternative to installation on premises. This paper proposes a novel tool, integrated in a wider DevOps-inspired approach, implementing a parallel and distributed simulation-optimization technique that eciently and eectively explores the space of alternative Cloud configurations, seeking the minimum cost deployment that satisfies quality of service constraints. The soundness of the proposed solution has been thoroughly validated in a vast experimental campaign encompassing dierent applications and Big Data platforms.
Articolo in rivista - Articolo scientifico
distributed systems; Nonlinear programming; performance of systems;
English
9-ott-2018
2021
9
2
737
752
8486649
partially_open
Gianniti, E., Ciavotta, M., Ardagna, D. (2021). Optimizing Quality-Aware Big Data Applications in the Cloud. IEEE TRANSACTIONS ON CLOUD COMPUTING, 9(2), 737-752 [10.1109/TCC.2018.2874944].
File in questo prodotto:
File Dimensione Formato  
main.pdf

accesso aperto

Tipologia di allegato: Submitted Version (Pre-print)
Dimensione 7.89 MB
Formato Adobe PDF
7.89 MB Adobe PDF Visualizza/Apri
08486649.pdf

Solo gestori archivio

Tipologia di allegato: Publisher’s Version (Version of Record, VoR)
Dimensione 6.87 MB
Formato Adobe PDF
6.87 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/219473
Citazioni
  • Scopus 6
  • ???jsp.display-item.citation.isi??? 5
Social impact