The last years have seen a steep rise in data generation worldwide, with the development and widespread adoption of several software projects targeting the Big Data paradigm. Many companies currently engage in Big Data analytics as part of their core business activities, nonetheless there are no tools or techniques to support the design of the underlying infrastructure configuration backing such systems. In particular, the focus in this paper is set on Cloud deployed clusters, which represent a cost-effective alternative to on premises installations. We propose a novel tool implementing a battery of optimization and prediction techniques integrated so as to efficiently assess several alternative resource configurations, in order to determine the minimum cost cluster deployment satisfying Quality of Service constraints. Further, the experimental campaign conducted on real systems shows the validity and relevance of the proposed method.

Ciavotta, M., Gianniti, E., Ardagna, D. (2016). D-SPACE4Cloud: A design tool for big data applications. In 16th International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 2016; Granada; Spain; 14 December 2016 through 16 December 2016 (pp.614-629). Springer Verlag [10.1007/978-3-319-49583-5_48].

D-SPACE4Cloud: A design tool for big data applications

Ciavotta, M;
2016

Abstract

The last years have seen a steep rise in data generation worldwide, with the development and widespread adoption of several software projects targeting the Big Data paradigm. Many companies currently engage in Big Data analytics as part of their core business activities, nonetheless there are no tools or techniques to support the design of the underlying infrastructure configuration backing such systems. In particular, the focus in this paper is set on Cloud deployed clusters, which represent a cost-effective alternative to on premises installations. We propose a novel tool implementing a battery of optimization and prediction techniques integrated so as to efficiently assess several alternative resource configurations, in order to determine the minimum cost cluster deployment satisfying Quality of Service constraints. Further, the experimental campaign conducted on real systems shows the validity and relevance of the proposed method.
paper
MapReduce; Optimization; Queueing networks; Theoretical Computer Science; Computer Science (all)
English
International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 2016 14-16 December
2016
16th International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 2016; Granada; Spain; 14 December 2016 through 16 December 2016
9783319495828
2016
10048
614
629
http://springerlink.com/content/0302-9743/copyright/2005/
reserved
Ciavotta, M., Gianniti, E., Ardagna, D. (2016). D-SPACE4Cloud: A design tool for big data applications. In 16th International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 2016; Granada; Spain; 14 December 2016 through 16 December 2016 (pp.614-629). Springer Verlag [10.1007/978-3-319-49583-5_48].
File in questo prodotto:
File Dimensione Formato  
ICA3PP16-Mic.pdf

Solo gestori archivio

Dimensione 4.06 MB
Formato Adobe PDF
4.06 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/186637
Citazioni
  • Scopus 3
  • ???jsp.display-item.citation.isi??? 2
Social impact