This paper presents a subsystem of a comprehensive platform dedicated to data transformation, linking and extension of large data sets. Furthermore, we detail and discuss both the main requirements that have led to the design and development of the platform, and the devised approach, which is a direct outcome of the requirement elicitation and discussion phase. In particular, the platform supports both design and run time aspects of the data transformation process, which is reflected in the architecture. Some initial tests have been carried out on a prototype implementation of our architecture on data sets of ~1TB featuring promising performance.

Nikolov, N., Ciavotta, M., De Paoli, F. (2018). Data wrangling at scale: The experience of EW-Shopp. In ACM International Conference Proceeding Series (pp.1-4). 1515 BROADWAY, NEW YORK, NY 10036-9998 USA : Association for Computing Machinery [10.1145/3241403.3241437].

Data wrangling at scale: The experience of EW-Shopp

Ciavotta, M;De Paoli, F
2018

Abstract

This paper presents a subsystem of a comprehensive platform dedicated to data transformation, linking and extension of large data sets. Furthermore, we detail and discuss both the main requirements that have led to the design and development of the platform, and the devised approach, which is a direct outcome of the requirement elicitation and discussion phase. In particular, the platform supports both design and run time aspects of the data transformation process, which is reflected in the architecture. Some initial tests have been carried out on a prototype implementation of our architecture on data sets of ~1TB featuring promising performance.
paper
Big Data Processing; Data Enrichment; Data Extension; Data Integration; Data Wrangling; Linked Data;
Big Data Processing; Data Enrichment; Data Extension; Data Integration; Data Wrangling; Linked Data; Human-Computer Interaction; Computer Networks and Communications; 1707; Software
English
12th European Conference on Software Architecture, ECSA 2018
2018
ACM International Conference Proceeding Series
9781450364836
2018
1
4
a32
http://portal.acm.org/
partially_open
Nikolov, N., Ciavotta, M., De Paoli, F. (2018). Data wrangling at scale: The experience of EW-Shopp. In ACM International Conference Proceeding Series (pp.1-4). 1515 BROADWAY, NEW YORK, NY 10036-9998 USA : Association for Computing Machinery [10.1145/3241403.3241437].
File in questo prodotto:
File Dimensione Formato  
SACBD2018.pdf

accesso aperto

Tipologia di allegato: Submitted Version (Pre-print)
Dimensione 828.44 kB
Formato Adobe PDF
828.44 kB Adobe PDF Visualizza/Apri
ECSA2018.pdf

Solo gestori archivio

Tipologia di allegato: Publisher’s Version (Version of Record, VoR)
Dimensione 349 kB
Formato Adobe PDF
349 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/219493
Citazioni
  • Scopus 1
  • ???jsp.display-item.citation.isi??? 0
Social impact