When tens and even hundreds of schemas are involved in the integration process, criteria are needed for choosing clusters of schemas to be integrated, so as to deal with the integration problem through an efficient iterative process. Schemas in clusters should be chosen according to cohesion and coupling criteria that are based on similarities and dissimilarities among schemas. In this paper, we propose an algorithm for a novel variant of the correlation clustering approach that addresses the problem of assisting a designer in integrating a large number of conceptual schemas. The novel variant introduces upper and lower bounds to the number of schemas in each cluster, in order to avoid too complex and too simple integration contexts respectively. We give a heuristic for solving the problem, being an NP hard combinatorial problem. An experimental activity demonstrates an appreciable increment in the effectiveness of the schema integration process when clusters are computed by means of the proposed algorithm w.r.t. the ones manually defined by an expert.

Batini, C., Bonizzoni, P., Comerio, M., Dondi, R., Pirola, Y., Salandra, F. (2015). A Clustering Algorithm for Planning the Integration Process of a Large Number of Conceptual Schemas. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 30(1), 214-224 [10.1007/s11390-015-1514-5].

A Clustering Algorithm for Planning the Integration Process of a Large Number of Conceptual Schemas

Batini, C
;
Bonizzoni, P;Comerio, M;Pirola, Y;
2015

Abstract

When tens and even hundreds of schemas are involved in the integration process, criteria are needed for choosing clusters of schemas to be integrated, so as to deal with the integration problem through an efficient iterative process. Schemas in clusters should be chosen according to cohesion and coupling criteria that are based on similarities and dissimilarities among schemas. In this paper, we propose an algorithm for a novel variant of the correlation clustering approach that addresses the problem of assisting a designer in integrating a large number of conceptual schemas. The novel variant introduces upper and lower bounds to the number of schemas in each cluster, in order to avoid too complex and too simple integration contexts respectively. We give a heuristic for solving the problem, being an NP hard combinatorial problem. An experimental activity demonstrates an appreciable increment in the effectiveness of the schema integration process when clusters are computed by means of the proposed algorithm w.r.t. the ones manually defined by an expert.
Articolo in rivista - Articolo scientifico
clustering; conceptual schema; schema integration;
clustering; conceptual schema; schema integration
English
2015
30
1
214
224
none
Batini, C., Bonizzoni, P., Comerio, M., Dondi, R., Pirola, Y., Salandra, F. (2015). A Clustering Algorithm for Planning the Integration Process of a Large Number of Conceptual Schemas. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 30(1), 214-224 [10.1007/s11390-015-1514-5].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/75568
Citazioni
  • Scopus 4
  • ???jsp.display-item.citation.isi??? 2
Social impact