Transfer Learning (TL) encompasses a number of Machine Learning Techniques that take a pre-trained model aimed at solving a task in a Source Domain and try to reuse it to improve the performance of a related task in a Target Domain An important issue in TL is that the effectiveness of those techniques is strongly dataset-dependent. In this work, we investigate the possible structural causes of the varying performance of Heterogeneous Transfer Learning (HTL) across domains characterized by different, but overlapping feature sets (this naturally determine a partition of the features into Source Domain specific subset, Target Domain specific subset, and shared subset). To this purpose, we use the Partial Information Decomposition (PID) framework, which breaks down the multivariate information that input variables hold about an output variable into three kinds of components: Unique, Synergistic, and Redundant. We consider that each domain can hold the PID components in implicit form: this restricts the information directly accessible to each domain. Based on the relative PID structure of the above mentioned feature subsets, the framework is able to tell, in principle: 1) which kind of information components are lost in passing from one domain to the other, 2) which kind of information components are at least implicitly available to a domain, and 3) what kind information components could be recovered through the bridge of the shared features. We show an example of a bridging scenario based on synthetic data.

Gianini, G., Barsotti, A., Mio, C., Lin, J. (2024). Heterogeneous Transfer Learning from a Partial Information Decomposition Perspective. In Management of Digital EcoSystems - 15th International Conference, MEDES 2023, Heraklion, Crete, Greece, May 5–7, 2023, Revised Selected Papers (pp.133-146). Springer Science and Business Media Deutschland GmbH [10.1007/978-3-031-51643-6_10].

Heterogeneous Transfer Learning from a Partial Information Decomposition Perspective

Gianini, Gabriele
Primo
;
2024

Abstract

Transfer Learning (TL) encompasses a number of Machine Learning Techniques that take a pre-trained model aimed at solving a task in a Source Domain and try to reuse it to improve the performance of a related task in a Target Domain An important issue in TL is that the effectiveness of those techniques is strongly dataset-dependent. In this work, we investigate the possible structural causes of the varying performance of Heterogeneous Transfer Learning (HTL) across domains characterized by different, but overlapping feature sets (this naturally determine a partition of the features into Source Domain specific subset, Target Domain specific subset, and shared subset). To this purpose, we use the Partial Information Decomposition (PID) framework, which breaks down the multivariate information that input variables hold about an output variable into three kinds of components: Unique, Synergistic, and Redundant. We consider that each domain can hold the PID components in implicit form: this restricts the information directly accessible to each domain. Based on the relative PID structure of the above mentioned feature subsets, the framework is able to tell, in principle: 1) which kind of information components are lost in passing from one domain to the other, 2) which kind of information components are at least implicitly available to a domain, and 3) what kind information components could be recovered through the bridge of the shared features. We show an example of a bridging scenario based on synthetic data.
paper
Heterogeneous Transfer Learning; Partial Information Decomposition; Transferable Information Components;
English
15th International Conference, MEDES 2023 - May 5–7, 2023
2023
Management of Digital EcoSystems - 15th International Conference, MEDES 2023, Heraklion, Crete, Greece, May 5–7, 2023, Revised Selected Papers
9783031516429
2-feb-2024
2024
2022 CCIS
133
146
https://link.springer.com/chapter/10.1007/978-3-031-51643-6_10
partially_open
Gianini, G., Barsotti, A., Mio, C., Lin, J. (2024). Heterogeneous Transfer Learning from a Partial Information Decomposition Perspective. In Management of Digital EcoSystems - 15th International Conference, MEDES 2023, Heraklion, Crete, Greece, May 5–7, 2023, Revised Selected Papers (pp.133-146). Springer Science and Business Media Deutschland GmbH [10.1007/978-3-031-51643-6_10].
File in questo prodotto:
File Dimensione Formato  
Gianini-2024-MEDES2023-VoR.pdf

Solo gestori archivio

Descrizione: pubblicato
Tipologia di allegato: Publisher’s Version (Version of Record, VoR)
Licenza: Tutti i diritti riservati
Dimensione 2.11 MB
Formato Adobe PDF
2.11 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
Gianini-2024-MEDES2023-preprint.pdf

accesso aperto

Descrizione: preprint
Tipologia di allegato: Submitted Version (Pre-print)
Licenza: Altro
Dimensione 12.81 MB
Formato Adobe PDF
12.81 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/460698
Citazioni
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
Social impact