Bicocca Open Archive

Explainable Artificial Intelligence (XAI) is a set of techniques that allows the understanding of both technical and non-technical aspects of Artificial Intelligence (AI) systems. XAI is crucial to help satisfying the increasingly important demand of trustworthy Artificial Intelligence, characterized by fundamental aspects such as respect of human autonomy, prevention of harm, transparency, accountability, etc. Within XAI techniques, counterfactual explanations aim to provide to end users a set of features (and their corresponding values) that need to be changed in order to achieve a desired outcome. Current approaches rarely take into account the feasibility of actions needed to achieve the proposed explanations, and in particular, they fall short of considering the causal impact of such actions. In this paper, we present Counterfactual Explanations as Interventions in Latent Space (CEILS), a methodology to generate counterfactual explanations capturing by design the underlying causal relations from the data, and at the same time to provide feasible recommendations to reach the proposed profile. Moreover, our methodology has the advantage that it can be set on top of existing counterfactuals generator algorithms, thus minimising the complexity of imposing additional causal constrains. We demonstrate the effectiveness of our approach with a set of different experiments using synthetic and real datasets (including a proprietary dataset of the financial domain).

Crupi, R., Castelnovo, A., Regoli, D., Gonzalez, B. (2024). Counterfactual explanations as interventions in latent space. DATA MINING AND KNOWLEDGE DISCOVERY, 38(5), 2733-2769 [10.1007/s10618-022-00889-2].

Counterfactual explanations as interventions in latent space

Crupi, R;Castelnovo, A;Regoli, D;Gonzalez, BS

2024

Abstract

Explainable Artificial Intelligence (XAI) is a set of techniques that allows the understanding of both technical and non-technical aspects of Artificial Intelligence (AI) systems. XAI is crucial to help satisfying the increasingly important demand of trustworthy Artificial Intelligence, characterized by fundamental aspects such as respect of human autonomy, prevention of harm, transparency, accountability, etc. Within XAI techniques, counterfactual explanations aim to provide to end users a set of features (and their corresponding values) that need to be changed in order to achieve a desired outcome. Current approaches rarely take into account the feasibility of actions needed to achieve the proposed explanations, and in particular, they fall short of considering the causal impact of such actions. In this paper, we present Counterfactual Explanations as Interventions in Latent Space (CEILS), a methodology to generate counterfactual explanations capturing by design the underlying causal relations from the data, and at the same time to provide feasible recommendations to reach the proposed profile. Moreover, our methodology has the advantage that it can be set on top of existing counterfactuals generator algorithms, thus minimising the complexity of imposing additional causal constrains. We demonstrate the effectiveness of our approach with a set of different experiments using synthetic and real datasets (including a proprietary dataset of the financial domain).

Scheda breve

Scheda completa

Scheda completa (DC)

	Sottotipologia
	
				Articolo in rivista - Articolo scientifico
			
	Parole chiave
	
				Algorithmic recourse; Artificial intelligence; Causality; Counterfactual explanations; Explainable AI; Machine learning;
			
	Lingua del contenuto
	
				English
			
	Data ahead of print o Data prima pubblicazione Online
	
				7-nov-2022
			
	Data di pubblicazione
	
				2024
			
	Rivista
	
				DATA MINING AND KNOWLEDGE DISCOVERY
			
	Numero del volume
	
				38
			
	Fascicolo
	
				5
			
	Pagina iniziale
	
				2733
			
	Pagina finale
	
				2769
			
	DOI dell'articolo
	
				https://dx.doi.org/10.1007/s10618-022-00889-2
			
	Fulltext
	
				none
			
	Citazione
	
				Crupi, R., Castelnovo, A., Regoli, D., Gonzalez, B. (2024). Counterfactual explanations as interventions in latent space. DATA MINING AND KNOWLEDGE DISCOVERY, 38(5), 2733-2769 [10.1007/s10618-022-00889-2].
			
	Appare nelle tipologie:
	
				01 - Articolo su rivista

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/446480

Citazioni

10

13

Social impact