Bicocca Open Archive

Graphical models based on Directed Acyclic Graphs (DAGs) are widely used to answer causal questions across a variety of scientific and social disciplines. However, observational data alone cannot distinguish in general between DAGs representing the same conditional independence assertions (Markov equivalent DAGs); as a consequence, the orientation of some edges in the graph remains indeterminate. Interventional data, produced by exogenous manipulations of variables in the network, enhance the process of structure learning because they allow to distinguish among equivalent DAGs, thus sharpening causal inference. Starting from an equivalence class of DAGs, a few procedures have been devised to produce a collection of variables to be manipulated in order to identify a causal DAG. Yet, these algorithmic approaches do not determine the sample size of the interventional data required to obtain a desired level of statistical accuracy. We tackle this problem from a Bayesian experimental design perspective, taking as input a sequence of target variables to be manipulated to identify edge orientation. We then propose a method to determine, at each intervention, the optimal sample size to produce an experiment which, with high assurance, will deliver an overall probability of decisive and correct evidence.

Castelletti, F., Consonni, G. (2024). Bayesian Sample Size Determination for Causal Discovery. STATISTICAL SCIENCE, 39(2), 305-321 [10.1214/23-STS905].

Bayesian Sample Size Determination for Causal Discovery

Castelletti F.;Consonni G.

2024

Abstract

Graphical models based on Directed Acyclic Graphs (DAGs) are widely used to answer causal questions across a variety of scientific and social disciplines. However, observational data alone cannot distinguish in general between DAGs representing the same conditional independence assertions (Markov equivalent DAGs); as a consequence, the orientation of some edges in the graph remains indeterminate. Interventional data, produced by exogenous manipulations of variables in the network, enhance the process of structure learning because they allow to distinguish among equivalent DAGs, thus sharpening causal inference. Starting from an equivalence class of DAGs, a few procedures have been devised to produce a collection of variables to be manipulated in order to identify a causal DAG. Yet, these algorithmic approaches do not determine the sample size of the interventional data required to obtain a desired level of statistical accuracy. We tackle this problem from a Bayesian experimental design perspective, taking as input a sequence of target variables to be manipulated to identify edge orientation. We then propose a method to determine, at each intervention, the optimal sample size to produce an experiment which, with high assurance, will deliver an overall probability of decisive and correct evidence.

Scheda breve

Scheda completa

Scheda completa (DC)

	Sottotipologia
	
				Articolo in rivista - Articolo scientifico
			
	Parole chiave
	
				Active learning; Bayes factor; Bayesian experimental design; directed acyclic graph; intervention;
			
	Lingua del contenuto
	
				English
			
	Data ahead of print o Data prima pubblicazione Online
	
				5-mag-2024
			
	Data di pubblicazione
	
				2024
			
	Rivista
	
				STATISTICAL SCIENCE
			
	Numero del volume
	
				39
			
	Fascicolo
	
				2
			
	Pagina iniziale
	
				305
			
	Pagina finale
	
				321
			
	DOI dell'articolo
	
				https://dx.doi.org/10.1214/23-STS905
			
	Fulltext
	
				reserved
			
	Citazione
	
				Castelletti, F., Consonni, G. (2024). Bayesian Sample Size Determination for Causal Discovery. STATISTICAL SCIENCE, 39(2), 305-321 [10.1214/23-STS905].
			
	Appare nelle tipologie:
	
				01 - Articolo su rivista

File in questo prodotto:

File	Dimensione	Formato
Castelletti-2024-Statistical Science-Vor.pdf Solo gestori archivio Tipologia di allegato: Publisher’s Version (Version of Record, VoR) Licenza: Tutti i diritti riservati Dimensione 445.66 kB Formato Adobe PDF Visualizza/Apri Richiedi una copia	445.66 kB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/503569

Citazioni

0

0

Social impact