Bicocca Open Archive

Objective Bayes Structure Learning in Gaussian Graphical Models

PETRAKIS, NIKOLAOS

2020

Abstract

Graphical models represent conditional independence relationships among variables by means of a graph whose nodes correspond to the variables. They are widely used in genomic studies, finance, and energy forecasting, among other fields. More specifically, for a collection of q variables whose conditional independence structure is represented by an undirected graph, we assume that the structure of the underlying graph is unknown, and we are interested in inferring it from the data at hand. In the literature this procedure is referred to as Structure Learning: a graphical model is selected to depict the conditional independence relationships among the q variables. We start by defining a model space consisting of all possible graphical models; we then define a scoring function that allows us to score the models in the model space; finally, we construct a search algorithm that navigates the model space to identify the model that best explains the data. The choice of scoring function is crucial for optimizing the search through the model space. Our approach to this problem is purely Bayesian, which allows a more thorough treatment of uncertainty: we use estimates of posterior model probabilities to rank the models. Specifying a conditional prior on the column covariance matrix is not trivial, because each graph under consideration induces a different independence structure and thereby affects the parameter space. In this context we cannot directly use improper priors, since they would result in indeterminate Bayes factors; we would therefore be required to carefully elicit a prior distribution under each graph, a task that becomes infeasible in higher dimensions.
To create an automated Bayesian scoring technique, we resort to Objective Bayes approaches, which start from an improper prior distribution and output a fully usable prior distribution. In this thesis, we propose two alternative Objective Bayes approaches for estimating posterior model probabilities, namely the Expected Posterior Prior approach and the Power-Expected Posterior Prior approach. Both approaches use the device of imaginary observations to provide usable prior distributions and are theoretically sounder than the Fractional Bayes Factor of O'Hagan. Our goal is to introduce both the Expected and the Power-Expected Posterior Prior approaches to the field of structure learning of undirected graphical models and to evaluate their performance using certain stochastic search techniques. Diverse simulation scenarios are considered, as well as a real-life data application.

	Tutor not affiliated with Bicocca
	
				CONSONNI, GUIDO
			
	External supervisors and coordinators
	
				PELUSO, STEFANO
			
	Keywords
	
				Graphical Models; Objective Bayes; FINCS; EPP; PEPP
			
	Scientific-disciplinary sectors (valid from 9 May 2024)
	
				Sector STAT-01/A - Statistics
			
	* Content language
	
				English
			
	* Defense date
	
				7 Feb 2020
			
	* Doctoral programme
	
				STATISTICA E FINANZA MATEMATICA (Statistics and Mathematical Finance)
			
	* Doctoral cycle
	
				32
			
	* Academic year of degree award
	
				2018/2019
			
	Fulltext
	
				open
			
	Citation
	
				(2020). Objective Bayes Structure Learning in Gaussian Graphical Models. (Doctoral thesis, Università degli Studi di Milano-Bicocca, 2020).
			
	Appears in categories:
	
				07 - Bicocca doctoral theses, post 2009

Files in this item:

File	Size	Format
phd_unimib_816489.pdf (open access; description: doctoral thesis)	6.5 MB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/10281/262921
