Bicocca Open Archive

Bonchi, F., Mishra, B., Gullo, F., Ramazzotti, D. (2018). Probabilistic causal analysis of social influence. In International Conference on Information and Knowledge Management, Proceedings (pp.1003-1012). Association for Computing Machinery [10.1145/3269206.3271756].

Probabilistic causal analysis of social influence

Bonchi F.; Mishra B.; Gullo F.; Ramazzotti D.

2018

Abstract

Mastering the dynamics of social influence requires separating, in a database of information-propagation traces, the genuine causal processes from temporal correlation, i.e., homophily and other spurious causes. However, most studies that characterize social influence and, more generally, most data-science analyses focus on correlations, statistical independence, or conditional independence. Only recently has there been a resurgence of interest in causal data science, e.g., grounded on causality theories. In this paper we adopt a principled causal approach to the analysis of social influence from information-propagation data, rooted in the theory of probabilistic causation. Our approach consists of two phases. In the first phase, to avoid the pitfalls of misinterpreting causation when the data spans a mixture of several subtypes (Simpson's paradox), we partition the set of propagation traces into groups such that each group is as little contradictory as possible in terms of the hierarchical structure of information propagation. To achieve this goal, we borrow the notion of agony [26] and define the Agony-bounded Partitioning problem, which we prove to be hard and for which we develop two efficient algorithms with approximation guarantees. In the second phase, for each group from the first phase, we apply a constrained MLE approach to ultimately learn a minimal causal topology. Experiments on synthetic data show that our method is able to retrieve the genuine causal arcs w.r.t. a ground-truth generative model. Experiments on real data show that, by focusing only on the extracted causal structures instead of the whole social graph, the effectiveness of predicting influence spread is significantly improved.
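
Illustrative note (not part of the paper's abstract): the agony notion mentioned above can be sketched on a toy graph. The sketch assumes the definition commonly used in the literature on hierarchy in directed networks, where an arc u -> v under a ranking r costs max(r(u) - r(v) + 1, 0) and the agony of a graph is the minimum total cost over all rankings. Whether this matches the exact formulation cited as [26] is an assumption. The function names `agony` and `min_agony` are hypothetical, and the brute-force search is only practical for a handful of nodes; it is not the authors' algorithm.

```python
from itertools import product


def agony(edges, rank):
    """Total agony of a directed graph under a given ranking.

    Assumed definition: an arc u -> v costs max(rank[u] - rank[v] + 1, 0),
    so arcs pointing strictly "up" the ranking are free.
    """
    return sum(max(rank[u] - rank[v] + 1, 0) for u, v in edges)


def min_agony(edges):
    """Brute-force minimum agony over all rankings with values in 0..n-1.

    Exponential in the number of nodes; for toy illustration only.
    """
    nodes = sorted({n for edge in edges for n in edge})
    best = None
    for ranks in product(range(len(nodes)), repeat=len(nodes)):
        cost = agony(edges, dict(zip(nodes, ranks)))
        if best is None or cost < best:
            best = cost
    return best


# A chain is perfectly hierarchical; a directed 3-cycle is not.
chain = [("a", "b"), ("b", "c")]
cycle = [("a", "b"), ("b", "c"), ("c", "a")]
print(min_agony(chain), min_agony(cycle))
```

Under this assumed definition, a directed acyclic graph has agony 0, while a directed cycle cannot be ranked without cost. This is the sense in which a group of propagation traces can be more or less contradictory with respect to a hierarchy.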

Contribution type: slide + paper
Keywords: Probabilistic causality, Social influence
Content language: English
Conference name: 27th ACM International Conference on Information and Knowledge Management, CIKM 2018, October 22-26
Conference year: 2018
Volume editors: Paton N., Candan S., Wang H., Allan J., Agrawal R., Labrinidis A., Cuzzocrea A., Zaki M., Srivastava D., Broder A., Schuster A.
Proceedings title: International Conference on Information and Knowledge Management, Proceedings
Proceedings ISBN: 9781450360142
Publication date: 2018
First page: 1003
Last page: 1012
DOI: https://dx.doi.org/10.1145/3269206.3271756
Fulltext: none
Citation: Bonchi, F., Mishra, B., Gullo, F., Ramazzotti, D. (2018). Probabilistic causal analysis of social influence. In International Conference on Information and Knowledge Management, Proceedings (pp.1003-1012). Association for Computing Machinery [10.1145/3269206.3271756].
Appears in types: 02 - Conference contribution

Files in this item:

There are no files associated with this item.

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/10281/285182
