Bicocca Open Archive

Active inference and epistemic value

Friston K; Rigoli F; Ognibene D; Mathys C; Fitzgerald T; Pezzulo G

2015

Abstract

We offer a formal treatment of choice behavior based on the premise that agents minimize the expected free energy of future outcomes. Crucially, the negative free energy or quality of a policy can be decomposed into extrinsic and epistemic (or intrinsic) value. Minimizing expected free energy is therefore equivalent to maximizing extrinsic value or expected utility (defined in terms of prior preferences or goals), while maximizing information gain or intrinsic value (or reducing uncertainty about the causes of valuable outcomes). The resulting scheme resolves the exploration-exploitation dilemma: Epistemic value is maximized until there is no further information gain, after which exploitation is assured through maximization of extrinsic value. This is formally consistent with the Infomax principle, generalizing formulations of active vision based upon salience (Bayesian surprise) and optimal decisions based on expected utility and risk-sensitive (Kullback-Leibler) control. Furthermore, as with previous active inference formulations of discrete (Markovian) problems, ad hoc softmax parameters become the expected (Bayes-optimal) precision of beliefs about, or confidence in, policies. This article focuses on the basic theory, illustrating the ideas with simulations. A key aspect of these simulations is the similarity between precision updates and dopaminergic discharges observed in conditioning paradigms.

Subtype	Journal article - Scientific article
Keywords	Active inference; Agency; Bayesian inference; Bayesian surprise; Bounded rationality; Epistemic value; Exploitation; Exploration; Free energy; Information gain; Utility theory
Content language	English
Publication date	2015
Journal	COGNITIVE NEUROSCIENCE
Volume	6
Issue	4
First page	187
Last page	214
Article DOI	https://dx.doi.org/10.1080/17588928.2015.1020053
Alternative URL	https://www.tandfonline.com/doi/abs/10.1080/17588928.2015.1020053?journalCode=pcns20
Full text	reserved
Citation	Friston, K., Rigoli, F., Ognibene, D., Mathys, C., Fitzgerald, T., Pezzulo, G. (2015). Active inference and epistemic value. COGNITIVE NEUROSCIENCE, 6(4), 187-214 [10.1080/17588928.2015.1020053].
Appears in types	01 - Journal article

Files in this item:

File	Access	Size	Format
Active inference and epistemic value(1).pdf	Archive managers only	1.05 MB	Adobe PDF
175889282E20152E1020053.pdf (Publisher’s Version, Version of Record, VoR)	Archive managers only	1.13 MB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/10281/301368

Citations

546

476