“
“The notion that the brain has evolved to implement a predictive machinery for anticipation of future events has existed since early cybernetic theories (Ashby, 1952). The mechanisms
by which the brain learns the probabilistic structure of the world have been examined primarily from the perspective of reinforcement learning (RL), with a focus on how reward learning is driven by prediction errors (PEs) (Fletcher et al., 2001, McClure et al., 2003, O’Doherty et al., 2003, Pessiglione et al., 2006 and Wunderlich et al., 2011). Another perspective is provided by theories that view the brain IDH inhibitor drugs as approximating optimal Bayesian inference (Dayan et al., 1995, Doya et al., 2011, Friston, 2009, Knill and Pouget, 2004 and Körding and Wolpert, 2006). These theories go beyond reward learning and have been applied to many aspects of perception as, for example, in theories of “predictive coding” (Rao and Ballard, 1999) and the “free energy principle” (Friston et al., 2006). A central postulate of these Bayesian perspectives is that the brain continuously updates a hierarchical generative model of its sensory inputs to predict future events and infer on the causal structure of the world. This
belief updating process rests on multiple, hierarchically related PEs that are weighted by their precision. Notably, these PEs are not restricted to reward, but concern all types of sensory events as well as their underlying “laws,” e.g., probabilistic associations and how these change in time (volatility; Behrens et al., NVP-BKM120 solubility dmso 2007). Simply speaking, estimates of environmental volatility are updated in proportion to PEs about stimulus Histone demethylase probabilities; in turn, estimates of stimulus probabilities are updated by PEs about stimulus occurrences.
While several empirical studies have examined human behavior and brain activity from this Bayesian perspective, the hierarchical nature of PEs has received little attention so far. This is a significant gap, not only because hierarchically related PEs are at the heart of the Bayesian formalism, but also because PEs at different hierarchical levels may be linked to different neuromodulatory transmitter systems. While dopamine (DA) has long been related to the encoding of PEs about reward (Daw and Doya, 2006 and Schultz et al., 1997), other modulatory neurotransmitters have been linked to more abstract roles, such as encoding of “expected uncertainty” by acetylcholine (ACh) (Yu and Dayan, 2002 and Yu and Dayan, 2005). Notably, this was (implicitly) operationalized as a higher-level PE in that it represents the difference between a conditional probability (degree of cue validity) and certainty. Other computational concepts of ACh suggested that it may be representing the learning rate (Doya, 2002).