First, there are multiple separate
stripes (groups of units) in the prefrontal cortex and
striatum layers. Each stripe can be independently updated, such that this system can remember several different things at the same time, each with a different "updating policy" of when memories are updated and maintained. The active maintenance of the memory is in prefrontal cortex (PFC), and the updating signals (and updating policy more generally) come from the
striatum units (a subset of
basal ganglia units).
PVLV provides
reinforcement learning signals to train up the dynamic gating system in the basal ganglia.
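The independent, per-stripe updating described above can be illustrated with a short Python sketch. This is a toy illustration, not the Emergent implementation: the `Stripe` class, the `step` function, and the boolean gate signals are hypothetical names introduced here, and the gate values are supplied by hand rather than learned.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass
class Stripe:
    """One PFC stripe holding a single item (hypothetical structure)."""
    content: Optional[str] = None


def step(stripes: Sequence[Stripe], stimulus: str, gates: Sequence[bool]) -> None:
    """Update each stripe independently.

    Gate open   -> the stripe stores the current stimulus.
    Gate closed -> the stripe maintains what it already holds.
    """
    for stripe, gate_open in zip(stripes, gates):
        if gate_open:
            stripe.content = stimulus


stripes = [Stripe(), Stripe()]
step(stripes, "A", gates=[True, False])   # only stripe 0 updates
step(stripes, "B", gates=[False, True])   # only stripe 1 updates
step(stripes, "C", gates=[False, False])  # both stripes maintain
print([s.content for s in stripes])  # ['A', 'B']
```

In the model itself the gate signals are not given externally; they come from the striatum units and are shaped by reinforcement learning.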
Sensory input and motor output: The sensory input is connected to the
posterior cortex which is connected to the motor output. The sensory input is also linked to the
PVLV system.
Posterior cortex: The posterior cortex forms the hidden layers of the input/output mapping. The PFC is connected with the posterior cortex to contextualize this input/output mapping.
PFC: The PFC (for output gating) has a localist one-to-one representation of the input units for every stripe. Thus, you can look at these PFC representations and see directly what the network is maintaining. The PFC maintains the working memory needed to perform the task.
Striatum: This is the dynamic gating system representing the
striatum units of the
basal ganglia. Every even-index unit within a stripe represents "Go", while the odd-index units represent "NoGo." The Go units cause updating of the PFC, while the NoGo units cause the PFC to maintain its existing memory representation. There are groups of units for every stripe. In the PBWM model in
Emergent, the matrices represent the striatum.
PVLV: All of these layers are part of the
PVLV system. The PVLV system controls the dopaminergic modulation of the basal ganglia (BG). Thus, BG and PVLV together form an actor-critic architecture in which the PVLV system learns when to update.
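The actor-critic relationship can be sketched as follows, under strong simplifying assumptions: a scalar dopamine signal (the critic, standing in for PVLV) shifts the Go and NoGo weights of a stripe (the actor, standing in for the BG) in opposite directions. The update rule, the learning rate, and the function name `update_gating_weights` are illustrative assumptions, not the equations used by PBWM.

```python
from typing import Tuple


def update_gating_weights(w_go: float, w_nogo: float, dopamine: float,
                          lrate: float = 0.1) -> Tuple[float, float]:
    """Shift Go/NoGo weights using a scalar dopamine signal (assumed rule).

    Positive dopamine (outcome better than expected) strengthens Go and
    weakens NoGo; negative dopamine does the opposite.
    """
    return w_go + lrate * dopamine, w_nogo - lrate * dopamine


w_go, w_nogo = 0.5, 0.5
# The critic reports a positive outcome after a gating action.
w_go, w_nogo = update_gating_weights(w_go, w_nogo, dopamine=1.0)
print(w_go > w_nogo)  # True
```

With repeated positive signals, a stripe in this sketch becomes more likely to update in that context; repeated negative signals bias it toward maintenance.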
SNrThal: SNrThal represents the
substantia nigra pars reticulata (SNr) and the associated area of the
thalamus, which produce a competition among the Go/NoGo units within a given stripe and mediate this competition using
k-winners-take-all dynamics. If there is more overall Go activity than NoGo activity in a given stripe, then the associated SNrThal unit is activated, and it drives updating in the PFC. There is one SNrThal unit for every stripe.
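The Go/NoGo competition can be sketched as a small Python function. It uses the even-index (Go) and odd-index (NoGo) convention described for the striatum layer and returns one gate decision per stripe, mirroring the one-unit-per-stripe layout of SNrThal. The function name `snrthal_gates`, the use of summed activity, and the way `k` limits the number of gating stripes are assumptions made for illustration; the model's actual k-winners-take-all computation is more involved.

```python
from typing import List, Sequence


def snrthal_gates(stripe_acts: Sequence[Sequence[float]], k: int) -> List[bool]:
    """Return one gate decision per stripe (illustrative only).

    Even-index units are treated as Go and odd-index units as NoGo.
    A stripe is a candidate when its summed Go activity exceeds its summed
    NoGo activity; at most k candidates with the largest Go-minus-NoGo
    margin are allowed to gate (a crude stand-in for k-winners-take-all).
    """
    margins = [sum(acts[0::2]) - sum(acts[1::2]) for acts in stripe_acts]
    candidates = [i for i, m in enumerate(margins) if m > 0]
    ranked = sorted(candidates, key=lambda i: margins[i], reverse=True)
    winners = set(ranked[:k])
    return [i in winners for i in range(len(stripe_acts))]


acts = [
    [0.9, 0.1, 0.8, 0.2],  # stripe 0: Go dominates strongly
    [0.1, 0.7, 0.2, 0.6],  # stripe 1: NoGo dominates
    [0.6, 0.4, 0.5, 0.4],  # stripe 2: Go dominates weakly
]
print(snrthal_gates(acts, k=1))  # [True, False, False]
```

A `True` entry corresponds to an active SNrThal unit, which would drive updating in the matching PFC stripe; a `False` entry leaves that stripe maintaining its current contents.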
VTA and SNc: The ventral tegmental area (VTA) and
substantia nigra pars compacta (SNc) are part of the
dopamine layer. This layer models midbrain dopamine neurons, which control the
dopaminergic modulation of the basal ganglia.

==See also==