Cochlear nucleus The
cochlear nucleus is the first site of the neuronal processing of the newly converted "digital" data from the inner ear (see also
binaural fusion). In mammals, this region is anatomically and physiologically split into two regions, the
dorsal cochlear nucleus (DCN), and
ventral cochlear nucleus (VCN). The VCN is further divided by the nerve root into the posteroventral cochlear nucleus (PVCN) and the anteroventral cochlear nucleus (AVCN).
Trapezoid body The
trapezoid body is a bundle of
decussating fibers in the ventral pons that carry information used for binaural computations in the brainstem. Some of these
axons come from the
cochlear nucleus and
cross over to the other side before traveling on to the
superior olivary nucleus. This is believed to help with
localization of sound.
Superior olivary complex The
superior olivary complex is located in the
pons, and receives projections predominantly from the ventral cochlear nucleus, although the dorsal cochlear nucleus projects there as well, via the ventral acoustic stria. Within the
superior olivary complex lies the lateral superior olive (LSO) and the medial superior olive (MSO). The former is important in detecting interaural level differences while the latter is important in distinguishing
interaural time difference. The inferior colliculus also receives descending inputs from the
auditory cortex and auditory
thalamus (or
medial geniculate nucleus).
Medial geniculate nucleus The
medial geniculate nucleus is part of the thalamic relay system.
Primary auditory cortex The
primary auditory cortex is the first region of
cerebral cortex to receive auditory input. Perception of sound is associated with the left posterior
superior temporal gyrus (STG). The superior temporal gyrus contains several important structures of the brain, including
Brodmann areas 41 and 42, marking the location of the
primary auditory cortex, the cortical region responsible for the sensation of basic characteristics of sound such as pitch and rhythm. We know from research in nonhuman primates that the primary auditory cortex can probably be divided further into functionally differentiable subregions. The neurons of the primary auditory cortex can be considered to have
receptive fields covering a range of
auditory frequencies and have selective responses to harmonic pitches. Neurons integrating information from the two ears have receptive fields covering a particular region of auditory space. The primary auditory cortex is surrounded by secondary auditory cortex, and interconnects with it. These secondary areas interconnect with further processing areas in the
superior temporal gyrus, in the dorsal bank of the
superior temporal sulcus, and in the
frontal lobe. In humans, connections of these regions with the
middle temporal gyrus are probably important for
speech perception. The frontotemporal system underlying auditory perception allows us to distinguish sounds as speech, music, or noise.
The auditory ventral and dorsal streams Material was copied from this source, which is available under a Creative Commons Attribution 4.0 International License. From the primary auditory cortex emerge two separate pathways: the auditory ventral stream and auditory dorsal stream. The auditory ventral stream includes the anterior superior temporal gyrus, anterior superior temporal sulcus, middle temporal gyrus and temporal pole. Neurons in these areas are responsible for sound recognition, and extraction of meaning from sentences. The auditory dorsal stream includes the posterior superior temporal gyrus and sulcus,
inferior parietal lobule and intra-parietal sulcus. Both pathways project in humans to the inferior frontal gyrus. The most established role of the auditory dorsal stream in primates is sound localization. In humans, the auditory dorsal stream in the left hemisphere is also responsible for speech repetition and articulation, phonological long-term encoding of word names, and verbal working memory.
Ascending Auditory Pathway: Coding Mechanisms and Experience-Dependent Plasticity The ascending auditory pathway transmits acoustic information from the cochlea to auditory cortex through a series of subcortical nuclei, including the cochlear nucleus, superior olivary complex, inferior colliculus, and medial geniculate body. Throughout these stages, several organizational principles—particularly tonotopy, the ordered mapping of frequency—are preserved from the cochlea to cortex. The pathway supports both sound localization and sound identification, with the latter being critical for speech and language.
Rate Coding in the Auditory Nerve The basilar membrane converts frequency information into a place code, but intensity is also encoded through firing rate. Increased sound levels produce larger basilar membrane displacements, leading to greater hair-cell depolarization and higher firing rates in auditory nerve fibers. Auditory nerve fibers vary in spontaneous firing rates. High-spontaneous-rate fibers respond at very low thresholds but saturate quickly, whereas medium- and low-spontaneous-rate fibers encode higher intensities. This “division of labor” enables fine-grained representation of intensity across a wide dynamic range, crucial for speech perception.
Limitations of Place Coding Pure place coding cannot account for all aspects of pitch perception. As harmonic frequencies increase, cochlear filters become too broad to resolve adjacent harmonics. This limitation becomes more pronounced at higher sound intensities, motivating the need for temporal coding mechanisms.
Temporal Coding and Phase Locking In temporal (time-based) coding, auditory nerve fibers synchronize their firing to specific phases of the acoustic waveform, a process known as phase locking. Although neurons do not fire on every cycle, they fire preferentially near waveform peaks, allowing representation of periodicity even when place cues are ambiguous. Phase locking underlies the frequency-following response (FFR), a scalp-recorded electrophysiological signal that mirrors the timing, pitch, and harmonic structure of the stimulus. Speech-evoked FFRs reproduce stimulus periodicity with such fidelity that the response waveform can generate intelligible speech when played back.
Subcortical Response Timing Auditory processing in the brainstem occurs on the scale of milliseconds. Recordings from humans and nonhuman primates show activation in the inferior colliculus within 5–10 ms of stimulus onset, with thalamic and cortical responses following shortly after. This rapid temporal precision is essential for speech onset detection, phoneme discrimination, and sound localization.
Experience-Dependent Plasticity in the Auditory Brainstem Although cortical areas exhibit the most dramatic plasticity, evidence shows that the auditory brainstem is also shaped by experience via descending corticofugal pathways. Musical Training Musicians show: larger FFR amplitudes faster onset responses enhanced representation of F0 and harmonics superior pitch tracking of dynamic contours (e.g., Mandarin tones) These enhancements appear to reflect strengthened cortico-brainstem interactions. Short-Term Training Even brief (≈1 hour) phonetic discrimination training increases FFR harmonic power, whereas passive listening does not. This demonstrates rapid, learning-dependent modulation of subcortical encoding.
Descending (Corticofugal) Pathways The descending auditory pathway projects from auditory cortex to the inferior colliculus, superior olivary complex, and cochlear nucleus. These projections adjust subcortical gain, sharpen tuning, and enhance behaviorally relevant features. Thus, while sensory information travels upward, experience and attention travel downward, dynamically shaping early auditory coding.
Relevance to Speech and Language The ascending auditory pathway supports language by encoding: formant structure and harmonic content rapid temporal cues essential for stop consonants and prosody pitch contours relevant for tone languages stable representations shaped by linguistic experience Tonotopy is preserved into auditory cortex, where neurons in the superior temporal gyrus and sulcus integrate this information into language-specific categories. == Clinical significance ==