Anticipated ITD statistics built-in human sound localization

Published on Apr 18, 2018 in bioRxiv
· DOI: 10.1101/303347
Rodrigo Pavão
Estimated H-index: 5
(UFABC: Universidade Federal do ABC),
Elyse S. Sussman
Estimated H-index: 36
(Albert Einstein College of Medicine)
Jose L. Pena
Estimated H-index: 16
(Albert Einstein College of Medicine)
The variability of natural scenes places perceptual processes in the realm of statistical inference, where sensory evidence must be weighted by its reliability. Absent prior information, estimating environmental variability requires real-time sampling and computations. However, a portion of environmental variability can be assumed invariant across conditions. Perceptual tasks relying on time-dependent information may be vastly enhanced if the invariant statistical structure of sensory cues is built into the underlying neural processing. We investigated this question in human sound localization, where the statistics of spatial cues can be estimated. Localizing low-frequency sounds in the horizontal plane relies on interaural time differences (ITD). We estimated the ITD statistics across frequency and azimuth from human head-related transfer functions (HRTFs). The mean ITD varied with azimuth following a sigmoid relationship, whose slope is steepest at the center. In addition, ITD was more variable over time for sounds located in the periphery than for sounds at the center, in a frequency-dependent manner. We investigated the role of these statistics (ITD slope and ITD variability) in low-frequency lateralization by human subjects, to test the hypothesis that high-order sensory statistics are represented in the human brain, influencing spatial discriminability and novelty detection. Thresholds for discriminating ITD changes reported by classical studies (Mills, 1958) were predicted by a model that considered both ITD slope and ITD variability. To further test our hypothesis, EEG novelty responses were recorded in human subjects undergoing an oddball stimulation sequence, where repetitive ("standard") tones of a given ITD were combined with sporadic ("deviant") tones of a different ITD. By using insert earphones, ITD was shifted with zero variability across time and location. Mismatch negativity (MMN) brain signals were used as an index of discriminability between standard and deviant stimuli. We found that MMNs were weaker for standards in the periphery, where the ITD slope is lower and the ITD variability is higher. Overall, the amplitude of novelty EEG signals was predicted by the difference in ITD between the standard and deviant, normalized by the anticipated discriminability of the standard location, indicating that change detection is weighted by expected statistics of the sensory input. In sum, our results show that spatial discriminability thresholds and deviant detection are consistent with a representation of anticipated ITD statistics in the human brain, supporting the hypothesis that high-order statistics are built into human perceptual processes, biasing behavior.
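The relationship described in the abstract can be illustrated with a small numerical sketch. This is not the authors' model: the tanh-shaped sigmoid, the constants ITD_MAX_US and AZ_SCALE_DEG, the linear variability function itd_sd, and the choice of slope divided by variability as a discriminability index are all assumptions made here for illustration. The sketch only shows how a fixed ITD step could yield a weaker predicted novelty response when the standard sits in the periphery, where the slope is lower and the variability is higher.

```python
import math

# All constants below are hypothetical, chosen only for illustration.
ITD_MAX_US = 700.0    # assumed maximum ITD magnitude (microseconds)
AZ_SCALE_DEG = 45.0   # assumed azimuth scale of the sigmoid (degrees)


def mean_itd(az_deg: float) -> float:
    """Assumed sigmoid (tanh) mapping from azimuth to mean ITD."""
    return ITD_MAX_US * math.tanh(az_deg / AZ_SCALE_DEG)


def itd_slope(az_deg: float) -> float:
    """Derivative of mean_itd with respect to azimuth (us per degree).

    d/dx tanh(x) = 1 / cosh(x)^2, so the slope peaks at azimuth 0.
    """
    return (ITD_MAX_US / AZ_SCALE_DEG) / math.cosh(az_deg / AZ_SCALE_DEG) ** 2


def itd_sd(az_deg: float) -> float:
    """Hypothetical ITD variability that grows toward the periphery."""
    return 10.0 + 0.5 * abs(az_deg)


def discriminability(az_deg: float) -> float:
    """Assumed index: high slope and low variability give a high value."""
    return itd_slope(az_deg) / itd_sd(az_deg)


def predicted_novelty(standard_az_deg: float, delta_itd_us: float) -> float:
    """ITD step between standard and deviant, weighted by the
    anticipated discriminability at the standard location."""
    return abs(delta_itd_us) * discriminability(standard_az_deg)


if __name__ == "__main__":
    for az in (0.0, 30.0, 60.0):
        print(az, round(itd_slope(az), 2), round(predicted_novelty(az, 100.0), 2))
```

Under these assumptions, the same 100 microsecond ITD step produces a larger predicted response for a frontal standard than for a peripheral one, which is the qualitative pattern the abstract reports for MMN amplitude.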
  • References (49)
  • Citations (1)
#1Fanny Cazettes (Albert Einstein College of Medicine)H-Index: 6
#2Brian J. Fischer (SU: Seattle University)H-Index: 13
Last. Jose L. Pena (Albert Einstein College of Medicine)H-Index: 16
The midbrain map of auditory space commands sound-orienting responses in barn owls. Owls precisely localize sounds in frontal space but underestimate the direction of peripheral sound sources. This bias for central locations was proposed to be adaptive to the decreased reliability in the periphery of sensory cues used for sound localization by the owl. Understanding the neural pathway supporting this biased behavior provides a means to address how adaptive motor commands are implemented by neuro...
3 Citations
#1Gloria G. ParrasH-Index: 2
#2Javier Nieto-DiegoH-Index: 3
Last. Manuel S. Malmierca (University of Salamanca)H-Index: 38
Perception is characterized by a reciprocal exchange of predictions and prediction error signals between neural regions. However, the relationship between such sensory mismatch responses and hierarchical predictive processing has not yet been demonstrated at the neuronal level in the auditory pathway. We recorded single-neuron activity from different auditory centers in anaesthetized rats and awake mice while animals were played a sequence of sounds, designed to separate the responses due to pre...
31 Citations
#1Brian J. Fischer (SU: Seattle University)H-Index: 13
#2Jose L. Pena (Albert Einstein College of Medicine)H-Index: 16
Integration of multiple sensory cues can improve performance in detection and estimation tasks. There is an open theoretical question of the conditions under which linear or nonlinear cue combination is Bayes-optimal. We demonstrate that a neural population decoded by a population vector requires nonlinear cue combination to approximate Bayesian inference. Specifically, if cues are conditionally independent, multiplicative cue combination is optimal for the population vector. The model was teste...
2 Citations
Interaural time difference (ITD) is a major cue to sound localization in humans and animals. For a given subject and position in space, ITD depends on frequency. This variation is analyzed here using a head related transfer functions (HRTFs) database collected from the literature and comprising human HRTFs from 130 subjects and animal HRTFs from six specimens of different species. For humans, the ITD is found to vary with frequency in a way that shows consistent differences with respect to a sph...
11 Citations
#1Fanny Cazettes (Albert Einstein College of Medicine)H-Index: 6
#2Brian J. Fischer (SU: Seattle University)H-Index: 13
Last. Jose L. Pena (Albert Einstein College of Medicine)H-Index: 16
Optimal use of sensory information requires that the brain estimates the reliability of sensory cues, but the neural correlate of cue reliability relevant for behavior is not well defined. Here, we addressed this issue by examining how the reliability of spatial cue influences neuronal responses and behavior in the owl's auditory system. We show that the firing rate and spatial selectivity changed with cue reliability due to the mechanisms generating the tuning to the sound localization cue. We ...
13 Citations
#1Dylan Rich (SU: Seattle University)H-Index: 1
#2Fanny Cazettes (Albert Einstein College of Medicine)H-Index: 6
Last. Brian J. Fischer (SU: Seattle University)H-Index: 13
Bayesian models are often successful in describing perception and behavior, but the neural representation of probabilities remains in question. There are several distinct proposals for the neural representation of probabilities, but they have not been directly compared in an example system. Here we consider three models: a non-uniform population code where the stimulus-driven activity and distribution of preferred stimuli in the population represent a likelihood function and a prior, respectivel...
14 Citations
#1Fanny Cazettes (Albert Einstein College of Medicine)H-Index: 6
#2Brian J. Fischer (SU: Seattle University)H-Index: 13
Last. Jose L. Pena (Albert Einstein College of Medicine)H-Index: 16
The robust representation of the environment from unreliable sensory cues is vital for the efficient function of the brain. However, how the neural processing captures the most reliable cues is unknown. The interaural time difference (ITD) is the primary cue to localize sound in horizontal space. ITD is encoded in the firing rate of neurons that detect interaural phase difference (IPD). Due to the filtering effect of the head, IPD for a given location varies depending on the environmental contex...
16 Citations
#1Wiktor Mlynarski (MPG: Max Planck Society)H-Index: 5
#2Jürgen Jost (MPG: Max Planck Society)H-Index: 45
Binaural sound localization is usually considered a discrimination task, where interaural phase (IPD) and level (ILD) disparities at narrowly tuned frequency channels are utilized to identify a position of a sound source. In natural conditions however, binaural circuits are exposed to a stimulation by sound waves originating from multiple, often moving and overlapping sources. Therefore statistics of binaural cues depend on acoustic properties and the spatial configuration of the environment. Di...
11 Citations
#1Elyse S. Sussman (Albert Einstein College of Medicine)H-Index: 36
#2S. Chen (Albert Einstein College of Medicine)H-Index: 1
Last. Elizabeth A. Dinces (Albert Einstein College of Medicine)H-Index: 5
The goal of this review article is to redefine what the mismatch negativity (MMN) component of event-related potentials reflects in auditory scene analysis, and to provide an overview of how the MMN serves as a valuable tool in Cognitive Neuroscience research. In doing so, some of the old beliefs (five common ‘myths’) about MMN will be dispelled, such as the notion that MMN is a simple feature discriminator and that attention itself modulates MMN elicitation. A revised description of what MMN tr...
49 Citations
#1Neil L. Aaronson (Richard Stockton College of New Jersey)H-Index: 3
#2William M. HartmannH-Index: 31
The Woodworth model and formula for interaural time difference is frequently used as a standard in physiological and psychoacoustical studies of binaural hearing for humans and other animals. It is a frequency-independent, ray-tracing model of a rigid spherical head that is expected to agree with the high-frequency limit of an exact diffraction model. The predictions by the Woodworth model for antipodal ears and for incident plane waves are here compared with the predictions of the exact model a...
21 Citations
Cited By (1)
#1Jose L. Pena (Albert Einstein College of Medicine)H-Index: 16
#2Fanny CazettesH-Index: 6
Last. Brian J. Fischer (SU: Seattle University)H-Index: 13
A major cue to infer sound direction is the difference in arrival time of the sound at the left and right ears, called interaural time difference (ITD). The neural coding of ITD and its similarity across species has been strongly debated. In the barn owl, an auditory specialist relying on sound localization to capture prey, ITDs within the physiological range determined by the head width are topographically represented at each frequency. The topographic representation suggests that sound directi...
#1Andrew D. Brown (UW: University of Washington)H-Index: 12
#2Victor Benichoux (Pasteur Institute)H-Index: 7
Last. Daniel J. Tollin (University of Colorado Denver)H-Index: 23
Sensory performance is constrained by the information in the stimulus and the precision of the involved sensory system(s). Auditory spatial acuity is robust across a broad range of sound frequencies and source locations, but declines at eccentric lateral angles. The basis of such variation is not fully understood. Low-frequency auditory spatial acuity is mediated by sensitivity to interaural time difference (ITD) cues. While low-frequency spatial acuity varies across azimuth and some ph...
2 Citations