Seven types of masking are discussed: multi-component contrast gain control, one-component transducer saturation, two- component phase inhibition, multiplicative noise, high spatial frequency phase locked interference, stimulus uncertainty, and noise intrusion. In the present vision research community, multi-component contrast gain is gaining in popularity while the one- and two-component masking models are losing adherents. In this paper we take the presently unpopular stance and argue against multi-component gain control models. We have a two-pronged approach. First, we discuss examples where high contrast maskers that overlap the test stimulus in both position and spatial frequency nevertheless produce little masking. Second, we show that alternatives to gain control are still viable, as long as uncertainty and noise intrusion effects are included. Finally, a classification is offered for different types of uncertainty effects that can produce large masking behavior.
KEYWORDS: Transducers, Interference (communication), Spatial frequencies, Systems modeling, Visualization, Psychophysics, Signal to noise ratio, Visual system, Target detection, Signal detection
The properties of spatial vision mechanisms are often explored psychophysically with simultaneous masking paradigms. A variety of hypotheses have been proposed to explain how the mask pattern utilized in these paradigms increases threshold. Numerous studies have investigated the properties of a particular origin of masking hypothesis but few have attempted to compare the properties of masking at several points in the process. Our study isolates masking due to lateral divisive inhibition at a point where mechanism responses are combined, and compares it with masking of the same target due to a nonlinearity either intrinsic to a mechanism or directly operating on the response of a single mechanism. We also measure the slopes of psychometric functions to examine the relationship between uncertainty and mask contrast. Studies of simultaneous masking utilizing a pedestal mask (an identical test and mask pattern) have measured facilitation for low contrast masks. This decrease in threshold from the solo target threshold is commonly referred to as the 'dipper' effect and has been explained as an increase in signal-to- noise ratio from the high unmasked level occurring as the visual system becomes more certain of target location. The level of uncertainty is indicated by the slope of sensitivity to the target as a function of target contrast in the threshold region. In these studies, high contrast masks have evoked an increase in target threshold. There have been many theories explaining this threshold increase. Some suggest that masking is the result of an intrinsic nonlinearity within a mechanism or of a contrast nonlinearity that operates directly on the output of a single mechanism. Others put the source of masking at a gain control operation which occurs when a surrounding set of mechanisms divide the response of a single mechanism by their summed response. Still others attribute the masking to noise that is multiplicative relative to the neural response signal, or noise that intrudes on the detecting mechanism from neighboring mechanisms. A detailed review of this debate is provided by the paper by Klein et al., 3016-02 in this Proceedings. Threshold elevation functions that show the relationship between mask spatial frequency and masking magnitude cannot illuminate this debate, as we demonstrated at ARVO (1994). For that study, we generated threshold elevation functions (the ratio of unmasked versus masked target threshold) for multi-channel systems using computational models that invoked either divisive inhibition, a set of transducer nonlinearities or multiplicative noise. Threshold elevation functions were indistinguishable when each masking process was assumed to have similar strength. These results led us to design the experiment presented here, which attempts to compare the effects of two of these masking processes, lateral divisive inhibition and nonlinear transducer compression.
The properties of human stereoscopic mechanisms may be derived from dichoptic interaction and masking effects on stereoscopic detection thresholds in any relevant stimulus domain (spatial frequency, temporal frequency, disparity, orientation, etc.). The present study focuses on the spatial properties of mechanisms underlying stereoscopic depth detection. The computational approach is based on the full exploration of plausible model structures to characterize their idiosyncrasies, which often allows exclusion of proposed mechanisms by comparison with data obtained under conditions in which the idiosyncrasies should be expressed. For example, we conducted a detailed analysis of threshold elevation functions (TEFs) under plausible channel shapes, combination rules and masking behavior derived from previous studies. The analysis reveals that TEFs may be much narrower than and differ in shape from the underlying mechanisms. For example, only two discrete channels are required to produce TEFs peaking close to each fixed test frequency, with no relation to channel peaks. We apply this analysis to the stereospatial masking functions collected by Yang and Blake (1991) to determine the likely channel structure underlying the empirical masking performance. The analysis generally supports the two-mechanism model that they propose but shows that the assumptions underlying their estimates of the unmasked sensitivity function are incorrect. The analysis excludes stereospatial channels tuned below 2.5 c/deg, a region in which Schor, Wood, and Ogawa (1984) obtained evidence for many narrowly tuned channels by measuring disparity thresholds for targets with different peak tunings in the two eyes. Our computational model for the latter data is consistent with the lowest tuned channel being at 2.5 c/deg, this channel being narrowly tuned to dichoptic contrast differences, as described by Legge and Gu (1989) and Halpern and Blake (1988). Thus, all such stereo tuning data can be explained in a model in which all stereoscopic channels are tuned above 2.5 c/deg.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.