Paper
1 October 1991 Learning spatially coherent properties of the visual world in connectionist networks
Suzanna Becker, Geoffrey E. Hinton
Author Affiliations +
Abstract
In the unsupervised learning paradigm, a network of neuron-like units is presented with an ensemble of input patterns from a structured environment, such as the visual world, and learns to represent the regularities in that input. The major goal in developing unsupervised learning algorithms is to find objective functions that characterize the quality of the network's representation without explicitly specifying the desired outputs of any of the units. The sort of objective functions considered cause a unit to become tuned to spatially coherent features of visual images (such as texture, depth, shading, and surface orientation), by learning to predict the outputs of other units which have spatially adjacent receptive fields. Simulations show that using an information-theoretic algorithm called IMAX, a network can be trained to represent depth by observing random dot stereograms of surfaces with continuously varying disparities. Once a layer of depth-tuned units has developed, subsequent layers are trained to perform surface interpolation of curved surfaces, by learning to predict the depth of one image region based on depth measurements in surrounding regions. An extension of the basic model allows a population of competing neurons to learn a distributed code for disparity, which naturally gives rise to a representation of discontinuities.
© (1991) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Suzanna Becker and Geoffrey E. Hinton "Learning spatially coherent properties of the visual world in connectionist networks", Proc. SPIE 1569, Stochastic and Neural Methods in Signal Processing, Image Processing, and Computer Vision, (1 October 1991); https://doi.org/10.1117/12.48380
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Stochastic processes

Visualization

Image processing

Sensors

Machine learning

Computer vision technology

Machine vision

RELATED CONTENT

Reinforcement learning in computer vision
Proceedings of SPIE (April 13 2018)
Shape-from-focus: surface reconstruction of hybrid surfaces
Proceedings of SPIE (October 01 1991)
Spatio-temporal curvature measures for flow-field analysis
Proceedings of SPIE (September 01 1991)
Robust regularized image restoration
Proceedings of SPIE (October 01 1991)

Back to Top