Book contents
- Frontmatter
- Contents
- Preface
- Contributors
- 1 The Evolution of Object Categorization and the Challenge of Image Abstraction
- 2 A Strategy for Understanding How the Brain Accomplishes Object Recognition
- 3 Visual Recognition Circa 2008
- 4 On What It Means to See, and WhatWe Can Do About It
- 5 Generic Object Recognition by Inference of 3-D Volumetric Parts
- 6 What Has fMRI Taught Us About Object Recognition?
- 7 Object Recognition Through Reasoning About Functionality: A Survey of Related Work
- 8 The Interface Theory of Perception: Natural Selection Drives True Perception to Swift Extinction
- 9 Words and Pictures: Categories, Modifiers, Depiction, and Iconography
- 10 Structural Representation of Object Shape in the Brain
- 11 Learning Hierarchical Compositional Representations of Object Structure
- 12 Object Categorization in Man, Monkey, and Machine: Some Answers and Some Open Questions
- 13 Learning Compositional Models for Object Categories from Small Sample Sets
- 14 The Neurophysiology and Computational Mechanisms of Object Representation
- 15 From Classification to Full Object Interpretation
- 16 Visual Object Discovery
- 17 Towards Integration of Different Paradigms in Modeling, Representation, and Learning of Visual Categories
- 18 Acquisition and Disruption of Category Specificity in the Ventral Visual Stream: The Case of Late Developing and Vulnerable Face-Related Cortex
- 19 Using Simple Features and Relations
- 20 The Proactive Brain: Using Memory-Based Predictions in Visual Recognition
- 21 Spatial Pyramid Matching
- 22 Visual Learning for Optimal Decisions in the Human Brain
- 23 Shapes and Shock Graphs: From Segmented Shapes to Shapes Embedded in Images
- 24 Neural Encoding of Scene Statistics for Surface and Object Inference
- 25 Medial Models for Vision
- 26 Multimodal Categorization
- 27 Comparing 2-D Images of 3-D Objects
- Index
- Plate section
27 - Comparing 2-D Images of 3-D Objects
Published online by Cambridge University Press: 20 May 2010
- Frontmatter
- Contents
- Preface
- Contributors
- 1 The Evolution of Object Categorization and the Challenge of Image Abstraction
- 2 A Strategy for Understanding How the Brain Accomplishes Object Recognition
- 3 Visual Recognition Circa 2008
- 4 On What It Means to See, and WhatWe Can Do About It
- 5 Generic Object Recognition by Inference of 3-D Volumetric Parts
- 6 What Has fMRI Taught Us About Object Recognition?
- 7 Object Recognition Through Reasoning About Functionality: A Survey of Related Work
- 8 The Interface Theory of Perception: Natural Selection Drives True Perception to Swift Extinction
- 9 Words and Pictures: Categories, Modifiers, Depiction, and Iconography
- 10 Structural Representation of Object Shape in the Brain
- 11 Learning Hierarchical Compositional Representations of Object Structure
- 12 Object Categorization in Man, Monkey, and Machine: Some Answers and Some Open Questions
- 13 Learning Compositional Models for Object Categories from Small Sample Sets
- 14 The Neurophysiology and Computational Mechanisms of Object Representation
- 15 From Classification to Full Object Interpretation
- 16 Visual Object Discovery
- 17 Towards Integration of Different Paradigms in Modeling, Representation, and Learning of Visual Categories
- 18 Acquisition and Disruption of Category Specificity in the Ventral Visual Stream: The Case of Late Developing and Vulnerable Face-Related Cortex
- 19 Using Simple Features and Relations
- 20 The Proactive Brain: Using Memory-Based Predictions in Visual Recognition
- 21 Spatial Pyramid Matching
- 22 Visual Learning for Optimal Decisions in the Human Brain
- 23 Shapes and Shock Graphs: From Segmented Shapes to Shapes Embedded in Images
- 24 Neural Encoding of Scene Statistics for Surface and Object Inference
- 25 Medial Models for Vision
- 26 Multimodal Categorization
- 27 Comparing 2-D Images of 3-D Objects
- Index
- Plate section
Summary
Introduction
Visual classification presents many challenges, but in this chapter we will focus on the problem of generalization. This is the ability to learn about a new class of object from some images, and then to recognize instances of that class in very different images. To do this we must somehow account for variations in appearance due to changes in viewpoint, alterations in lighting, variations in the shape of different objects within the same class, and a myriad of other causes.
At the core of this generalization is the question: how best can we determine the similarity between a small set of 2-D images? Most of our visual input comes in the form of 2-D images. Our ability to classify is based on experience with a large number of images, but if we want to know how to get the most from many images, it makes sense to begin by asking how we can get the most from just a few images.
One of the most important constraints that we can use when comparing 2-D images is our knowledge that these are images of the 3-D world. We do not need to treat images as generic 2-D signals, and to compare them as such. Rather, we can base our comparisons on knowledge derived from the 3-D world. Geometry provides strong constraints on how the 2-D appearance of a 3-D object can change with viewpoint. The physics of light and the material properties of objects that reflect this light constrain how much variation there can be in the appearance of an object as the lighting varies.
- Type
- Chapter
- Information
- Object CategorizationComputer and Human Vision Perspectives, pp. 502 - 516Publisher: Cambridge University PressPrint publication year: 2009