Masking within and across visual dimensions: Psychophysical evidence for perceptual segregation of color and motion

SAMUEL W. CHEADLE; SEMIR ZEKI

doi:10.1017/S0952523811000228

Masking within and across visual dimensions: Psychophysical evidence for perceptual segregation of color and motion

Published online by Cambridge University Press: 11 August 2011

SAMUEL W. CHEADLE and

SEMIR ZEKI

Show author details

SAMUEL W. CHEADLE*: Affiliation:
Wellcome Laboratory of Neurobiology, Anatomy Department, University College London, London, UK
SEMIR ZEKI: Affiliation:
Wellcome Laboratory of Neurobiology, Anatomy Department, University College London, London, UK
*: *Address correspondence and reprint requests to: Samuel W. Cheadle, Wellcome Laboratory of Neurobiology, Anatomy Department, University College London, London WC1E 6BT, UK. E-mail: [email protected]

Article contents

Abstract
Introduction
Experiment 1: Feature-selective masking
Experiment 2: Time course of the homogeneous masking effect
Experiment 3: Extending the time course: Forward and backward masking
Discussion
Footnotes
References

Rights & Permissions

Abstract

Visual masking can result from the interference of perceptual signals. According to the principle of functional specialization, interference should be greatest when signal and mask belong to the same visual attribute (e.g., color or motion) and least when they belong to different ones. We provide evidence to support this view and show that the time course of masking is visual attribute specific. First, we show that a color target is masked most effectively by color (homogeneous target-mask pair) and least effectively by motion (heterogeneous pair) and vice versa for a motion target. Second, we show that the time at which the mask is most effective depends strongly on the target-mask pairing. Heterogeneous masking is strongest when the mask is presented before the target (forward masking) but this is not true of homogeneous masking. This finding supports a delayed cross-feature interaction due to segregated processing sites. Third, lengthening the stimulus onset asynchrony between target and mask leads to a faster improvement in color than in motion detectability, lending support for a faster color processing system and consistent with reports of perceptual asynchrony in vision. In summary, we present three lines of psychophysical evidence, all of which support a segregated neural coding scheme for color and motion in the human brain.

Keywords

Functional specialization Visual masking Psychophysics

Type: Research Articles
Information: Visual Neuroscience , Volume 28 , Issue 5 , September 2011 , pp. 445 - 451

DOI: https://doi.org/10.1017/S0952523811000228 [Opens in a new window]
Copyright: Copyright © Cambridge University Press 2011The online version of this article is published within an Open Access environment subject to the conditions of the Creative Commons Attribution-NonCommercial-ShareAlike licence <http://creativecommons.org/licenses/by-nc-sa/3.0/>. The written permission of Cambridge University Press must be obtained for commercial re-use.

Introduction

Our seemingly effortless ability to perceive a world in which all the different visual attributes are in apparently precise temporal and spatial registration belies a complex cortical machinery, which decomposes the visual image into constituents such as form, color and motion, and processes them in separate and specialized visual areas. The evidence for this functional specialization in the primate visual brain comes from anatomical, electrophysiological (Zeki, Reference Zeki1978; DeYoe & van Essen, Reference DeYoe and Van Essen1988; Livingstone & Hubel, Reference Livingstone and Hubel1988; Zeki & Shipp, Reference Zeki and Shipp1988), and human imaging and clinical studies (Meadows, Reference Meadows1974; Zeki, Reference Zeki1990, Reference Zeki1991; Zeki et al., Reference Zeki, Watson, Lueck, Friston, Kennard and Frackowiak1991; Zihl et al., Reference Zihl, von Cramon, Mai and Schmid1991). This functional specialization has, moreover, temporal consequences since we perceive different attributes at different times, color taking temporal precedence over orientation, and orientation over motion (Moutoussis & Zeki, Reference Moutoussis and Zeki1997a ,Reference Moutoussis and Zeki b ; Zeki & Moutoussis, Reference Zeki and Moutoussis1997; Barbur et al., Reference Barbur, Wolf and Lennie1998; Arnold et al., Reference Arnold, Clifford and Wenderoth2001).

Of all the visual attributes, perhaps the easiest to separate both physiologically and perceptually are color and motion, color being associated with activity of the V4 complex and motion with activity of a separate system, based primarily on the area V5 (Zeki, Reference Zeki1978; Livingstone & Hubel, Reference Livingstone and Hubel1988; Zeki et al., Reference Zeki, Watson, Lueck, Friston, Kennard and Frackowiak1991). The evidence in favor of the separation of motion and color also comes from psychophysical experiments, which show that motion detection is impaired under conditions of equiluminance (Ramachandran & Gregory, Reference Ramachandran and Gregory1978; Cavanagh et al., Reference Cavanagh, Tyler and Favreau1984), indicating that the motion system, although sensitive to chromatic signals, does not contain neurons tuned to specific hues (Gouras & Kruger, Reference Gouras and Kruger1979; Dobkins & Albright, Reference Dobkins and Albright1994). Additional psychophysical evidence is consistent with functional specialization for other visual dimensions (Krumhansl, Reference Krumhansl1984; Livingston & Hubel, 1987; Theeuwes, Reference Theeuwes1992; Hong & Shevell, Reference Hong and Shevell2006; Hong & Blake, Reference Hong and Blake2009).

In the study reported here, we investigate functional specialization psychophysically using a visual masking paradigm, by examining the strength of interference between two perceptual signals, either arising from the same visual attribute (homogeneous target-mask pairs) or from different ones (heterogeneous target-mask pairs). Masking refers to the impaired detectability of a target stimulus when immediately preceded or succeeded by a task-irrelevant visual input, referred to as the mask (Breitmeyer & Ogmen, Reference Breitmeyer and Ogmen2006). Visual temporal masking has been reported in both the motion (Braddick, Reference Braddick1973; Ferrera & Wilson, Reference Ferrera and Wilson1987) and the color domain (Schmidt, Reference Schmidt2002; Breitmeyer et al., Reference Breitmeyer, Ro and Singhal2004) but not across the two. Moreover, although masking of a target color with a color mask has been reported in two studies (Schmidt, Reference Schmidt2002; Breitmeyer et al., Reference Breitmeyer, Ro and Singhal2004), both employed a metacontrast masking technique, in which the target and mask regions were nonspatially overlapping. Because this type of masking has been hypothesized to rely on a form of “motion deblurring” (Ansorge et al., Reference Ansorge, Francis, Herzog and Ögmen2007) rather than direct interference between target and mask signals, we chose to use the simplified backwards masking technique, in which the target and the mask overlap in space. This alone would enable us to draw conclusions regarding a functional specialization.

In our study, we manipulated the relationship between the target and mask, such that the target-mask pairing was either homogeneous (e.g., color target and color mask) or heterogeneous (e.g., color target and motion mask). If regions or cells in the visual system are nonspecialized and respond to multiple visual features (integrated representations), mask strength should remain constant across conditions (Fig. 1, Panel C). If cortical representations are exclusively integrated, it should be impossible to selectively mask one feature (e.g., color), while sparing the other (e.g., motion). This would not be true if the demonstrated functional specialization in the cortex is perceptually potent, that is, if signals from target and mask are processed in separate cortical sites or by different cells, when competition or interference will take place over a different time course, and is likely to be weaker (Fig. 1, Panel B).

Fig. 1. Interaction of target and mask signals. (A) illustrates the physical stimulus, comprised of target and mask. (B) and (C) illustrate two possible ways, in which the visual cortex may represent the target and mask. In the case of a segregated representation (B), color and motion activate distinct and separate nodes (signified by the separate black arrow and green dot), whereas in the integrated case (C), both direction of motion and color are represented within the same node (signified by the green arrow). A color-specific masking effect would support the existence of distinct processing nodes because the interference produced by the mask (dashed black line) acts only on the target color node.

Our study is divided into three experiments. In the first, we report the effect of homogeneous and heterogeneous target-mask pairs at both short and long stimulus onset asynchronies (SOAs); functional specialization predicts weaker masking in the case of heterogeneous pairs. In the second experiment, we investigate the time course of homogeneous pair masking in more detail, with the aim of exposing perceptual asynchronies between the visual features of color and motion. In the third section, we test the prediction that heterogeneous masking only occurs when the mask is given sufficient processing time (i.e., when the mask occurs prior to the target).

Our results constitute a psychophysical demonstration of functional specialization for the processing of color and motion in the human visual system.

Experiment 1: Feature-selective masking

Method

Apparatus

For all experiments, stimuli were displayed on a Sony Trinitron Multi-scan E450 monitor (refresh rate of 140 Hz; Sony, Tokyo, Japan) and generated using the Cogent toolbox for MatLab on a windows XP machine.

Stimuli and Procedure

The target stimulus contained both color and motion, while the mask featured only a single attributeFootnote ¹ . Stimuli were presented on a gray background (6.9 cd/m²). The target was a fast moving (145 deg s⁻¹; left or right) colored circle (Fig. 2). It was presented for 35 ms and covered a region of 5.1 deg. Two types of mask were tested, a color mask which consisted of a uniformly colored bar (10.2 × 5.1 deg; 200 ms duration; Fig. 3A) and a motion mask generated from the horizontal cyclic left–right motion of two fast moving white circles (Fig. 3B), covering the target region. The target colors were green and yellow, while the mask colors were red and blueFootnote ² . Therefore, the target and mask colors could either be opponent or nonopponent pairs. Fig. 2 shows the four target-mask color pairs.

Fig. 2. Illustration of the different target and mask color pairs. The target was either yellow or green and the mask either red or blue. There the pairs consisted of either opponent or nonopponent colors. Note that the motion component of the stimulus is not shown.

Fig. 3. Schematic illustration of stimulus and task used in Experiment 2. In the examples (A) and (B), the target is identical in both cases; a rightward moving green dot. (A) Color masking stimulus, consisting of a red rectangle. (B) Motion masking stimulus, consisting of horizontal motion generated by achromatic white dots. For each display condition, observers were run on separate blocks, in which they had to report the color or motion direction of the target.

In the first experiment, one short and one long SOA condition was tested (0–21 msFootnote ³ and 504 ms, respectively). The long SOA is useful in ruling out confounding factors that could account for poor discrimination performance, such as general task difficulty or response confusion arising from the integration of target/mask information. Eighty trials per SOA were tested for each subject.

Observers

Ten subjects (average age 29 years; seven females) were tested on the initial version containing two different SOAs. All had normal or corrected to normal vision.

Procedure

Observers were instructed to report either the color or direction of motion (separate sessions) of the target and to ignore all features of the mask. The experiment used a two alternative forced-choice design and was performed in four sessions, run in a counterbalanced order. Each session was composed of blocks of 40 trials, with a break given after each. Observers completed a single practice block for each new task.

Results

Fig. 4A displays proportion-correct results for all conditions when a motion mask is used. At short SOAs, motion judgments are impaired (mean = 60%), but this is not true of color judgments (mean = 95%) where performance is at ceiling. A reversed pattern is shown in the complementary condition, employing a color mask (Fig. 4B).

Fig. 4. Proportion correct for all observers (n = 12), for two SOA conditions (short and long), and judgements of either color (first column, blue) or motion (second column, red). Panel (A) displays the case where a motion mask is used. Panel (B) displays the case where a color mask is used. Error bars represent 1 standard error (SE) (within subjects).

Statistical comparison reveals a significant difference at short SOAs for both mask types (Motion mask: t(11) = 8.86, P < 0.001; Color mask: t(11) = 4.63, P < 0.001), thus demonstrating a feature-selective masking effect. Conversely, there is no significant difference in scores for the long SOA conditions [Motion mask: t(11) = 1.48, P = 0.166; Color mask: t(11) = 1.65, P = 0.13], for which masking was predicted to be minimal. Crucially, masking is not only significantly stronger within a visual dimension but is also weak or absent across dimensions. For judgments of color, the motion mask had little or no effect; performance remained at ceiling (95%). Similarly, for judgments of motion, the color mask appears relatively ineffectual, although performance in this condition drops slightly (<90%). Thus, for the display settings used in this experiment, it is possible to strongly mask one feature, while having no effect on the other.

In a separate analysis of the color masking data, we segregated trials into those containing opponent and nonopponent color pairs. The results failed to show a greater masking effect for opponent color pairs, t(9) = 1.7, P = 0.13.

Experiment 2: Time course of the homogeneous masking effect

Feature selectivity of visual masking, as demonstrated in Experiment 1, lends clear support to the idea of segregated color and motion processing. Another method to investigate this separation is to examine differences in the masking time course. Previous studies, using a different paradigm, have argued for a faster color processing system than for motion, resulting in the generation of a color percept 70–80 ms before that of motion (Moutoussis & Zeki, Reference Moutoussis and Zeki1997a ). Can this perceptual asynchrony be revealed using a masking paradigm? More specifically, is detectability of color greater than that of motion, at the same SOA, for masks of equal strength? In this experiment, we measure color and motion detectability for homogeneous target-mask pairs, using a range of different target and mask intervals (SOAs).