|Jonathan R. Williford and Rüdiger von der Heydt (2013), Scholarpedia, 8(10):30040.||doi:10.4249/scholarpedia.30040||revision #186629 [link to/cite this article]|
Understanding visual scenes requires the visual system to infer the three-dimensional structure of the world from the two-dimensional projections on our retinae. This task is complicated by the fact that objects closer to the viewer will often block or occlude the view of objects farther away, as depicted in Figure 1. This produces visual borders that are owned by the closer, occluding objects. For example, the border at point A in Figure 1 is owned by the koala. Gestalt psychologists were the first to notice the importance of border-ownership in perception. Since the occluding borders define the shape of the foreground, the perception of form depends on the correct assignment of these borders (e.g. Figure 2). Edgar Rubin created an intriguing figure in which borders can be perceptually assigned either way, with the effect that different shapes are perceived depending on how the borders are assigned (Figure 3, Rubin, 1915). Rubin noticed that perception becomes unstable with such figures, slowly flipping back and forth between the alternative interpretations. These compulsive alternations of perception and their specific timing made it obvious for the first time that mechanisms exist in the brain that strive for an interpretation of the image, and that there is a neural substrate that represents the interpretations.
When looking at natural scenes, perceiving which borders belong to which objects seems like a trivial task. However, it is unknown how the visual system is able to accomplish this feat, and no computational solution is currently known that approaches the reliability of the primate visual system (see Hoiem, Efros, & Hebert, 2011 and Borenstein & Ullman, 2008 for recent approaches).
The process of distinguishing foreground and background in drawings or other two-dimensional displays is generally referred to as "figure-ground organization", which seems to imply that the brain labels regions as figure (the closer, occluding region) and ground (the more distant, occluded region). In a more complex example involving multiple occlusion levels, the different regions might be labeled with a relative depth order. Labeling regions this way and assigning border-ownership might seem like equivalent ways of representing the occlusion structure of a scene. However, cases in which either an object occludes itself or multiple objects mutually occlude each other (Figure 1) cannot be represented by region labeling. In contrast, one can code any complexity of occlusion structure by assigning border-ownership of the occluding contours.
Studies of the visual cortex have provided evidence for both kinds of coding. Region labeling was discovered by Victor Lamme (1995) who found that certain neurons fired at a higher rate in response to stimulus elements in a figure region compared to elements in the ground region (Figure 4C). Border-ownership coding was discovered by Zhou, Friedman, and von der Heydt (2000) who noticed that certain neurons responded to borders with different firing rates depending on whether the border was owned by a figure on one side or the other (Figure 4B). To date, it is not clear how these two coding schemes in the visual cortex are related.
This article reviews border-ownership coding in the early visual cortex. Studies since Zhou et al. (2000) have not only revealed how the brain represents figure-ground organization but have also provided insight into the mechanisms of object representation and selective attention. For example, Qui, Sugihara, and von der Heydt (2007) found that the border-ownership representation emerges independently of attention, but provides a structure for object-based attention. O'Herron and von der Heydt found that border-ownership signals persist on the order of a second (2009, 2011) and can be "remapped" across eye movements and with object movements (2013). On average, the border-ownership signal appears to be consistent over multiple stimuli, such as various geometric shapes, transparent overlaying figures (Qiu & von der Heydt, 2007), stereoscopic displays (Qiu & von der Heydt, 2005), and dynamic occlusion displays (von der Heydt, Qiu, & He, 2003). It is interesting also that border-ownership does actually affect shape processing in infero-temporal cortex (IT), as suggested by the perceptual demonstrations of Figures 2 and 3. Baylis and Driver, who had previously studied the effect of border-ownership in perception, demonstrated that the responses of contour-shape selective IT neurons depend on border-ownership (Baylis and Driver, 2001).
While these studies are based on recordings of neurons in monkey visual cortex, the existence of border-ownership-selective neurons has also been demonstrated in the human visual cortex. In a psychophysical study, von der Heydt, Macuda, and Qiu (2005) demonstrated a border-ownership selective tilt after effect. Fang, Boyaci, and Kersten (2009) used an adaptation paradigm and fMRI to demonstrate a border-ownership-selective BOLD signal.
Border-ownership coding: Configural cues
Zhou et al. recorded isolated single cell neuronal activity from awake behaving macaques. For the initial experiment, called the standard test, the border of a uniform colored square was aligned to the receptive field of the neuron, rotated to match the neuron's preferred orientation. The stimuli within the classical receptive field (CRF) were kept constant while changing the side of the figure by concurrently reversing the colors of the figure and ground, as shown in Figure 5 (compare top and bottom displays). The border-ownership signal of a neuron was defined by the response to locally identical stimuli when the figure is on its preferred side compared to non-preferred side. For example, in the neuron shown in Figure 4B, the border-ownership signal is the difference between the red and blue line. Zhou et al. studied the visual areas V1, V2, and V4 and found border-ownership selective neurons in each area (Figure 6). In V2, these were more than 50% of the orientation selective neurons that responded to contrast edges (which constitute about 80% of all V2 neurons). In V1, less than 20% were border-ownership selective. In V4, the fraction was around 50%, but this is the percentage of neurons that could be activated with figure edges, which were about half of the cells encountered.
Due to the large proportion of border-ownership selective neurons and the amount of receptive field overlap in the early visual areas, there will be many neurons whose receptive field encode a given piece of a figure's border. Roughly half of these neurons will prefer the figure to be on one side of the border, while the other neurons will prefer the opposite side. The actual side of the figure will then be encoded at each location by the ratio of the firing rates of the neurons in two pools with opposing side preferences.
In addition to the standard test, some cells were also presented with C-figures and overlapping rectangles, like those shown in Figure 5. Both of these displays elicited a significant border-ownership in a smaller proportion of cells than the single square. However, when both the single square and one of the other figures elicited significant border-ownership signals, they were nearly always consistent (see Figure 27 of Zhou et al., 2000). It was also shown (for two example neurons) that the border-ownership signal was position invariant in the direction orthogonal to the border.
Extending the neurophysiological approach to a different perceptual situation, Qiu and von der Heydt (2007) showed that the same neurons also code for border-ownership according to the perception of transparent overlay. When 4 squares are arranged like in Figure 7B, it looks like one semi-transparent bar is overlaying another. With this interpretation, the border in the receptive field (indicated with the red oval) would be owned by the left. If the corners are rounded, as in Figure 7C, the perception is broken and 4 separate squares are seen and the border is owned by the right. In fact, the average border-ownership signal switched in agreement with the perceptual interpretation.
Zhang and von der Heydt (2010) explored the contribution of individual edges to the border-ownership assignment by using contour-defined squares (akin to the Cornsweet illusion figure) and decomposing the contour into fragments. Fragments on the preferred side-of-figure produced facilitation, while fragments on the opposite side produced suppression. The timing of the contributions of the fragments was similar regardless of their proximity to the CRF.
The motion around visual borders can indicate the side of the occluding figure. When the region on one side of the border, but not the other, moves along with the border, then the side with consistent motion is seen as the side owning the border. Textures or other image features from the background will disappear or appear at the border. Von der Heydt, Qiu, and He (2003) tested neurons with displays where moving random dot patterns defined the border-ownership. There was significant correlation between the border-ownership signal elicited by the standard test and that elicited by moving dots.
Random-dot stereograms are paired images where the forms of objects are only visible when the images are viewed stereoscopically. Von der Heydt, Zhou, and Friedman (2000) used such stereograms to study the form processing of the supragranular layers of V1 and area V2. Both visual areas contain neurons that respond preferentially to surfaces at a specific depth. However, in area V2, but not V1, neurons were found that responded to borders of figures defined by stereoscopic depth and were tuned to the orientation of the borders, just as they were tuned to contrast-defined edges. Furthermore, most of these neurons also fired at a higher rate when a specific side of the border (the preferred side) was closer than the other. In other words, border-ownership signals can be elicited by the stereoscopic depth order of the edge.
Another study explored the relationship between the border-ownership signal elicited by a solid colored square and the border-ownership signal elicited by stereoscopic depth (Qiu and von der Heydt, 2005). In area V2, 22% of the neurons (37/174) were selective for border-ownership with the contrast-defined figure (without depth) as well as for border-ownership defined by depth in random-dot stereograms (which are devoid of contrast edges). Of this subset, 81% (30/37) had the same preferred side for both stimuli. This correlation shows that the neurons combine different figure-ground cues in a meaningful way. One cue is stereoscopic depth order, the other cue is the global configuration of edges. At contours of occluding objects in the real world, stereoscopic depth is 'near' on the object side relative to the background side. Thus, the observed correlation shows that the visual system treats a figure on a computer display like a real object occluding a background.
Framework for attention
The relationship between selective attention and border-ownership coding was explored by Qiu, Sugihara, and von der Heydt (2007). Their findings suggest that the mechanisms responsible for border-ownership coding provide a structure for object-based attention. They used a shape discrimination task to manipulate selective attention, and independently varied border-ownership. While fixating, the monkeys were presented with 3 figures and were rewarded when they correctly discriminated the cued figure as being either a rectangle or a trapezoid. One of the figures was cued at the beginning of a block of trials, and then one of the other figures for the next block etc.
The authors discovered that border-ownership coding at any of the figures still occurred when attention was directed elsewhere on the screen. The strength of the border-ownership signals at one figure decreased only slightly when the attention was directed away compared to when attention was at that figure.
Qiu, Sugihara, and von der Heydt (2007) also found an asymmetry of the attention effect. When two overlapping figures are presented, a neuron responding to the occluding border shows response enhancement when the figure on one side is attended compared the figure on the other side, irrespective of border-ownership (given by the direction of overlap). Each neuron has its preferred side of attention. Interestingly, preferred side of attention and preferred side of border-ownership are correlated. As a result, the responses to the four combinations of overlap and side of attention, averaged over neurons, vary as depicted in Figure 8. The correlation of the attention effect with the border-ownership preference suggests that the same mechanism that gives rise to border-ownership also mediates attentional modulation.
Persistence and remapping of border-ownership signals
When looking at visual scenes, humans and many other animals do not maintain a fixed eye position, but continuously make saccades, several times per second. Even though a large part of our visual system is retinotopically organized, we maintain a stable visual perception. O'Herron and von der Heydt (2009,2011) discovered that the border-ownership signals in V2 neurons often persist for over a second when the figure-ground assignment becomes ambiguous (see Figure 9). Even more interesting, this border-ownership persistence can be remapped during saccades and moves with the ambiguous displays if they jump to a new location (O'Herron & von der Heydt, 2013). These findings show that border-ownership selectivity reflects a mechanism that helps to maintain a stable visual percept.
O'Herron and von der Heydt (2009) aligned an edge of a square to the CRF at the preferred orientation, as shown in Figure 9. A stereoscopic display was used to make the circle appear as a window, with the outside region appearing a few cm in front of the stimuli within the circular window. The square was presented for 500 ms, and then switched to an ambiguous display (Figure 9A) where the border could be owned by either side. The authors analyzed the border-ownership modulation in the persistence phase (from 200 ms to 1000 ms after ambiguous display onset). Looking at the spike counts during this interval persistence varied a lot between cells, from no persistence to nearly complete persistence. However, the time course of the population signal showed a slow steady decay with a time constant of 400 ms.
One possible explanation for this persistence might be that an afterimage is responsible for this effect. This was ruled out by continuously inverting the colors during the figure display at a fast rate before switching to the static phase with the ambiguous display. While the responses would sometimes oscillate in the figure phase, due to selectivity for edge contrast polarity, the border-ownership signal during the persistence phase was virtually identical to that after steady figure presentation. Afterimages, on the other hand, would be significantly reduced by the periodic color inversion.
The border-ownership signal did not depend much on the duration of the figure phase (50, 250, or 500 ms), suggesting that the signal doesn't accumulate. Also, when two figure-ambiguous sequences were presented in succession within a fixation period, the persisting signal in the last phase depended only on the immediately preceding figure display.
A border-ownership signal could also be produced by presenting the ambiguous edge with a few dots with 'far' disparity added on one side. This side is then perceived as background, and the border as owned by the other side. The population border-ownership signal pointed to that side accordingly, and persisted after the dots were removed.
O'Herron and von der Heydt (2009) also showed that the persistence is not the result of attention being attracted by the figure. It would be conceivable that attention would then linger on that side after the square is replaced by an ambiguous edge. This possibility was rejected by adding a second square to the display and having one of the squares, chosen randomly, appear 300 ms before the other. After another 300 ms during which both squares were present, a surface with two circular windows appeared in front that left only one edge of each square visible (one of which was in the CRF of the neuron being recorded). If an automatic shift of attention were responsible for border-ownership persistence, the border-ownership from the first figure should be interrupted by the display of the second figure. However, similar persistence was seen regardless of whether the figure at the CRF was shown first or second.
Neural models and constraints
As shown in the studies reviewed above, the primate brain has a remarkable ability to calculate border-ownership quickly, even when doing so requires contextual integration over large areas of the visual field. It is a challenge to model how border-ownership coding can be calculated so quickly, considering that the context information is spread out widely in cortex, and neural conduction velocity is limited. Based on the possible neural mechanisms of propagating context signals across the retinotopic cortical representation one can distinguish three general classes of models: feedforward, horizontal, and feedback.
Many neurons have regions outside of their CRF that modulate their response. These modulatory surrounds can be either suppressive or facilitative. Walker, Ohzawa, and Freeman (1999) found that the surround regions, measured with grating stimuli, are generally suppressive, and often asymmetric about the CRF. Motivated by these findings, Sakai and Nishimura (2006) showed that a model with asymmetric surround regions (a facilatory region on one side and a suppressive region on the other, Figure 10A), stochastically chosen for each neuron, can account in a statistical sense for the data of Zhou et al. (2000). Supèr, Romeo, and Keil (2010) proposed a feed-forward model that uses two stages of concentric center-surround mechanisms to calculate, first figure-ground modulation, and subsequently border-ownership assignment. However, the feedforward models are physiologically implausible. First, the anatomically defined forward connections are precisely what defines the CRF, whereas the non-classical surround is mediated by horizontal connections and feedback from higher areas (Angelucci, Levitt, & Lund, 2001). Second, the cited studies on surround modulation cannot explain the large range of the context influence in border-ownership modulation (10 times the extent of the CRF and more). And third, neither of the two model studies addresses the problem of limited conduction velocity and the short latency of border-ownership signals. The virtue of these models is their simplicity, but it is unclear if they can explain critical findings such as the strong border-ownership signals for displays of transparent overlay (Qiu & von der Heydt, 2007).
Lateral propagation models
Zhaoping (2005) proposed a model in which border-ownership is calculated within V2, relying on lateral connections (Figure 10B). Local borders are represented by two sets of cells, one for each direction of border assignment. The activity of these cells spreads through lateral connections, providing either enhancement or suppression, depending on the shape of the activating contour. By propagating activity along the representation of the contour, the network assigns border-ownership to the predominantly concave side of the contour. It does this correctly even for stretches of contour where the figure is on the convex side as in the case of a C-shaped figure. Sugihara, Qiu, and von der Heydt (2011) argue that such a model cannot explain the short latency of border-ownership signals because of the large distances the signals would have to travel along the contour representation in cortex and the low conduction velocity of horizontal fibers. They measured the latencies of the border-ownership signals for different sizes of squares and calculated the cortical distance to the nearest point of context information. They found that the recorded latencies did not increase as much as predicted by the model.
Feedback models of border-ownership coding (Craft, Schütze, Niebur, & von der Heydt, 2007; Jehee, Lamme, & Roelfsema, 2007) rely on higher level areas that have larger receptive fields and modulate the activity in the lower level areas via back projections. Craft et al. proposed a “grouping cell model” in which signals from edge selective cells in V2 are integrated by grouping cells (G) at a higher level. The G cells project back to the same cells they receive input from, facilitating their responses. The G cells have annular integration fields, which makes them most sensitive to compact shapes. For example, when a square figure excites the red set of V2 cells, the corresponding G cell is strongly activated and, by feedback, enhances the responses in the red set, whereas the G cell on the other side receives input only from one edge and is therefore only weakly activated. The feedback makes the V2 cells border-ownership selective.
This model can explain the large context integration and the short latency of the border-ownership signals, because the grouping cells can be in another cortical area so that the feedback signals would travel through white matter fibers which conduct about ten times faster than cortical horizontal fibers (Girard, Hupe, & Bullier, 2001). Also, the length of the connections does not increase in direct proportion to the size of the figure representation in V2 cortex, as would the required length of horizontal fiber connections. This explains the relative invariance of the latency with variation of the size of the squares.
Note that different stimuli may use different types of processing in order to calculate border-ownership. For example, displays in which objects are defined by the configuration of contours may require feedback projections to provide the context information to neurons in V1 and V2, while border-ownership in random-dot stereoscopic displays might be calculated in a feedforward manner.
Another important argument for the grouping cell model is that it can easily be extended to explain selective attention, which is known to spread within objects (Egly, Driver, & Rafal, 1994). Because a single grouping cell can facilitate all the feature neurons connected to it, a top-down attention signal only needs to excite a small cluster of grouping cells to enhance the entire contour of an object (Mihalas, Dong, von der Heydt, & Niebur, 2011). A simple consequence of the connection scheme of Figure 10C is the asymmetry of the attention effect observed by Qiu, Sugihara, and von der Heydt (2007): A given border-ownership cell, for example the red cell in the center of Figure 10C, is facilitated when a corresponding grouping cell is activated, that is, only when a figure on its preferred side of border-ownership is attended. Attention to a figure on the other side does not facilitate the cell, because the grouping cells on the other side project back to the opposing (blue) border-ownership cell.
O'Herron & von der Heydt (2013) showed how this model could be extended to explain the remapping of border-ownership signals.
- Angelucci, A.; Levitt, J. B. and Lund, J. S. (2002). Anatomical origins of the classical receptive field and modulatory surround field of single neurons in macaque visual cortical area V1. Progress in Brain Research 136: 373-388. doi:10.1016/S0079-6123(02)36031-X.
- Baylis, Gordon C. and Driver, Jon (2001). Shape-coding in IT cells generalizes over contrast and mirror reversal, but not figure-ground reversal. Nature Neuroscience 4 (9): 937-942. doi:10.1038/nn0901-937. ISSN 1097-6256. Archived from the original on 2013-07-19. http://www.nature.com/neuro/journal/v4/n9/abs/nn0901-937.html. Retrieved on 2013-07-19.
- Borenstein, Eran and Ullman, Shimon (2008). Combined top-down/bottom-up segmentation. Pattern Analysis and Machine Intelligence, IEEE Transactions on 30 (12): 2109–2125. Archived from the original on 2013-06-30. http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=4408584. Retrieved on 2013-06-30.
- Bregman, Albert S. and Pomerantz, James R. (1981). "Asking the "what for" question in auditory perception". in Michael Kubovy (ed.). Perceptual organization. pp. 99-118.
- Craft, E.; Schütze, H.; Niebur, E. and von der Heydt, R. (2007). A neural model of figure-ground organization. Journal of Neurophysiology 97 (6): 4310-4326.
- Egly, Robert; Driver, Jon and Rafal, Robert D. (1994). Shifting visual attention between objects and locations: Evidence from normal and parietal lesion subjects. Journal of Experimental Psychology: General 123: 161-177. doi:10.1037/0096-3418.104.22.168. ISSN (Print); 1939-2222 (Electronic) 0096-3445 (Print); 1939-2222 (Electronic).
- Fang, F.; Boyaci, H. and Kersten, D. (2009). Border Ownership Selectivity in Human Early Visual Cortex and its Modulation by Attention. The Journal of Neuroscience 29 (2): 460-465. doi:10.1523/JNEUROSCI.4628-08.2009. ISSN 1529-2401 0270-6474, 1529-2401. Archived from the original on 2012-12-10. http://www.jneurosci.org/content/29/2/460. Retrieved on 2012-12-10.
- Girard, P.; Hupe, J. M. and Bullier, J. (2001). Feedforward and feedback connections between areas V1 and V2 of the monkey have similar rapid conduction velocities. Journal of Neurophysiology 85: 1328-1331. doi:10.1016/S0928-4257(97)81426-X.
- Hoiem, D; Efros, A. A. and Hebert, M. (2011). Recovering Occlusion Boundaries from an Image. International Journal of Computer Vision 91 (3): 328-346. doi:10.1007/s11263-010-0400-4. ISSN 1573-1405 0920-5691, 1573-1405.
- Jehee, J. F.; Lamme, V. A. and Roelfsema, P. R. (2007). Boundary assignment in a recurrent network architecture. Vision Research 47: 1153-1165. doi:10.1016/j.visres.2006.12.018.
- Lamme, V. A. (1995). The neurophysiology of figure-ground segregation in primary visual cortex. The Journal of Neuroscience 15 (2): 1605-1615. doi:10.1523/jneurosci.15-02-01605.1995.
- Mihalas, S.; Dong, Yi; von der Heydt, R. and Niebur, E. (2011). Mechanisms of perceptual organization provide auto-zoom and auto-localization for attention to objects. Proceedings of the National Academy of Sciences 108 (18): 7583-7588. doi:10.1073/pnas.1014655108. ISSN 1091-6490 0027-8424, 1091-6490. Archived from the original on 2013-06-28. http://www.pnas.org/content/108/18/7583. Retrieved on 2013-06-28.
- O'Herron, P. and von der Heydt, R. (2009). Short-term memory for figure-ground organization in the visual cortex. Neuron 61 (5): 801-809.
- O'Herron, P. and von der Heydt, R. (2011). Representation of object continuity in the visual cortex. Journal of vision 11 (2). doi:10.1167/11.2.12.
- O'Herron, P. and von der Heydt, R. (2013). Remapping of Border Ownership in the Visual Cortex. The Journal of Neuroscience 33 (5): 1964-1974. doi:10.1523/JNEUROSCI.2797-12.2013. ISSN 1529-2401 0270-6474, 1529-2401.
- Qiu, F. T.; Sugihara, T. and von der Heydt, R. (2007). Figure-ground mechanisms provide structure for selective attention. Nature Neuroscience 10 (11): 1492-1499. ISSN 10976256.
- Qiu, F. T. and von der Heydt, R. (2005). Figure and ground in the visual cortex: V2 combines stereoscopic cues with Gestalt rules. Neuron 47 (1): 155-166. doi:10.1016/j.neuron.2005.05.028.
- Qiu, F. T. and von der Heydt, R. (2007). Neural representation of transparent overlay. Nature neuroscience 10 (3): 283-284. doi:10.1038/nn1853.
- Rubin, Edgar (1915). Synsoplevede figurer: studier i psykologisk analyse. 1. del. Gyldendalske Boghandel, Nordisk Forlag.
- Sakai, K. and Nishimura, H. (2006). Surrounding suppression and facilitation in the determination of border ownership. Journal of Cognitive Neuroscience 18 (4): 562-579. doi:10.1162/jocn.2006.18.4.562.
- Sugihara, T.; Qiu, F. T. and von der Heydt, R. (2011). The speed of context integration in the visual cortex. Journal of Neurophysiology 106 (1): 374-385. doi:10.1152/jn.00928.2010. ISSN 1522-1598 0022-3077, 1522-1598.
- Supèr, H.; Romeo, A. and Keil, M. (2010). Feed-Forward Segmentation of Figure-Ground and Assignment of Border-Ownership. PLoS ONE 5 (5): e10705. doi:10.1371/journal.pone.0010705.
- von der Heydt, R.; Macuda, T. and Qiu, F. T. (2005). Border-ownership-dependent tilt aftereffect. Journal of the Optical Society of America A 22 (10): 2222-2229. doi:10.1364/JOSAA.22.002222. Archived from the original on 2012-12-10. http://josaa.osa.org/abstract.cfm?URI=josaa-22-10-2222. Retrieved on 2012-12-10.
- von der Heydt, R.; Qiu, F. T. and He, Z. J. (2003). Neural mechanisms in border ownership assignment: motion parallax and gestalt cues. Journal of Vision 3 (9): 666-666. doi:10.1167/3.9.666. ISSN 1534-7362 , 1534-7362.
- von der Heydt, R.; Zhou, H. and Friedman, H. S. (2000). Representation of stereoscopic edges in monkey visual cortex. Vision research 40 (15): 1955-1967. doi:10.1016/S0042-6989(00)00044-4.
- Walker, G. A.; Ohzawa, I. and Freeman, R. D. (1999). Asymmetric Suppression Outside the Classical Receptive Field of the Visual Cortex. The Journal of Neuroscience 19 (23): 10536-10553. doi:10.1017/S0952523800173055. ISSN 1529-2401 0270-6474, 1529-2401.
- Zhang, N. R. and von der Heydt, R. (2010). Analysis of the context integration mechanisms underlying figure-ground organization in the visual cortex. The Journal of Neuroscience 30 (19): 6482-6496. doi:10.1523/JNEUROSCI.5168-09.2010.
- Zhaoping, L. (2005). Border ownership from intracortical interactions in visual area V2. Neuron 47 (1): 143-153. doi:10.1016/j.neuron.2005.04.005.
- Zhou, H.; Friedman, H. S. and von der Heydt, R. (2000). Coding of border ownership in monkey visual cortex. The Journal of Neuroscience 20 (17): 6594-6611. doi:10.1113/jphysiol.2002.033555.
- Nakayama, Ken; Shimojo, Shinsuke and Silverman, Gerald H. (1989). Stereoscopic depth: its relation to image segmentation, grouping, and the recognition of occluded objects. Perception 18 (1): 55-68. doi:10.1068/p180055.
- Nakayama, Ken; He, Zijiang J. and Shimojo, Shinsuke (1995). Visual surface representation: A critical link between lower-level and higher-level vision. Visual cognition: An invitation to cognitive science 2: 1-70. doi:10.1163/156856893X00135.
- Driver, Jon and Baylis, Gordon C. (1996). Edge-Assignment and Figure-Ground Segmentation in Short-Term Visual Matching. Cognitive Psychology 31 (3): 248-306. doi:10.1006/cogp.1996.0018. ISSN 0010-0285.