Predicting
eye fixations is typically done with some form of the saliency map, e.g. (Itti, Tatler, Rajashekar).
However the structural analysis of those approaches is rather sparse and can
not explain why humans for instance foveate near intersecting lines (see
below). In contrast, the structural decomposition
I have developed, can for instance explain all the pop-out phenomena observed
in human visual search,
by simply taking the variance of the vector descriptors (contour & areas). To apply the
methodology to fixation prediction as the one below, I need to elaborate it a
bit more, by for instance creating grouping algorithms. Together with Ben Tatler at the
University of Dundee, I have also started to analyse saccadic target selection in natural scenes.
If shown a line drawing, humans tend to foveate clusters of intersecting
lines (from Noton &
Stark 1971):