Eye movement prediction and variability on natural video data sets

Michael Dorr*, Eleonora Vig, Erhardt Barth

*Corresponding author for this work

Abstract

Here, we study the predictability of eye movements when viewing high-resolution natural videos. We use three recently published gaze data sets that contain a wide range of footage, from scenes of almost still-life character to professionally made, fast-paced advertisements and movie trailers. Intersubject gaze variability differs significantly between data sets, with variability being lowest for the professional movies. We then evaluate three state-of-the-art saliency models on these data sets. A model that is based on the invariants of the structure tensor and that combines very generic, sparse video representations with machine learning techniques outperforms the two reference models; performance is further improved for two data sets when the model is extended to a perceptually inspired colour space. Finally, a combined analysis of gaze variability and predictability shows that eye movements on the professionally made movies are the most coherent (due to implicit gaze-guidance strategies of the movie directors), yet the least predictable (presumably due to the frequent cuts). Our results highlight the need for standardized benchmarks to comparatively evaluate eye movement prediction algorithms.
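The abstract refers to a model built on the invariants of the spatiotemporal structure tensor. As a minimal illustrative sketch (not the authors' implementation; function names, the smoothing parameter, and the gradient filters are assumptions), the invariants of the 3×3 tensor can be computed per pixel as the trace H, the sum of 2×2 principal minors S, and the determinant K:

```python
# Illustrative sketch only: spatiotemporal structure tensor invariants,
# as commonly used for intrinsic-dimensionality-based video saliency.
# Function and parameter names are hypothetical, not from the paper.
import numpy as np
from scipy.ndimage import gaussian_filter, sobel

def structure_tensor_invariants(video, sigma=2.0):
    """video: 3-D grayscale array (t, y, x). Returns invariant maps H, S, K."""
    # Spatiotemporal gradients along t, y, x.
    dt = sobel(video, axis=0)
    dy = sobel(video, axis=1)
    dx = sobel(video, axis=2)
    grads = (dx, dy, dt)

    # Structure tensor J: smoothed outer products of the gradients.
    J = np.empty((3, 3) + video.shape)
    for i in range(3):
        for j in range(3):
            J[i, j] = gaussian_filter(grads[i] * grads[j], sigma)

    # Invariants of the 3x3 tensor at every pixel:
    # H = trace, S = sum of 2x2 principal minors, K = determinant.
    H = J[0, 0] + J[1, 1] + J[2, 2]
    S = (J[0, 0] * J[1, 1] - J[0, 1] * J[1, 0]
         + J[0, 0] * J[2, 2] - J[0, 2] * J[2, 0]
         + J[1, 1] * J[2, 2] - J[1, 2] * J[2, 1])
    K = np.linalg.det(np.moveaxis(J, (0, 1), (-2, -1)))
    return H, S, K

if __name__ == "__main__":
    # Toy example on a random clip; real use would load an actual video.
    clip = np.random.rand(16, 64, 64)
    H, S, K = structure_tensor_invariants(clip)
    print(H.shape, S.shape, K.shape)
```

Nonzero K, S, or H indicates regions of intrinsic dimensionality 3, ≥2, or ≥1, respectively; such maps would then feed the sparse representations and learning stage described in the abstract.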

Original language: English
Journal: Visual Cognition
Volume: 20
Issue number: 4-5
Pages (from-to): 495-514
Number of pages: 20
ISSN: 1350-6285
DOIs
Publication status: Published - 01.04.2012
