Representing Pairwise Spatial and Temporal Relations for Action Recognition - Robotics Institute Carnegie Mellon University

Representing Pairwise Spatial and Temporal Relations for Action Recognition

Conference Paper, Proceedings of (ECCV) European Conference on Computer Vision, pp. 508 - 521, September, 2010

Abstract

The popular bag-of-words paradigm for action recognition tasks is based on building histograms of quantized features, typically at the cost of discarding all information about relationships between them. However, although the beneficial nature of including these relationships seems obvious, in practice finding good representations for feature relationships in video is difficult. We propose a simple and computationally efficient method for expressing pairwise relationships between quantized features that combines the power of discriminative representations with key aspects of Naive Bayes. We demonstrate how our technique can augment both appearance- and motion-based features, and that it significantly improves performance on both types of features.

BibTeX

@conference{Matikainen-2010-10525,
author = {Pyry K. Matikainen and Martial Hebert and Rahul Sukthankar},
title = {Representing Pairwise Spatial and Temporal Relations for Action Recognition},
booktitle = {Proceedings of (ECCV) European Conference on Computer Vision},
year = {2010},
month = {September},
editor = {K. Daniilidis, P. Maragos, N. Paragios},
pages = {508 - 521},
keywords = {action recognition, spatial relationships},
}