First-Person Pose Recognition using Egocentric Workspaces - Robotics Institute Carnegie Mellon University

First-Person Pose Recognition using Egocentric Workspaces

G. Rogez, J. Supancic, and D. Ramanan
Conference Paper, Proceedings of (CVPR) Computer Vision and Pattern Recognition, pp. 4325 - 4333, June, 2015

Abstract

We tackle the problem of estimating the 3D pose of an individual's upper limbs (arms+hands) from a chest mounted depth-camera. Importantly, we consider pose estimation during everyday interactions with objects. Past work shows that strong pose+viewpoint priors and depth-based features are crucial for robust performance. In egocentric views, hands and arms are observable within a well defined volume in front of the camera. We call this volume an egocentric workspace. A notable property is that hand appearance correlates with workspace location. To exploit this correlation, we classify arm+hand configurations in a global egocentric coordinate frame, rather than a local scanning window. This greatly simplify the architecture and improves performance. We propose an efficient pipeline which 1) generates synthetic workspace exemplars for training using a virtual chest-mounted camera whose intrinsic parameters match our physical camera, 2) computes perspective-aware depth features on this entire volume and 3) recognizes discrete arm+hand pose classes through a sparse multi-class SVM. We achieve state-of-the-art hand pose recognition performance from egocentric RGB-D images in real-time.

BibTeX

@conference{Rogez-2015-121187,
author = {G. Rogez and J. Supancic and D. Ramanan},
title = {First-Person Pose Recognition using Egocentric Workspaces},
booktitle = {Proceedings of (CVPR) Computer Vision and Pattern Recognition},
year = {2015},
month = {June},
pages = {4325 - 4333},
}