Learning to Parse Images of Articulated Bodies

Conference Paper, Proceedings of (NeurIPS) Neural Information Processing Systems, pp. 1129 - 1136, December, 2006

Abstract

We consider the machine vision task of pose estimation from static images, specifically for the case of articulated objects. This problem is hard because of the large number of degrees of freedom to be estimated. Following a established line of research, pose estimation is framed as inference in a probabilistic model. In our experience however, the success of many approaches often lie in the power of the features. Our primary contribution is a novel casting of visual inference as an iterative parsing process, where one sequentially learns better and better features tuned to a particular image. We show quantitative results for human pose estimation on a database of over 300 images that suggest our algorithm is competitive with or surpasses the state-of-the-art. Since our procedure is quite general (it does not rely on face or skin detection), we also use it to estimate the poses of horses in the Weizmann database.

BibTeX

@conference{Ramanan-2006-121229,
author = {D. Ramanan},
title = {Learning to Parse Images of Articulated Bodies},
booktitle = {Proceedings of (NeurIPS) Neural Information Processing Systems},
year = {2006},
month = {December},
pages = {1129 - 1136},
}

Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.