Canonical Locality Preserving Latent Variable Model for Discriminative Pose Inference

Y. Tian, L. Sigal, F. De la Torre, and Y. Jia

Journal Article, Image and Vision Computing, Vol. 31, No. 3, pp. 223 - 230, March, 2013

Abstract

Discriminative approaches for human pose estimation model the functional mapping, or conditional distribution, between image features and 3D poses. Learning such multi-modal models in high dimensional spaces, however, is challenging with limited training data; often resulting in over-fitting and poor generalization. To address these issues Latent Variable Models (LVMs) have been introduced. Shared LVMs learn a low dimensional representation of common causes that give rise to both the image features and the 3D pose. Discovering the shared manifold structure can, in itself, however, be challenging. In addition, shared LVM models are often non-parametric, requiring the model representation to be a function of the training set size. We present a parametric framework that addresses these shortcomings. In particular, we jointly learn latent spaces for both image features and 3D poses by maximizing the non-linear dependencies in the projected latent space, while preserving local structure in the original space; we then learn a multi-modal conditional density between these two low-dimensional spaces in the form of Gaussian Mixture Regression. With this model we can address the issue of over-fitting and generalization, since the data is denser in the learned latent space, as well as avoid the need for learning a shared manifold for the data. We quantitatively compare the performance of the proposed method to several state-of-the-art alternatives, and show that our method gives a competitive performance.

BibTeX

@article{Tian-2013-120780,
author = {Y. Tian and L. Sigal and F. De la Torre and Y. Jia},
title = {Canonical Locality Preserving Latent Variable Model for Discriminative Pose Inference},
journal = {Image and Vision Computing},
year = {2013},
month = {March},
volume = {31},
number = {3},
pages = {223 - 230},
}

Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.