Articulation-Aware Canonical Surface Mapping
Abstract
We tackle the tasks of: 1) predicting a Canonical Surface Mapping (CSM) that indicates the mapping from 2D pixels to corresponding points on a canonical template shape, and 2) inferring the articulation and pose of the template corresponding to the input image. While previous approaches rely on keypoint supervision for learning, we present an approach that can learn without such annotations. Our key insight is that these tasks are geometrically related, and we can obtain a supervisory signal by enforcing consistency among the predictions. We present results across a diverse set of animal object categories, showing that our method can learn articulation and CSM prediction from image collections using only foreground mask labels for training. We empirically show that allowing articulation helps learn more accurate CSM prediction, and that enforcing consistency with the predicted CSM is similarly critical for learning meaningful articulation.
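The key insight above is that CSM prediction and the articulated pose can supervise each other through geometric consistency: a pixel mapped to a point on the template should re-project back onto itself under the predicted camera. The sketch below is only a rough illustration of such a reprojection-consistency term, not the authors' implementation; it assumes a simplified pinhole camera, assumes the predicted articulation has already been applied to the template points, and all function names and tensor shapes are hypothetical.

    # Illustrative sketch (PyTorch) of a geometric cycle-consistency loss.
    # Assumptions (not from the paper's code): foreground pixels are given as
    # 2D coordinates, the CSM output has already been lifted to articulated
    # 3D template points, and the camera is a simple pinhole model.
    import torch

    def reprojection_consistency_loss(pixel_uv, surface_xyz, rotation,
                                      translation, focal, mask):
        """Penalize pixels whose template points do not re-project onto them.

        pixel_uv:    (N, 2) image coordinates of foreground pixels.
        surface_xyz: (N, 3) predicted 3D points on the (articulated) template.
        rotation:    (3, 3) predicted camera rotation.
        translation: (3,)   predicted camera translation.
        focal:       scalar focal length of the assumed pinhole camera.
        mask:        (N,)   foreground weights, e.g. from the mask label.
        """
        # Bring the template points into the camera frame.
        cam_xyz = surface_xyz @ rotation.T + translation          # (N, 3)

        # Pinhole projection back to the image plane.
        depth = cam_xyz[:, 2:3].clamp(min=1e-6)
        reproj_uv = focal * cam_xyz[:, :2] / depth                # (N, 2)

        # Cycle consistency: each pixel should map back to where it started.
        per_pixel = ((reproj_uv - pixel_uv) ** 2).sum(dim=-1)     # (N,)
        return (mask * per_pixel).sum() / mask.sum().clamp(min=1.0)

In the paper, consistency of this kind (together with mask reprojection) is what replaces keypoint supervision; the sketch omits the mask loss and the articulation model itself.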
BibTeX
@conference{Kulkarni-2020-127129,
author = {Nilesh Kulkarni and Abhinav Gupta and David Fouhey and Shubham Tulsiani},
title = {Articulation-Aware Canonical Surface Mapping},
booktitle = {Proceedings of (CVPR) Computer Vision and Pattern Recognition},
year = {2020},
month = {June},
pages = {449--458},
}