Categorizing Cubes: Revisiting Pose Normalization

M. Hejrati and D. Ramanan

Conference Paper, Proceedings of IEEE Winter Conference on Applications of Computer Vision (WACV '16), March, 2016

Abstract

This paper introduces and analyzes the novel task of categorical classification of cuboidal objects - e.g., distinguishing washing machines versus filing cabinets. To do so, it makes use of recent methods for automatic alignment of cuboidal objects in images. Given such geometric alignments, the natural approach for recognition might extract pose-normalized appearance features from a canonically-aligned coordinate frame. Though such approaches are extraordinarily common, we demonstrate that they are not optimal, both theoretically and empirically. One reason is that such approaches require accurate shape alignment. However, even with ground-truth alignment, pose-normalized representations may still be sub-optimal. Instead, we introduce methods based on pose-synthesis, a somewhat simple approach of augmenting training data with geometrically perturbed training samples. We demonstrate, both theoretically and empirically, that synthesis is a surprisingly simple but effective strategy that allows for state-of-the-art categorization and automatic 3D alignment. To aid our empirical analysis, we introduce a novel dataset for cuboidal object categorization.

BibTeX

@conference{Hejrati-2016-121181,
author = {M. Hejrati and D. Ramanan},
title = {Categorizing Cubes: Revisiting Pose Normalization},
booktitle = {Proceedings of IEEE Winter Conference on Applications of Computer Vision (WACV '16)},
year = {2016},
month = {March},
}

Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.