Monocular object detection using 3d geometric primitives - Robotics Institute Carnegie Mellon University

Monocular object detection using 3d geometric primitives

Peter Carr, Yaser Sheikh, and Iain Matthews
Conference Paper, Proceedings of (ECCV) European Conference on Computer Vision, pp. 864 - 878, October, 2012

Abstract

Multiview object detection methods achieve robustness in adverse imaging conditions by exploiting projective consistency across views. In this paper, we present an algorithm that achieves performance comparable to multiview methods from a single camera by employing geometric primitives as proxies for the true 3D shape of objects, such as pedestrians or vehicles. Our key insight is that for a calibrated camera, geometric primitives produce predetermined location-specific patterns in occupancy maps. We use these to define spatially-varying kernel functions of projected shape. This leads to an analytical formation model of occupancy maps as the convolution of locations and projected shape kernels. We estimate object locations by deconvolving the occupancy map using an efficient template similarity scheme. The number of objects and their positions are determined using the mean shift algorithm. The approach is highly parallel because the occupancy probability of a particular geometric primitive at each ground location is an independent computation. The algorithm extends to multiple cameras without requiring significant bandwidth. We demonstrate comparable performance to multiview methods and show robust, realtime object detection on full resolution HD video in a variety of challenging imaging conditions.

BibTeX

@conference{Carr-2012-122242,
author = {Peter Carr and Yaser Sheikh and Iain Matthews},
title = {Monocular object detection using 3d geometric primitives},
booktitle = {Proceedings of (ECCV) European Conference on Computer Vision},
year = {2012},
month = {October},
pages = {864 - 878},
}