Where’s Waldo: Matching People in Images of Crowds

Rahul Garg, Steven M. Seitz, Deva Ramanan, and Noah Snavely

Conference Paper, Proceedings of (CVPR) Computer Vision and Pattern Recognition, pp. 1793 - 1800, June, 2011

Abstract

Given a community-contributed set of photos of a crowded public event, this paper addresses the problem of finding all images of each person in the scene. This problem is very challenging due to large changes in camera viewpoints, severe occlusions, low resolution and photos from tens or hundreds of different photographers. Despite these challenges, the problem is made tractable by exploiting a variety of visual and contextual cues-appearance, time-stamps, camera pose and co-occurrence of people. This paper demonstrates an approach that integrates these cues to enable high quality person matching in community photo collections downloaded from Flickr.com.

BibTeX

@conference{Garg-2011-121214,
author = {Rahul Garg and Steven M. Seitz and Deva Ramanan and Noah Snavely},
title = {Where's Waldo: Matching People in Images of Crowds},
booktitle = {Proceedings of (CVPR) Computer Vision and Pattern Recognition},
year = {2011},
month = {June},
pages = {1793 - 1800},
}

Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.