A Large-scale Benchmark Dataset for Event Recognition in Surveillance Video - Robotics Institute Carnegie Mellon University

A Large-scale Benchmark Dataset for Event Recognition in Surveillance Video

Sangmin Oh, Anthony Hoogs, Amitha Perera, Naresh Cuntoor, Chia-Chih Chen, Jong Taek Lee, Saurajit Mukherjee, J. K. Aggarwal, Hyungtae Lee, Larry Davis, Eran Swears, Xioyang Wang, Qiang Ji, Kishore Reddy, Mubarak Shah, Carl Vondrick, Hamed Pirsiavash, Deva Ramanan, Jenny Yuen, Antonio Torralba, Bi Song, Anesco Fong, Amit Roy-Chowdhury, and Mita Desai
Conference Paper, Proceedings of (CVPR) Computer Vision and Pattern Recognition, pp. 3153 - 3160, June, 2011

Abstract

We introduce a new large-scale video dataset designed to assess the performance of diverse visual event recognition algorithms with a focus on continuous visual event recognition (CVER) in outdoor areas with wide coverage. Previous datasets for action recognition are unrealistic for real-world surveillance because they consist of short clips showing one action by one individual [15, 8]. Datasets have been developed for movies [11] and sports [12], but, these actions and scene conditions do not apply effectively to surveillance videos. Our dataset consists of many outdoor scenes with actions occurring naturally by non-actors in continuously captured videos of the real world. The dataset includes large numbers of instances for 23 event types distributed throughout 29 hours of video. This data is accompanied by detailed annotations which include both moving object tracks and event examples, which will provide solid basis for large-scale evaluation. Additionally, we propose different types of evaluation modes for visual recognition tasks and evaluation metrics along with our preliminary experimental results. We believe that this dataset will stimulate diverse aspects of computer vision research and help us to advance the CVER tasks in the years ahead.

BibTeX

@conference{Oh-2011-121213,
author = {Sangmin Oh and Anthony Hoogs and Amitha Perera and Naresh Cuntoor and Chia-Chih Chen and Jong Taek Lee and Saurajit Mukherjee and J. K. Aggarwal and Hyungtae Lee and Larry Davis and Eran Swears and Xioyang Wang and Qiang Ji and Kishore Reddy and Mubarak Shah and Carl Vondrick and Hamed Pirsiavash and Deva Ramanan and Jenny Yuen and Antonio Torralba and Bi Song and Anesco Fong and Amit Roy-Chowdhury and Mita Desai},
title = {A Large-scale Benchmark Dataset for Event Recognition in Surveillance Video},
booktitle = {Proceedings of (CVPR) Computer Vision and Pattern Recognition},
year = {2011},
month = {June},
pages = {3153 - 3160},
}