AutoCaption: Automatic Caption Generation for Personal Photos - Robotics Institute Carnegie Mellon University

AutoCaption: Automatic Caption Generation for Personal Photos

Krishnan Ramnath, Simon Baker, Lucy Vanderwende, Motaz El-saban, Sudipta Sinha, Anitha Kannan, Noran Hassan, Michel Galley, Yi Yang, Deva Ramanan, Ro Bergamo, and Lorrenzo Torresani
Conference Paper, Proceedings of IEEE Winter Conference on Applications of Computer Vision (WACV '14), pp. 1050 - 1057, March, 2014

Abstract

AutoCaption is a system that helps a smartphone user generate a caption for their photos. It operates by uploading the photo to a cloud service where a number of parallel modules are applied to recognize a variety of entities and relations. The outputs of the modules are combined to generate a large set of candidate captions, which are returned to the phone. The phone client includes a convenient user interface that allows users to select their favorite caption, reorder, add, or delete words to obtain the grammatical style they prefer. The user can also select from multiple candidates returned by the recognition modules.

BibTeX

@conference{Ramnath-2014-121195,
author = {Krishnan Ramnath and Simon Baker and Lucy Vanderwende and Motaz El-saban and Sudipta Sinha and Anitha Kannan and Noran Hassan and Michel Galley and Yi Yang and Deva Ramanan and Ro Bergamo and Lorrenzo Torresani},
title = {AutoCaption: Automatic Caption Generation for Personal Photos},
booktitle = {Proceedings of IEEE Winter Conference on Applications of Computer Vision (WACV '14)},
year = {2014},
month = {March},
pages = {1050 - 1057},
}