Enhanced Visual Scene Understanding through Human-Robot Dialog - Robotics Institute Carnegie Mellon University

Enhanced Visual Scene Understanding through Human-Robot Dialog

M. Johnson-Roberson, Jeannette Bohg, Gabriel Skantze, Joakim Gustafson, Rolf Carlson, Babak Rasolzadeh, and Danica Kragic
Conference Paper, Proceedings of (IROS) IEEE/RSJ International Conference on Intelligent Robots and Systems, pp. 3342 - 3348, September, 2011

Abstract

We propose a novel human-robot-interaction framework for robust visual scene understanding. Without any a-priori knowledge about the objects, the task of the robot is to correctly enumerate how many of them are in the scene and segment them from the background. Our approach builds on top of state-of-the-art computer vision methods, generating object hypotheses through segmentation. This process is combined with a natural dialog system, thus including a `human in the loop' where, by exploiting the natural conversation of an advanced dialog system, the robot gains knowledge about ambiguous situations. We present an entropy-based system allowing the robot to detect the poorest object hypotheses and query the user for arbitration. Based on the information obtained from the human-robot dialog, the scene segmentation can be re-seeded and thereby improved. We present experimental results on real data that show an improved segmentation performance compared to segmentation without interaction.

BibTeX

@conference{Johnson-Roberson-2011-130250,
author = {M. Johnson-Roberson and Jeannette Bohg and Gabriel Skantze and Joakim Gustafson and Rolf Carlson and Babak Rasolzadeh and Danica Kragic},
title = {Enhanced Visual Scene Understanding through Human-Robot Dialog},
booktitle = {Proceedings of (IROS) IEEE/RSJ International Conference on Intelligent Robots and Systems},
year = {2011},
month = {September},
pages = {3342 - 3348},
}