SEAL: Self-supervised Embodied Active Learning using Exploration and 3D Consistency

Devendra Chaplot, Murtaza Dalal, Saurabh Gupta, Jitendra Malik, and Ruslan Salakhutdinov

Conference Paper, Proceedings of (NeurIPS) Neural Information Processing Systems, December, 2021

View Publication

Abstract

In this paper, we explore how we can build upon the data and models of Internet images and use them to adapt to robot vision without requiring any extra labels. We present a framework called Self-supervised Embodied Active Learning (SEAL). It utilizes perception models trained on internet images to learn an active exploration policy. The observations gathered by this exploration policy are labelled using 3D consistency and used to improve the perception model. We build and utilize 3D semantic maps to learn both action and perception in a completely self-supervised manner. The semantic map is used to compute an intrinsic motivation reward for training the exploration policy and for labelling the agent observations using spatio-temporal 3D consistency and label propagation. We demonstrate that the SEAL framework can be used to close the action-perception loop: it improves object detection and instance segmentation performance of a pretrained perception model by just moving around in training environments and the improved perception model can be used to improve Object Goal Navigation.

BibTeX

@conference{Chaplot and Dalal-2021-142754,
author = {Devendra Chaplot and Murtaza Dalal and Saurabh Gupta and Jitendra Malik and Ruslan Salakhutdinov},
title = {SEAL: Self-supervised Embodied Active Learning using Exploration and 3D Consistency},
booktitle = {Proceedings of (NeurIPS) Neural Information Processing Systems},
year = {2021},
month = {December},
keywords = {exploration, active perception, object goal navigation},
}

Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.