Learning to navigate in unseen cluttered structured environments

Vidhi Jain ad Ganesh Iyer and Katia Sycara

Workshop Paper, NeurIPS '20 WiML Virtual Workshop, December, 2020

View Publication

Abstract

Learning end-to-end policy for navigation with a focus on intelligent exploration has been found to be a challenging task in embodied AI. While methods like soft-Q learning and ensembles of policies have demonstrated navigation behaviors in completely observed maps, we currently do not have ways of extending these policies to unexplored or partially explored environments. To this end, we propose a modular hierarchical formulation by decomposing the navigation task into two sub-problems: selecting the next best goal in the visible space, followed by efficiently navigating to this space in the partial map setting. When navigating in unknown environments, humans often invoke a decision criteria. This could be the next sub-goal, and the decision could either be based on some semantic context, or structural priors. Learning how to select the next sub-goal for higher level task such as coverage of the map
is a challenging open question. We consider frontiers, which are the points at the boundary of known regions of a map, as points to facilitate exploration in a map. Therefore, we define such frontiers as sub-goals, and learn a policy that can select such frontiers on partial grid maps.

BibTeX

@workshop{Jain-2020-129121,
author = {Vidhi Jain ad Ganesh Iyer and Katia Sycara},
title = {Learning to navigate in unseen cluttered structured environments},
booktitle = {Proceedings of NeurIPS '20 WiML Virtual Workshop},
year = {2020},
month = {December},
keywords = {Reinforcement learning, robotics},
}

Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.