Learning Adaptive Sampling Distributions for Motion Planning by Self-Imitation

Ratnesh Madaan, Sam Zeng, Brian Okorn, and Sebastian Scherer

Workshop Paper, IROS '18 Workshop on Machine Learning in Robot Motion Planning, October, 2018

Abstract

Sampling based motion planning algorithms are widely used due to their effectiveness on problems with large state spaces by incremental tree growth in conjunction with uniform, random sampling. The major bottleneck in the performance of such algorithms is the amount of collision checks performed, which in turns depends on the sampling distribution itself. In this work, we present a framework to learn an adaptive, non-stationary sampling distribution which explicitly minimizes the search effort, given by the amount of collision checks performed. Our framework models the sequential nature of the problem by leveraging both the instantaneous search tree over the robot configuration space, as well as the workspace environment, by encoding them with a conditional variational auto-encoder, to learn a stochastic sampling policy. We encode the workspace environment with a convolutional network, and the configuration space planning tree with a recurrent neural network. We introduce an approximate oracle which can return multiple label samples for a partially solved planning problem, by forward simulating it. We use an imitation via iterative supervised learning framework to learn a stochastic sampling policy. We call this self-supervised imitation of an oracle generated by forward simulation as self-imitation. We validate our approach on a 4D kinodynamic helicopter planning problem with glideslope and curvature constraints, and a 2D holonomic problem.

BibTeX

@workshop{Madaan-2018-126737,
author = {Ratnesh Madaan and Sam Zeng and Brian Okorn and Sebastian Scherer},
title = {Learning Adaptive Sampling Distributions for Motion Planning by Self-Imitation},
booktitle = {Proceedings of IROS '18 Workshop on Machine Learning in Robot Motion Planning},
year = {2018},
month = {October},
}

Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.