Learning Robot Skills with Temporal Variational Inference

Conference Paper, Proceedings of (ICML) International Conference on Machine Learning, pp. 8624 - 8633, July, 2020

Abstract

In this paper, we address the discovery of robotic options from demonstrations in an unsupervised manner. Specifically, we present a framework to jointly learn low-level control policies and higher-level policies of how to use them from demonstrations of a robot performing various tasks. By representing options as continuous latent variables, we frame the problem of learning these options as latent variable inference. We then present a temporally causal variant of variational inference based on a temporal factorization of trajectory likelihoods, that allows us to infer options in an unsupervised manner. We demonstrate the ability of our framework to learn such options across three robotic demonstration datasets, and provide our code.

BibTeX

@conference{Shankar-2020-126754,
author = {Tanmay Shankar and Abhinav Gupta},
title = {Learning Robot Skills with Temporal Variational Inference},
booktitle = {Proceedings of (ICML) International Conference on Machine Learning},
year = {2020},
month = {July},
pages = {8624 - 8633},
}

Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.