Task-Oriented Active Learning of Model Preconditions for Inaccurate Dynamics Models

Alex LaGrassa, Moonyoung Lee, and Oliver Kroemer

Conference Paper, Proceedings of (ICRA) International Conference on Robotics and Automation, May, 2024

View Publication

Abstract

When planning with an inaccurate dynamics model, a practical strategy is to restrict planning to regions of state-action space where the model is accurate: also known as a model precondition. Empirical real-world trajectory data is valuable for defining data-driven model preconditions regardless of the model form (analytical, simulator, learned, etc...). However, real-world data is often expensive and dangerous to collect. In order to achieve data efficiency, this paper presents an algorithm for actively selecting trajectories to learn a model precondition for an inaccurate pre-specified dynamics model. Our proposed techniques address challenges arising from the sequential nature of trajectories, and potential benefit of prioritizing task-relevant data. The experimental analysis shows how algorithmic properties affect performance in three planning scenarios: icy gridworld, simulated plant watering, and real-world plant watering. Results demonstrate an improvement of approximately 80% after only four real-world trajectories when using our proposed techniques. More material can be found on our project website: url{https://sites.google.com/view/active-mde}

BibTeX

@conference{LaGrassa-2024-140282,
author = {Alex LaGrassa and Moonyoung Lee and Oliver Kroemer},
title = {Task-Oriented Active Learning of Model Preconditions for Inaccurate Dynamics Models},
booktitle = {Proceedings of (ICRA) International Conference on Robotics and Automation},
year = {2024},
month = {May},
keywords = {planning, active learning, model preconditions},
}

Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.