Learning-Based Planning for Improving Science Return of Earth Observation Satellites
Abstract
Earth observing satellites are powerful tools for collecting scientific information about our planet, however they have limitations: they cannot easily deviate from their orbital trajectories, their sensors have a limited field of view, and pointing and operating these sensors can take a large amount of the spacecraft’s resources. It is important for these satellites to optimize the data they collect and include only the most important or informative measurements. Dynamic targeting is an emerging concept in which satellite resources and data from a lookahead instrument are used to intelligently reconfigure and point a primary instrument. Simulation studies have shown that dynamic targeting increases the amount of scientific information gathered versus conventional sampling strategies. In this work, we present two different learning-based approaches to dynamic targeting, using reinforcement and imitation learning, respectively. These learning methods build on a dynamic programming solution to plan a sequence of sampling locations. We evaluate our approaches against existing heuristic methods for dynamic targeting, showing the benefits of using learning for this application. Imitation learning performs on average 10.0% better than the best heuristic method, while reinforcement learning performs on average 13.7% better. We also show that both learning methods can be trained effectively with small amounts of data.
BibTeX
@conference{Breitfeld-2024-144894,author = {Abigail Breitfeld and Alberto Candela and Juan Delfa and Akseli Kangaslahti and Itai Zilberstein and Steve Chien and David Wettergreen},
title = {Learning-Based Planning for Improving Science Return of Earth Observation Satellites},
booktitle = {Proceedings of International Symposium on Artificial Intelligence, Robotics and Automation in Space},
year = {2024},
month = {November},
}