Challenges of Context and Time in Reinforcement Learning: Introducing Space Fortress as a Benchmark

Akshat Agarwal, Ryan Hope, and Katia Sycara

Workshop Paper, AAAI '19 Reinforcement Learning in Games Workshop, January, 2019

View Publication

Abstract

Research in deep reinforcement learning (RL) has coalesced around improving performance on benchmarks like the Arcade Learning Environment. However, these benchmarks do not emphasize two important characteristics that are often present in real-world domains: requirement of changing strategy conditioned on latent contexts, and temporal sensitivity. As a result, research in RL has not given these challenges their due, resulting in algorithms which do not understand critical changes in context, and have little notion of real world time. This paper introduces the game of Space Fortress as a RL benchmark which specifically targets these characteristics. We show that existing state-of-the-art RL algorithms are unable to learn to play the Space Fortress game, and then confirm that this poor performance is due to the RL algorithms' context insensitivity. We also identify independent axes along which to vary context and temporal sensitivity, allowing Space Fortress to be used as a testbed for understanding both characteristics in combination and also in isolation. We release Space Fortress as an open-source Gym environment.

Notes
Code available at https://github.com/agakshat/spacefortress

BibTeX

@workshop{Agarwal-2019-112116,
author = {Akshat Agarwal and Ryan Hope and Katia Sycara},
title = {Challenges of Context and Time in Reinforcement Learning: Introducing Space Fortress as a Benchmark},
booktitle = {Proceedings of AAAI '19 Reinforcement Learning in Games Workshop},
year = {2019},
month = {January},
keywords = {Reinforcement Learning; Benchmark; Context},
}

Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.