Hierarchical Reinforcement Learning Method for Autonomous Vehicle Behavior Planning - Robotics Institute Carnegie Mellon University

Hierarchical Reinforcement Learning Method for Autonomous Vehicle Behavior Planning

Zhiqian Qiao, Zachariah Tyree, Priyantha Mudalige, Jeff Schneider, and John M. Dolan
Conference Paper, Proceedings of (IROS) IEEE/RSJ International Conference on Intelligent Robots and Systems, pp. 6084 - 6089, October, 2020

Abstract

Behavioral decision making is an important aspect of autonomous vehicles (AV). In this work, we propose a behavior planning structure based on hierarchical reinforcement learning (HRL) which is capable of performing autonomous vehicle planning tasks in simulated environments with multiple sub-goals. In this hierarchical structure, the network is capable of 1) learning one task with multiple sub-goals simultaneously; 2) extracting attentions of states according to changing subgoals during the learning process; 3) reusing the well-trained network of sub-goals for other tasks with the same sub-goals. A hybrid reward mechanism is designed for different hierarchical layers in the proposed HRL structure. Compared to traditional RL methods, our algorithm is more sample-efficient, since its modular design allows reusing the policies of sub-goals across similar tasks for various transportation scenarios. The results show that the proposed method converges to an optimal policy faster than traditional RL methods.

BibTeX

@conference{Qiao-2020-129557,
author = {Zhiqian Qiao and Zachariah Tyree and Priyantha Mudalige and Jeff Schneider and John M. Dolan},
title = {Hierarchical Reinforcement Learning Method for Autonomous Vehicle Behavior Planning},
booktitle = {Proceedings of (IROS) IEEE/RSJ International Conference on Intelligent Robots and Systems},
year = {2020},
month = {October},
pages = {6084 - 6089},
keywords = {autonomous driving, hierarchical reinforcement learning, behavior planning, intersections},
}