Hierarchical Reinforcement Learning Method for Autonomous Vehicle Behavior Planning
Abstract
Behavioral decision making is an important aspect of autonomous vehicles (AV). In this work, we propose a behavior planning structure based on hierarchical reinforcement learning (HRL) which is capable of performing autonomous vehicle planning tasks in simulated environments with multiple sub-goals. In this hierarchical structure, the network is capable of 1) learning one task with multiple sub-goals simultaneously; 2) extracting attentions of states according to changing subgoals during the learning process; 3) reusing the well-trained network of sub-goals for other tasks with the same sub-goals. A hybrid reward mechanism is designed for different hierarchical layers in the proposed HRL structure. Compared to traditional RL methods, our algorithm is more sample-efficient, since its modular design allows reusing the policies of sub-goals across similar tasks for various transportation scenarios. The results show that the proposed method converges to an optimal policy faster than traditional RL methods.
BibTeX
@conference{Qiao-2020-129557,author = {Zhiqian Qiao and Zachariah Tyree and Priyantha Mudalige and Jeff Schneider and John M. Dolan},
title = {Hierarchical Reinforcement Learning Method for Autonomous Vehicle Behavior Planning},
booktitle = {Proceedings of (IROS) IEEE/RSJ International Conference on Intelligent Robots and Systems},
year = {2020},
month = {October},
pages = {6084 - 6089},
keywords = {autonomous driving, hierarchical reinforcement learning, behavior planning, intersections},
}