Emergent Hierarchical Control Structures: Learning Reactive / Hierarchical Relationships in Reinforcement Environments - Robotics Institute Carnegie Mellon University

Emergent Hierarchical Control Structures: Learning Reactive / Hierarchical Relationships in Reinforcement Environments

Bruce Digney
Conference Paper, Proceedings of 4th International Conference on Simulation of Adaptive Behavior: From Animals to Animats (SAB '96), pp. 363 - 372, September, 1996

Abstract

The use of externally imposed hierarchical structures to reduce the complexity of learning control is common. However, it is acknowledged that learning the hierarchical structure itself is an important step towards more general (learning of many things as required) and less bounded (learning of a single thing as specified) learning. Presented in this paper is a reinforcement learning algorithm called Nested Q-learning that generates a hierarchical control structure in reinforcement learning domains. The emergent structure combined with learned bottom-up reactive reactions results in a reactive hierarchical control system. Effectively, the learned hierarchy decomposes what would otherwise be a monolithic evaluation function into many smaller evaluation functions that can be recombined without the loss of previously learned information.

BibTeX

@conference{Digney-1996-16335,
author = {Bruce Digney},
title = {Emergent Hierarchical Control Structures: Learning Reactive / Hierarchical Relationships in Reinforcement Environments},
booktitle = {Proceedings of 4th International Conference on Simulation of Adaptive Behavior: From Animals to Animats (SAB '96)},
year = {1996},
month = {September},
pages = {363 - 372},
}