Human Behavior Modeling with Maximum Entropy Inverse Optimal Control
Abstract
In our research, we view human behavior as a structured sequence of context-sensitive decisions. We develop a conditional probabilistic model for predicting human decisions given the contextual situation. Our approach employs the principle of maximum entropy within the Markov Decision Process framework. Modeling human behavior is reduced to recovering a context-sensitive utility function that explains demonstrated behavior within the probabilistic model. In this work, we review the development of our probabilistic model (Ziebart et al. 2008a) and the results of its application to modeling the context-sensitive route preferences of drivers (Ziebart et al. 2008b). We additionally expand the approach's applicability to domains with stochastic dynamics, present preliminary experiments on modeling time-usage, and discuss remaining challenges for applying our approach to other human behavior modeling problems.
BibTeX
@conference{Ziebart-2009-9932,author = {Brian D. Ziebart and Andrew L. Maas and J. Andrew (Drew) Bagnell and Anind Dey},
title = {Human Behavior Modeling with Maximum Entropy Inverse Optimal Control},
booktitle = {Proceedings of AAAI '09 Spring Symposium on Human Behavior Modeling},
year = {2009},
month = {March},
pages = {92 - 97},
}