Fast Reinforcement Learning of Dialog Strategies

David Goddeau and Joelle Pineau

Conference Paper, Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '00), Vol. 2, pp. 1233 - 1236, June, 2000

View Publication

Abstract

Dialog management is a critical component of an effective spoken language application. It is also one of the most difficult and time consuming to engineer. This paper examines the application of reinforcement learning and Markov Decision Processes (MDP's) to the problem of learning the dialog strategies. It extends work done at AT&T [1] [2] in two directions. First it examines the ability of RL to learn optimal strategies in the presence of speech recognition errors. Second, it describes a technique for reducing the amount of data required to train these models. This is significant as the difficulty of training MDP-based dialog managers is a serious roadblock to deploying them in realistic applications.

BibTeX

@conference{Goddeau-2000-8049,
author = {David Goddeau and Joelle Pineau},
title = {Fast Reinforcement Learning of Dialog Strategies},
booktitle = {Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '00)},
year = {2000},
month = {June},
volume = {2},
pages = {1233 - 1236},
keywords = {reinforcement learning, dialog models, dialogue},
}

Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.