Barycentric Interpolator for Continuous Space and Time Reinforcement Learning - Robotics Institute Carnegie Mellon University

Barycentric Interpolator for Continuous Space and Time Reinforcement Learning

Remi Munos and Andrew Moore
Conference Paper, Proceedings of (NeurIPS) Neural Information Processing Systems, pp. 1024 - 1030, December, 1998

Abstract

In order to find the optimal control of continuous state-space and time reinforcement learning (RL) problems, we approximate the value function (VF) with a particular class of functions called the barycentric interpolators. We establish sufficient conditions under which a RL algorithm converges to the optimal VF, even when we use approximate models of the state dynamics and the reinforcement functions.

BibTeX

@conference{Munos-1998-14809,
author = {Remi Munos and Andrew Moore},
title = {Barycentric Interpolator for Continuous Space and Time Reinforcement Learning},
booktitle = {Proceedings of (NeurIPS) Neural Information Processing Systems},
year = {1998},
month = {December},
pages = {1024 - 1030},
publisher = {MIT Press},
keywords = {Reinforcement learning, optimal control, discretization methods},
}