PhD Thesis Proposal

Arun Venkatraman, Carnegie Mellon University
Wednesday, September 21
9:00 am to 12:00 pm
Training Strategies for Time Series: Learning for Filtering and Reinforcement Learning

Event Location: GHC 4405

Abstract: Data-driven approaches to modeling time series are important in a variety of applications, from market prediction in economics to the simulation of robotic systems. However, traditional supervised machine learning techniques designed for i.i.d. data often perform poorly on these sequential problems. This thesis proposes that effective time-series and sequential prediction, whether for forecasting, filtering, or reinforcement learning, can be achieved by directly training recurrent prediction procedures rather than by building generative probabilistic models.

To this end, we introduce a new training algorithm for learned time-series models, Data as Demonstrator (DaD), that theoretically and empirically improves multi-step prediction performance on model classes such as recurrent neural networks, kernel regressors, and random forests. Additionally, experimental results indicate that DaD can accelerate model-based reinforcement learning. We next show that latent-state time-series models, where a sufficient state parametrization may be unknown, can be learned effectively in a supervised way. Our approach, Predictive State Inference Machines (PSIMs), uses a DaD-style training procedure to directly optimize inference performance, avoiding local optima by identifying the recurrent hidden state as a predictive belief over statistics of future observations. Fundamental to our learning framework is the idea that the prediction of observable quantities is a lingua franca for building AI systems. We propose three extensions that leverage this general idea and adapt it to a variety of problems. The first aims to improve the training time and performance of more sophisticated recurrent neural networks. The second extends the PSIM framework to controlled dynamical systems. The third looks to train recurrent architectures for reinforcement learning problems.
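The abstract does not spell out the DaD procedure, but its core idea, pairing the model's own multi-step predictions with the true next observations and retraining on the aggregated data, can be sketched as below. This is a minimal illustrative sketch, not the author's implementation: the function name dad_train, the choice of scikit-learn's Ridge as the one-step regressor, and the iteration count are all assumptions for illustration.

```python
import numpy as np
from sklearn.linear_model import Ridge  # stand-in regressor; any .fit/.predict model works


def dad_train(trajectories, model, n_iterations=10):
    """Illustrative DaD-style training loop (hypothetical helper, see note above).

    trajectories: list of arrays, each of shape (T, d) -- observed time series
    model: regressor with .fit(X, Y) and .predict(X), mapping x_t to x_{t+1}
    """
    # Initial dataset: ground-truth one-step transitions (x_t -> x_{t+1}).
    X = np.vstack([traj[:-1] for traj in trajectories])
    Y = np.vstack([traj[1:] for traj in trajectories])
    model.fit(X, Y)

    for _ in range(n_iterations):
        new_X, new_Y = [], []
        for traj in trajectories:
            pred = traj[0]  # start the rollout from the true initial observation
            for t in range(len(traj) - 2):
                # Unroll the learned model on its own output, as at test time.
                pred = model.predict(pred.reshape(1, -1))[0]  # pred approximates x_{t+1}
                # Pair the *predicted* state with the *true* next observation,
                # so training inputs match the multi-step prediction distribution.
                new_X.append(pred)
                new_Y.append(traj[t + 2])
        # Aggregate the correction pairs with the original data and refit.
        X = np.vstack([X, np.array(new_X)])
        Y = np.vstack([Y, np.array(new_Y)])
        model.fit(X, Y)
    return model


# Example usage on a toy noisy sine series:
ts = np.sin(np.linspace(0, 20, 200)).reshape(-1, 1) + 0.01 * np.random.randn(200, 1)
model = dad_train([ts], Ridge(alpha=1.0))
```

The design point mirrored here is that training inputs are drawn from the distribution the model itself induces when unrolled, which is what distinguishes DaD-style training from standard one-step (teacher-forced) regression on ground-truth transitions.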

Committee:
J. Andrew Bagnell, Co-chair
Martial Hebert, Co-chair
Jeff Schneider
Byron Boots, Georgia Institute of Technology