DynaMMo: Mining and Summarization of Coevolving Sequences with Missing Values - Robotics Institute Carnegie Mellon University

DynaMMo: Mining and Summarization of Coevolving Sequences with Missing Values

Junlei Li, James McCann, Nancy Pollard, and Christos Faloutsos
Conference Paper, Proceedings of 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '09), pp. 507 - 516, June, 2009

Abstract

Given multiple time sequences with missing values, we propose DynaMMo which summarizes, compresses, and finds latent variables. The idea is to discover hidden variables and learn their dynamics, making our algorithm able to function even when there are missing values. We performed experiments on both real and synthetic datasets spanning several megabytes, including motion capture sequences and chlorine levels in drinking water. We show that our proposed DynaMMo method (a) can successfully learn the latent variables and their evolution; (b) can provide high compression for little loss of reconstruction accuracy; (c) can extract compact but powerful features for segmentation, interpretation, and forecasting; (d) has complexity linear on the duration of sequences.

BibTeX

@conference{Li-2009-10248,
author = {Junlei Li and James McCann and Nancy Pollard and Christos Faloutsos},
title = {DynaMMo: Mining and Summarization of Coevolving Sequences with Missing Values},
booktitle = {Proceedings of 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '09)},
year = {2009},
month = {June},
pages = {507 - 516},
keywords = {Time Series; Missing Value; Bayesian Network; Expectation Maximization (EM)},
}