3D Human Motion Estimation via Motion Compression and Refinement - Robotics Institute Carnegie Mellon University

3D Human Motion Estimation via Motion Compression and Refinement

Zhengyi Luo, S. Alireza Golestaneh, and Kris M. Kitani
Conference Paper, Proceedings of Asian Conference on Computer Vision (ACCV '20), pp. 324 - 340, November, 2020

Abstract

We develop a technique for generating smooth and accurate 3D human pose and motion estimates from RGB video sequences. Our technique, which we call Motion Estimation via Variational Autoencoder (MEVA), decomposes a temporal sequence of human motion into a smooth motion representation using auto-encoder-based motion compression and a residual representation learned through motion refinement. This two-step encoding process of human motion can represent a wide variety of general human motions while also retaining person-specific motion details. Experiments show that our method produces both smooth and accurate 3D human pose and motion estimates.

BibTeX

@conference{Luo-2020-126385,
author = {Zhengyi Luo and S. Alireza Golestaneh and Kris M. Kitani},
title = {3D Human Motion Estimation via Motion Compression and Refinement},
booktitle = {Proceedings of Asian Conference on Computer Vision (ACCV '20)},
year = {2020},
month = {November},
pages = {324 - 340},
}