3D Human Motion Estimation via Motion Compression and Refinement
Conference Paper, Proceedings of Asian Conference on Computer Vision (ACCV '20), pp. 324 - 340, November, 2020
Abstract
We develop a technique for generating smooth and accurate 3D human pose and motion estimates from RGB video sequences. Our technique, which we call Motion Estimation via Variational Autoencoder (MEVA), decomposes a temporal sequence of human motion into a smooth motion representation using auto-encoder-based motion compression and a residual representation learned through motion refinement. This two-step encoding process of human motion can represent a wide variety of general human motions while also retaining person-specific motion details. Experiments show that our method produces both smooth and accurate 3D human pose and motion estimates.
BibTeX
@conference{Luo-2020-126385,author = {Zhengyi Luo and S. Alireza Golestaneh and Kris M. Kitani},
title = {3D Human Motion Estimation via Motion Compression and Refinement},
booktitle = {Proceedings of Asian Conference on Computer Vision (ACCV '20)},
year = {2020},
month = {November},
pages = {324 - 340},
}
Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.