Recycle-GAN: Unsupervised Video Retargeting

A. Bansal, S. Ma, D. Ramanan, and Y. Sheikh

Conference Paper, Proceedings of (ECCV) European Conference on Computer Vision, pp. 119 - 135, September, 2018

Abstract

We introduce a data-driven approach for unsupervised video retargeting that translates content from one domain to another while preserving the style native to a domain, i.e., if contents of John Oliver's speech were to be transferred to Stephen Colbert, then the generated content/speech should be in Stephen Colbert's style. Our approach combines both spatial and temporal information along with adversarial losses for content translation and style preservation. In this work, we first study the advantages of using spatiotemporal constraints over spatial constraints for effective retargeting. We then demonstrate the proposed approach for the problems where information in both space and time matters such as face-to-face translation, flower-to-flower, wind and cloud synthesis, sunrise and sunset.

BibTeX

@conference{Bansal-2018-121141,
author = {A. Bansal and S. Ma and D. Ramanan and Y. Sheikh},
title = {Recycle-GAN: Unsupervised Video Retargeting},
booktitle = {Proceedings of (ECCV) European Conference on Computer Vision},
year = {2018},
month = {September},
pages = {119 - 135},
}

Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.