Text to Visual Synthesis with Appearance Models

J. Melenchón, F. De la Torre, I. Iriondo, F. Alías, E. Martínez, and L. Vicent
Conference Paper, Proceedings of the International Conference on Image Processing (ICIP '03), pp. 237-240, September 2003

Abstract

This paper presents a new method, text to visual synthesis with appearance models (TEVISAM), for generating videorealistic talking heads. First, the system automatically learns a person-specific facial appearance model (PSFAM). The PSFAM models all facial components (e.g., eyes, mouth) independently and is used to animate the face dynamically from the input text. As reported by other researchers, one of the key aspects of visual synthesis is the coarticulation effect. To address this problem, we introduce a new interpolation method in the high-dimensional appearance space that allows the creation of photorealistic and videorealistic avatars. Preliminary experiments synthesizing virtual avatars from text are reported. In summary, this paper introduces three novelties: first, we use color PSFAMs to animate virtual avatars; second, we introduce a nonlinear high-dimensional interpolation to achieve videorealistic animations; finally, the method can generate new expressions by modeling the different facial elements.
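The abstract does not detail the paper's interpolation scheme, so the sketch below is an illustration only: a minimal Python/NumPy example of blending appearance-model coefficients between two viseme keyframes with a simple nonlinear (smoothstep) weighting, interpolating in the high-dimensional appearance space rather than in pixel space. All names, dimensions, and the smoothstep profile are hypothetical stand-ins, not the authors' method.

import numpy as np

def smoothstep(t):
    # Ease-in/ease-out weighting (3t^2 - 2t^3 on [0, 1]); a simple
    # stand-in for a nonlinear transition profile between visemes.
    return t * t * (3.0 - 2.0 * t)

def interpolate_visemes(c_start, c_end, n_frames):
    # Blend two appearance-coefficient vectors (e.g., PCA weights of a
    # person-specific facial appearance model) over n_frames frames.
    t = np.linspace(0.0, 1.0, n_frames)
    w = smoothstep(t)[:, None]              # nonlinear blend weights
    return (1.0 - w) * c_start + w * c_end  # (n_frames, d) coefficients

def reconstruct(mean_face, basis, coeffs):
    # Map a coefficient vector back to a face image: x = mean + B c.
    return mean_face + basis @ coeffs

# Hypothetical usage: d-dimensional appearance space, two viseme keys.
d = 20
rng = np.random.default_rng(0)
mean_face = rng.standard_normal(1000)    # flattened mean face texture
basis = rng.standard_normal((1000, d))   # appearance basis (e.g., PCA)
c_a, c_b = rng.standard_normal(d), rng.standard_normal(d)

for c in interpolate_visemes(c_a, c_b, n_frames=5):
    frame = reconstruct(mean_face, basis, c)  # one synthetic frame

Because the blending happens on model coefficients rather than raw pixels, every intermediate frame stays on the learned appearance manifold, which is what makes the transitions look videorealistic instead of ghosted cross-fades.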

BibTeX

@conference{Melenchon-2003-120960,
author = {J. Melench{\'o}n and F. De la Torre and I. Iriondo and F. Al{\'i}as and E. Mart{\'i}nez and L. Vicent},
title = {Text to Visual Synthesis with Appearance Models},
booktitle = {Proceedings of International Conference on Image Processing (ICIP '03)},
year = {2003},
month = {September},
pages = {237--240},
}