VASC Seminar
Controllable Visual Imagination
Abstract: Generative models have empowered human creators to visualize their imaginations without artistic skills and labor. A prominent example is large-scale text-to-image generation models. However, these models often are difficult to control and do not respect 3D perspective geometry and temporal consistency of videos. In this talk, I will showcase several of our recent efforts to [...]
Discovering and Erasing Undesired Concepts
Abstract: The rapid growth of generative models allows an ever-increasing variety of capabilities. Yet, these models may also produce undesired content such as unsafe or misleading images, private information, or copyrighted material. In this talk, I will discuss practical methods to prevent undesired generations. First, I will show how the challenge of avoiding undesired generations [...]
The New Era of Video Generation
Abstract: Traditional video production is slow, expensive, and requires specialized skills. Founded by CMU alumni, HeyGen is an AI-native video platform designed to revolutionize the video creation process by making visual storytelling accessible to all. We've successfully grown to more than 20M users, and tens of millions revenue in less than one year, with recognition [...]
Autoregressive Models: Foundations and Open Questions
Abstract: The success of Autoregressive (AR) models in language today is so tremendous that their scope has, in turn, been largely narrowed to specific instantiations. In this talk, we will revisit the foundations of classical AR models, discussing essential concepts that may have been overlooked in modern practice. We will then introduce our recent research [...]