Loading view.
Seminar
Building Scalable Visual Intelligence: From Represention to Understanding and Generation
3305 Newell-Simon HallAbstract: In this talk, we will dive into our recent work on vision-centric generative AI, focusing on how it helps with understanding and creating visual content like images and videos. We'll cover the latest advances, including multimodal large language models for visual understanding and diffusion transformers for visual generation. We'll explore how these two areas [...]