VASC Seminar
Yong Jae Lee
Associate Professor
Department of Computer Sciences , University of Wisconsin-Madison

Large Multimodal (Vision-Language) Models for Image Generation and Understanding

Newell-Simon Hall 3305

Abstract: Large Language Models and Large Vision Models, also known as Foundation Models, have led to unprecedented advances in language understanding, visual understanding, and AI. In particular, many computer vision problems including image classification, object detection, and image generation have benefited from the capabilities of such models trained on internet-scale text and visual data. In [...]

RI Seminar
Fei Miao
Associate Professor
Department of Computer Science & Engineering, University of Connecticut

Learning and Control for Safety, Efficiency, and Resiliency of Embodied AI

1305 Newell Simon Hall

Abstract: The rapid evolution of ubiquitous sensing, communication, and computation technologies has revolutionized of cyber-physical systems (CPS) across virous domains like robotics, smart grids, aerospace, and smart cities. Integrating learning into dynamic systems control presents significant Embodied AI opportunities. However, current decision-making frameworks lack comprehensive understanding of the tridirectional relationship among communication, learning and control, [...]

VASC Seminar
Mohamed Elhoseiny
Assistant Professor
Computer Science, KAUST

Imaginative Vision Language Models: Towards human-level imaginative AI skills transforming species discovery, content creation, self-driving cars, and emotional health

3305 Newell-Simon Hall

Abstract:   Most existing AI learning methods can be categorized into supervised, semi-supervised, and unsupervised methods. These approaches rely on defining empirical risks or losses on the provided labeled and/or unlabeled data. Beyond extracting learning signals from labeled/unlabeled training data, we will reflect in this talk on a class of methods that can learn beyond the vocabulary [...]

VASC Seminar
Kenneth Marino
Research Scientist
Google DeepMind

World Knowledge in the Time of Large Models

Newell-Simon Hall 3305

Abstract:  This talk will discuss the massive shift that has come about in the vision and ML community as a result of the large pre-trained language and language and vision models such as Flamingo, GPT-4, and other models. We begin by looking at the work on knowledge-based systems in CV and robotics before the large model [...]

RI Seminar
Marc Deisenroth
DeepMind Chair of Machine Learning and Artificial Intelligence
University College London

Data-Efficient Learning for Robotics and Reinforcement Learning

1305 Newell Simon Hall

Abstract: Data efficiency, i.e., learning from small datasets, is of practical importance in many real-world applications and decision-making systems. Data efficiency can be achieved in multiple ways, such as probabilistic modeling, where models and predictions are equipped with meaningful uncertainty estimates, transfer learning, or the incorporation of valuable prior knowledge. In this talk, I will [...]

VASC Seminar
Shunsuke Saito
Research Scientist
Meta Reality Labs Research

Digital Human Modeling with Light

Newell-Simon Hall 3305

Abstract: Leveraging light in various ways, we can observe and model physical phenomena or states which may not be possible to observe otherwise. In this talk, I will introduce our recent exploration on digital human modeling with different types of light. First, I will present our recent work on the modeling of relightable human heads, [...]

VASC Seminar
Jonathon Luiten
Postdoctoral Fellow
RWTH Aachen and Carnegie Mellon University

Dynamic 3D Gaussians: Tracking by Persistent Dynamic View Synthesis

Newell-Simon Hall 3305

Abstract: We present a method that simultaneously addresses the tasks of dynamic scene novel-view synthesis and six degree-of-freedom (6-DOF) tracking of all dense scene elements. We follow an analysis-by-synthesis framework, inspired by recent work that models scenes as a collection of 3D Gaussians which are optimized to reconstruct input images via differentiable rendering. To model [...]

RI Seminar
Dr. Robert Ambrose
J. Mike Walker '66 Chair Professor
Mechanical Engineering, Texas A&M University

Robots at the Johnson Space Center and Future Plans

1305 Newell Simon Hall

Abstract: The seminar will review a series of robotic systems built at the Johnson Space Center over the last 20 years. These will include wearable robots (exoskeletons, powered gloves and jetpacks), manipulation systems (ISS cranes down to human scale) and lunar mobility systems (human surface mobility and robotic rovers). As all robotics presentations should, this [...]

VASC Seminar
Arun Ross
Professor
Michigan State University

Biometrics in a Deep Learning World

Newell-Simon Hall 3305

Abstract: Biometrics is the science of recognizing individuals based on their physical and behavioral attributes such as fingerprints, face, iris, voice and gait. The past decade has witnessed tremendous progress in this field, including the deployment of biometric solutions in diverse applications such as border security, national ID cards, amusement parks, access control, and smartphones. [...]

VASC Seminar
Andrea Tagliasacchi
Associate Professor
Simon Fraser University

Neural World Models

Newell-Simon Hall 4305

Abstract: Computer vision researchers have pushed the limits of performance in perception tasks involving natural images to near saturation. With self-supervised inference driven by recent advancements in generative modeling, it can be debated that the era of large image models is coming to a close, ushering in an era focused on video. However, it's worth [...]