PhD Thesis Defense
Postdoctoral Fellow
Robotics Institute,
Carnegie Mellon University

Communication-Efficient Active Reconstruction using Self-Organizing Gaussian Mixture Models

GHC 4405

Abstract: For the multi-robot active reconstruction task, this thesis proposes using Gaussian mixture models (GMMs) as a map representation that enables multiple downstream tasks: high-fidelity static scene reconstruction, communication-efficient map sharing, and safe informative planning. A new method, Self-Organizing Gaussian Mixture Modeling (SOGMM), is proposed that estimates the model complexity (i.e., number of Gaussian [...]

MSR Thesis Defense
MSR Student
Robotics Institute,
Carnegie Mellon University

Vision-Language Models for Hand-Object Interaction Prediction

Rashid Auditorium - 4401 Gates and Hillman Centers

Abstract: How can we predict future interaction trajectories of human hands in a scene given high-level colloquial task specifications in the form of natural language? In this paper, we extend the classic hand trajectory prediction task to two tasks involving explicit or implicit language queries. Our proposed tasks require extensive understanding of human daily activities [...]