PhD Thesis Proposal
PhD Student
Robotics Institute,
Carnegie Mellon University

Multimodal Representations for Adaptable Robot Policies in Human-Inhabited Spaces

NSH 4305

Abstract:  Human beings sense and express themselves through multiple modalities. To capture multimodal ways of human communication, I want to build adaptable robot policies that infer task pragmatics from video and language prompts, reason about sounds and other sensors, take actions, and learn mannerisms of interacting with people and objects. Existing solutions for robot policies [...]

PhD Thesis Proposal
PhD Student
Robotics Institute,
Carnegie Mellon University

Sensorized Soft Material Systems with Integrated Electronics and Computing

NSH 1305

Abstract: The integration of soft and multifunctional materials in emerging technologies is becoming more widespread due to their ability to enhance or improve functionality in ways not possible using typical rigid alternatives. This trend is evident in various fields. For example, wearable technologies are increasingly designed using soft materials to improve modulus compatibility with biological [...]

PhD Thesis Proposal
PhD Student
Robotics Institute,
Carnegie Mellon University

Towards Underwater 3D Visual Perception

Abstract: With modern robotic technologies, seafloor imageries have become more accessible to both researchers and the public. This thesis leverages deep learning and 3D vision techniques to deliver valuable information from seafloor image observations. Despite the widespread use of deep learning and 3D vision algorithms across various fields, underwater imaging presents unique challenges, such as [...]

PhD Thesis Proposal
PhD Student
Robotics Institute,
Carnegie Mellon University

Teaching Robots to Drive: Scalable Policy Improvement via Human Feedback

NSH 3305

Abstract: A long-standing problem in autonomous driving is grappling with the long-tail of rare scenarios for which little or no data is available. Although learning-based methods scale with data, it is unclear that simply ramping up data collection will eventually make this problem go away. Approaches which rely on simulation or world modeling offer some [...]

PhD Thesis Proposal
PhD Student
Robotics Institute,
Carnegie Mellon University

Understanding and acting in the 4D world

NSH 4305

Abstract: As humans, we are constantly interacting with and observing a three-dimensional dynamic world; where objects around us change state as they move or are moved, and we, ourselves, move for navigation and exploration. Such an interaction between a dynamic environment and a dynamic ego-agent is complex to model as an ego-agent's perception of the [...]

PhD Thesis Proposal
PhD Student
Robotics Institute,
Carnegie Mellon University

Learning Universal Humanoid Control

GHC 4405

Abstract: Since infancy, humans acquire motor skills, behavioral priors, and objectives by learning from their caregivers. Similarly, as we create humanoids in our own image, we aspire for them to learn from us and develop universal physical and cognitive capabilities that are comparable to, or even surpass, our own. In this thesis, we explore how [...]

PhD Thesis Proposal
PhD Student
Robotics Institute,
Carnegie Mellon University

Generative Robotics: Self-Supervised Learning for Human-Robot Collaborative Creation

NSH 4305

Abstract: While Generative AI has shown breakthroughs in recent years in generating new digital contents such as images or 3D models from high-level goal inputs like text, Robotics technologies have not, instead focusing on low-level goal inputs. We propose Generative Robotics, as a new field of robotics which combines the high-level goal input abilities of [...]

PhD Thesis Proposal
PhD Student
Robotics Institute,
Carnegie Mellon University

3D Video Models through Point Tracking, Reconstructing and Forecasting

NSH 3305

Abstract: 3D scene understanding from 2D video is essential for enabling advanced applications such as autonomous driving, robotics, virtual reality, and augmented reality. These fields rely on accurate 3D spatial awareness and dynamic interaction modeling to navigate complex environments, manipulate objects, and provide immersive experiences. Unlike 2D, 3D training data is much less abundant, which [...]

PhD Thesis Proposal
PhD Student
Robotics Institute,
Carnegie Mellon University

Towards a Robot Generalist through In-Context Learning and Abstractions

NSH 1305

Abstract: The goal of this thesis is to discover AI processes that enhance cross-domain and cross-task generalization in intelligent robot agents. Unlike the dominant approach in contemporary robot learning, which pursues generalization primarily through scaling laws (increasing data and model size), we focus on identifying the best abstractions and representations in both perception and policy [...]

PhD Thesis Proposal
PhD Student
Robotics Institute,
Carnegie Mellon University

Vision-based Human Motion Modeling and Analysis

NSH 4305

Abstract: Modern computer vision has achieved remarkable success in tasks such as detecting, segmenting, and estimating the pose of humans in images and videos, reaching or even surpassing human-level performance. However, they still face significant challenges in predicting and analyzing future human motion. This thesis explores how vision-based solutions can enhance the fidelity and accuracy [...]