MSR Thesis Defense
PhD Student
Robotics Institute,
Carnegie Mellon University

Alignment for Vision-Language Foundation Model

NSH 3305

Abstract: Recent advancements in vision-language foundation models, exemplified by GPT4-Vision and DALL-E 3, have significantly transformed both research and practical applications, ranging from professional assistance to content creation. However, aligning them precisely with specific user goals presents a notable challenge. This thesis introduces innovative strategies for improving this alignment. I will first introduce our novel [...]

PhD Thesis Proposal
PhD Student
Robotics Institute,
Carnegie Mellon University

Efficient Sensor Coverage in Complex Environments

Abstract: This thesis develops sensor coverage algorithms for mobile robots that are scalable to large and complex environments. The core challenge is computing the shortest paths that can direct one or more robots to sweep onboard sensors over all accessible surfaces within an environment. This problem resembles the watchman route problem that is known to [...]

MSR Thesis Defense
PhD Student
Robotics Institute,
Carnegie Mellon University

Improving Kalman Filter-based Multi-Object Tracking in Occlusion and Non-linear Motion

Newell-Simon Hall 4305

Abstract: Modern methods solve multi-object tracking from two perspectives: motion modeling and appearance matching. As a classic paradigm, motion-based tracking by Kalman filters suffers from complicated motion patterns and the problem becomes more difficult when we only have noisy bounding boxes. To improve Kalman filter-based multi-object tracking in scenarios with complex motion, occlusion, and crossover, [...]

MSR Thesis Defense
PhD Student
Robotics Institute,
Carnegie Mellon University

Improving Kalman Filter-based Multi-Object Tracking in Occlusion and Non-linear Motion

NSH 4305

Abstract: Modern methods solve multi-object tracking from two perspectives: motion modeling and appearance matching. As a classic paradigm, motion-based tracking by Kalman filters suffers from complicated motion patterns and the problem becomes more difficult when we only have noisy bounding boxes. To improve Kalman filter-based multi-object tracking in scenarios with complex motion, occlusion, and crossover, [...]

PhD Thesis Defense
PhD Student
Robotics Institute,
Carnegie Mellon University

Design Iteration of Dexterous Compliant Robotic Manipulators

GHC 6501

Abstract: The goal of personal robotics is to have robots in homes performing everyday tasks efficiently to improve our quality of life. Towards this end, manipulators are needed which are low cost, safe around humans, and approach human-level dexterity. However, existing off-the-shelf manipulators are expensive both in cost and manufacturing time, difficult to repair, and [...]

PhD Thesis Defense
PhD Student
Robotics Institute,
Carnegie Mellon University

Continual Learning of Compositional Skills for Robust Robot Manipulation

GHC 6501

Abstract: Real world robots need to continuously learn new manipulation tasks in a lifelong learning manner. These new tasks often share many sub-structures e.g. sub-tasks, controllers, preconditions, with previously learned tasks. To utilize these shared sub-structures, we explore a compositional and object-centric approach to learn manipulation tasks. The first part of this thesis focuses on [...]

PhD Thesis Defense
PhD Student
Robotics Institute,
Carnegie Mellon University

Watch, Practice, Improve: Towards In-the-wild Manipulation

GHC 4405

Abstract: The longstanding dream of many roboticists is to see robots perform diverse tasks in diverse environments. To build such a robot that can operate anywhere, many methods train on robotic interaction data. While these approaches have led to significant advances, they rely on heavily engineered setups or high amounts of supervision, neither of which [...]

PhD Thesis Proposal
PhD Student
Robotics Institute,
Carnegie Mellon University

Combining Physics-Based Light Transport and Neural Fields for Robust Inverse Rendering

NSH 3305

Abstract:   Inverse rendering — the process of recovering shape, material, and/or lighting of an object or environment from a set of images — is essential for applications in robotics and elsewhere, from AR/VR to perception on self-driving vehicles. While it is possible to perform inverse rendering from color images alone, it is often far easier [...]

PhD Thesis Defense
Extern
Robotics Institute,
Carnegie Mellon University

Improving the Transparency of Agent Decision Making to Humans Using Demonstrations

GHC 4405

Abstract: For intelligent agents (e.g. robots) to be seamlessly integrated into human society, humans must be able to understand their decision making. For example, the decision making of autonomous cars must be clear to the engineers certifying their safety, passengers riding them, and nearby drivers negotiating the road simultaneously. As an agent's decision making depends [...]

PhD Thesis Proposal
PhD Student
Robotics Institute,
Carnegie Mellon University

Robotic Climbing for Extreme Terrain Exploration

NSH 3305

Abstract: Climbing robots can operate in steep and unstructured environments that are inaccessible to other ground robots, with applications ranging from the inspection of artificial structures on Earth to the exploration of natural terrain features throughout the solar system. Climbing robots for planetary exploration face many challenges to deployment, including mass restrictions, irregular surface features, [...]