MSR Thesis Defense
MSR Student / Teaching Assistant
Robotics Institute,
Carnegie Mellon University

Geometric Heuristics Enhance POCUS AI for Pneumothorax

GHC 4405

Abstract: The interpretation of Point-of-care ultrasound (POCUS) images poses a challenge due to the scarcity of high-quality labelled data for training AI models in the medical domain. To address this limitation, novel methodologies were developed to train POCUS AI models using limited data, integrating geometric heuristics derived from expert clinicians. Focused on diagnosing pneumothorax, heuristics [...]

MSR Thesis Defense
MSR Student / Intern
Robotics Institute,
Carnegie Mellon University

Optimal Control and Robot Learning on Agile Safety-Critical Systems

GHC 6501

Abstract: We present a pipeline of optimal control methods for learning an optimal control policy and locally accurate dynamics models for agile and safety-critical robots using autonomous racing as an application example. We introduce Spline-Opt, a fast offline/online optimization and planning method that can produce a reasonably good initial optimal trajectory given very little dynamics [...]

MSR Thesis Defense
PhD Student
Robotics Institute,
Carnegie Mellon University

Vision Model Diagnosis and Improvement Via Large Pretrained Models

Gates Hillman Center 4405

Abstract: As AI becomes increasingly pervasive in real-world applications, the deployment of machine learning models in real-world applications has underscored critical challenges in model robustness, fairness and performance. Despite significant advances, existing models often exhibit biases, fail to generalize across diverse data distributions, and struggle with unexpected input variations, leading to suboptimal or even discrimina- [...]

MSR Thesis Defense
MSR Student / Extern
Robotics Institute,
Carnegie Mellon University

Indoor Localization and Mapping with 4D mmWave Imaging Radar

GHC 6501

Abstract: State estimation is a crucial component for the successful implementation of robotic systems, relying on sensors such as cameras, LiDAR, and IMUs. However, in real-world scenarios, the performance of these sensors is degraded by challenging environments, e.g. adverse weather conditions and low-light scenarios. The emerging 4D imaging radar technology is capable of providing robust perception in adverse conditions. [...]

MSR Thesis Defense
MSR Student
Robotics Institute,
Carnegie Mellon University

PIE-FRIDA: Personalized Interactive Emotion-Guided Collaborative Human-Robot Art Creation

Gates Hillman Center 4405

Abstract: The introduction of generative AI has brought about many improvements in the artistic world. It allows many individuals to create artwork via simple descriptive text prompts. This has, in particular, created an avenue for non-artistic individuals to express their thoughts through generated art. Our work focuses on how emotion can be added as an [...]

MSR Thesis Defense
PhD Student
Robotics Institute,
Carnegie Mellon University

Simulated Encounters of the Third Kind: Scenario-Based Approach to Designing Guide Robots

Newell-Simon Hall 4305

Abstract: Navigating through unfamiliar environments is a challenging task. For people who are blind or have low vision (BLV), navigation can be particularly daunting. Guide robots are a type of service robot that can assist BLV people with navigation tasks. A significant amount of research related to guide robots has focused on technical contributions, while a [...]

MSR Thesis Defense
PhD Student
Robotics Institute,
Carnegie Mellon University

Super Odometry: Selective Fusion Towards All-degraded Environments

GHC 6501

Abstract: Robust odometry is at the core of robotics and autonomous systems operating navigation, exploration, and locomotion in complex environments for a broad spectrum of applications. While great progress has been made, the robustness of the odometry system still remains a grand challenge. This talk introduces Super Odometry, an approach that leverages selective fusion to [...]

MSR Thesis Defense
MSR Student
Robotics Institute,
Carnegie Mellon University

Learning on the Move: Integrating Action and Perception for Mobile Manipulation

Newell-Simon Hall 4305

Abstract: While there has been remarkable progress recently in the fields of manipulation and locomotion, mobile manipulation remains a long-standing challenge. Compared to locomotion or static manipulation, a mobile system must make a diverse range of long-horizon tasks feasible in unstructured and dynamic environments. While the applications are broad and interesting, there are a plethora [...]

MSR Thesis Defense
MSR Student
Robotics Institute,
Carnegie Mellon University

Continual Personalization of Human Actions with Prompt Tuning

3305 Newell-Simon Hall

Abstract: In interactive computing devices (VR/XR headsets), users interact with the virtual world using hand gestures and body actions. Typically, models deployed in such XR devices are static and limited to their default set of action classes. The goal of our research is to provide users and developers with the capability to personalize their experience by [...]

MSR Thesis Defense
MSR Student
Robotics Institute,
Carnegie Mellon University

Reinforcement Learning with Spatial Reasoning for Dexterous Robotic Manipulation

3305 Newell-Simon Hall

Abstract: Robotic manipulation in unstructured environments requires adaptability and the ability to handle a wide variety of objects and tasks. This thesis presents novel approaches for learning robotic manipulation skills using reinforcement learning (RL) with spatially-grounded action spaces, addressing the challenges of high-dimensional, continuous action spaces and alleviating the need for extensive training data. Our [...]

MSR Thesis Defense
MSR Student / Graduate Research Assistant
Robotics Institute,
Carnegie Mellon University

Leveraging Vision, Force Sensing, and Language Feedback for Deformable Object Manipulation

1305 Newell Simon Hall

Deformable object manipulation represents a significant challenge in robotics due to its complex dynamics, lack of low-dimensional state representations, and severe self-occlusions. This challenge is particularly critical in assistive tasks, where safe and effective manipulation of various deformable materials can significantly improve the quality of life for individuals with disabilities and address the growing needs [...]

MSR Thesis Defense
MSR Student
Robotics Institute,
Carnegie Mellon University

CBGT-Net: A Neuromimetic Architecture for Robust Classification of Streaming Data

Newell-Simon Hall 4305

Abstract: This research introduces CBGT-Net, a neural network model inspired by the cortico-basal ganglia-thalamic (CBGT) circuits in mammalian brains, which are crucial for critical thinking and decision-making. Unlike traditional neural network models that generate an output for each input or after a fixed sequence of inputs, CBGT-Net learns to produce an output once sufficient evidence [...]

MSR Thesis Defense
MSR Student
Robotics Institute,
Carnegie Mellon University

Enhancing Robot Perception and Interaction Through Structured Domain Knowledge

Newell-Simon Hall 3305

Abstract: Despite the advancements in deep learning driven by increased computational power and large datasets, significant challenges remain. These include difficulty in handling novel entities, limited mechanisms for human experts to update knowledge, and lack of interpretability, all of which are crucial for human-centric applications like assistive robotics. To address these issues, we propose leveraging [...]

MSR Thesis Defense
MSR Student
Robotics Institute,
Carnegie Mellon University

Towards Universal Place Recognition

3305 Newell-Simon Hall

Title: Towards Universal Place Recognition Abstract: Place Recognition is essential for achieving robust robot localization. However, current state-of-art systems remain environment/domain-specific and fragile. By leveraging insights from vision foundation models, we present AnyLoc, a universal VPR solution that performs across diverse environments without retraining or fine-tuning, significantly outperforming supervised baselines. We further introduce MultiLoc, and enable [...]

MSR Thesis Defense
MSR Student
Robotics Institute,
Carnegie Mellon University

GNSS-denied Ground Vehicle Localization for Off-road Environments with Bird’s-eye-view Synthesis

NSH 4305

Abstract:  Global localization is essential for the smooth navigation of autonomous vehicles. To obtain accurate vehicle states, on-board localization systems typically rely on Global Navigation Satellite System (GNSS) modules for consistent and reliable global positioning. However, GNSS signals can be obstructed by natural or artificial barriers, leading to temporary system failures and degraded state estimation. On the [...]

MSR Thesis Defense
MSR Student
Robotics Institute,
Carnegie Mellon University

Scaling up Robot Skill Learning with Generative Simulation

Newell-Simon Hall 4305

Abstract:  Generalist robots need to learn a wide variety of skills to perform diverse tasks across multiple environments. Current robot training pipelines rely on humans to either provide kinesthetic demonstrations or program simulation environments with manually-designed reward functions for reinforcement learning. Such human involvement is an important bottleneck towards scaling up robot learning across diverse [...]

MSR Thesis Defense
MSR Student
Robotics Institute,
Carnegie Mellon University

Simulation as a Tool for Conspicuity Measurement

1305 Newell Simon Hall

Abstract:  The use of unmanned aerial vehicles (UAVs) for time critical tasks is becoming increasingly popular. Operators are expected to use information from these swarms to make real-time and informed decisions. Consequently, detecting and recognizing targets from video is extremely pivotal to the success of these systems. At greater altitudes or with more vehicles, this [...]

MSR Thesis Defense
MSR Student
Robotics Institute,
Carnegie Mellon University

VP4D: View Planning for 3D and 4D Scene Understanding

1305 Newell Simon Hall

Abstract: View planning plays a critical role by gathering views that optimize scene reconstruction. Such reconstruction has played an important part in virtual production and computer animation, where a 3D map of the film set and motion capture of actors lead to an immersive experience. Current methods use uncertainty estimation in neural rendering of view [...]

MSR Thesis Defense
MSR Student
Robotics Institute,
Carnegie Mellon University

Automating Annotation Pipelines by leveraging Multi-Modal Data

Rashid Auditorium - 4401 Gates and Hillman Centers

Abstract: The era of vision-language models (VLMs) trained on large web-scale datasets challenges conventional formulations of “open-world" perception. In this work, we revisit the task of few-shot object detection (FSOD) in the context of recent foundational VLMs. First, we point out that zero-shot VLMs such as GroundingDINO significantly outperform state-of-the-art few-shot detectors (48 vs. 33 AP) [...]

MSR Thesis Defense
MSR Student
Robotics Institute,
Carnegie Mellon University

Leveraging Affordances for Accelerating Online RL

3305 Newell-Simon Hall

Abstract: The inability to explore environments efficiently makes online RL sample-inefficient. Most existing works tackle this problem in a setting devoid of prior information. However, additional affordances may often be cheaply available at the time of training. These affordances include small quantities of demo data, simulators that can reset to arbitrary states and domain specific [...]