PhD Speaking Qualifier
PhD Student
Robotics Institute,
Carnegie Mellon University

Learning Local Heuristics in Heuristic Search

NSH 3305

Abstract: Motion planning is a fundamental problem in robotics; how can we move robots efficiently and safely? Motion planning can be solved using several paradigms with their own strengths and weaknesses. This talk dives into Heuristic Graph Search and its application to motion planning by converting it to a problem of finding a start-goal path [...]

PhD Speaking Qualifier
PhD Student
Robotics Institute,
Carnegie Mellon University

Joint 2D and 3D Semi-Supervised Object Detection

NSH 4305

Abstract: While numerous 3D detection works leverage the complementary relationship between RGB images and point clouds, developments in the broader framework of semi-supervised object recognition remain uninfluenced by multi-modal fusion. Current methods develop independent pipelines for 2D and 3D semi-supervised learning despite the availability of paired image and point cloud frames. Observing that the distinct [...]

PhD Speaking Qualifier
PhD Student
Robotics Institute,
Carnegie Mellon University

Towards Agile Robotics: Creating Push-Off Skills for Dynamic Interactions

GHC 8102

Abstract: Dynamic interactions play a fundamental role in human capabilities, enabling us to achieve a wide range of tasks such as moving heavy objects, manipulating our surroundings, and changing directions rapidly and safely. In contrast, most conventional robotic systems lack this level of agility and cannot perform dynamic interactions, limiting their potential in practical applications. [...]

PhD Speaking Qualifier
PhD Student
Robotics Institute,
Carnegie Mellon University

Generative Evolutionary Search with Diffusion Models for Trajectory Optimization

NSH 4305

Abstract: Diffusion models excel at modeling complex and multimodal trajectory distributions for decision-making and control. Reward-gradient guided denoising has been recently proposed to generate trajectories that maximize both a differentiable reward function and the likelihood under the data distribution captured by a diffusion model. Reward-gradient guided denoising requires a differentiable reward function fitted to both [...]

PhD Speaking Qualifier
PhD Student
Robotics Institute,
Carnegie Mellon University

Tartancalib: Iterative Wide-Angle Lens Calibration

GHC 8115

Abstract: Mobile vision systems greatly benefit from the large field-of-view enabled by wide-angle lenses. Accurate and robust intrinsic calibration is a critical prerequisite for leveraging this property. Calibrating wide-angle lenses with current state-of-the-art techniques yields poor results due to extreme distortion at the edge. In this work, we present TartanCalib, an accurate and robust method [...]

PhD Speaking Qualifier
PhD Student
Robotics Institute,
Carnegie Mellon University

Zero-Shot Video Question Answering with Procedural Programs

GHC 6121

Abstract: We propose to answer zero-shot questions about videos by generating short procedural programs that derive a final answer from solving a sequence of visual subtasks. We present Procedural Video Querying (ProViQ), which uses a large language model to generate such programs from an input question and an API of visual modules in the prompt, [...]

PhD Speaking Qualifier
PhD Student
Robotics Institute,
Carnegie Mellon University

Robust Body Exposure (RoBE): A Graph-based Dynamics Modeling Approach to Manipulating Blankets over People

NSH 1109

Abstract: Robotic caregivers could potentially improve the quality of life of many who require physical assistance. However, in order to assist individuals who are lying in bed, robots must be capable of dealing with a significant obstacle: the blanket or sheet that will almost always cover the person's body. We propose a method for targeted [...]

PhD Speaking Qualifier
PhD Student
Robotics Institute,
Carnegie Mellon University

Learning to Manipulate beyond Imitation

NSH 3002

Abstract: Imitation learning has been a prevalent approach for teaching robots manipulation skills but still suffers from scalability and generalizability. In this talk, I'll argue for going beyond elementary behavioral imitation from human demonstrations. Instead, I'll present two key directions: 1) Creating Manipulation Controllers from Pre-Trained Representations, and 2) Representing Video Demonstrations with Parameterized Symbolic [...]

PhD Speaking Qualifier
PhD Student
Robotics Institute,
Carnegie Mellon University

Leveraging Parallelism to Accelerate Quadratic Program Solvers for MPC

GHC 8102

Abstract: Many problems in robotics can be formulated as quadratic programs (QPs). In particular, model-predictive control problems often involve repeatedly solving QPs at very high rates (up to kilohertz). However, while other areas of robotics like machine learning have achieved high performance by taking advantage of parallelism on modern computing hardware, state-of-the-art algorithms for solving [...]

PhD Speaking Qualifier
PhD Student
Robotics Institute,
Carnegie Mellon University

Composing Generative and Discriminative Models for Better Generalization

NSH 3305

Abstract: Computer Vision is Correspondence, correspondence, correspondence! Inspite of the singular definition of computer vision, we still have two broad categories of approaches in the literature. Generative Models, like Stable Diffusion, learn a correspondence between image and text modality, while learning a mapping from text to image. Discriminative Models, like CLIP, on the other hand [...]

PhD Speaking Qualifier
PhD Student
Robotics Institute,
Carnegie Mellon University

Lower Bounds for Moving Target Traveling Salesman Motion Planning with Obstacles

NSH 3305

Abstract: We study the problem of finding a trajectory for an agent to intercept a number of moving targets while avoiding obstacles. Applications include resupplying naval ships at sea and recharging aerial vehicles with a ground vehicle. We model the problem as an extension of the traveling salesman problem, which we refer to as the [...]

PhD Speaking Qualifier
PhD Student
Robotics Institute,
Carnegie Mellon University

Probabilistic 3D Multi-Object Cooperative Tracking for Autonomous Driving via Differentiable Multi-Sensor Kalman Filter

NSH 3305

Abstract: Current state-of-the-art autonomous driving vehicles mainly rely on each individual sensor system to perform perception tasks. Such a framework's reliability could be limited by occlusion or sensor failure. To address this issue, more recent research proposes using vehicle-to-vehicle (V2V) communication to share perception information with others. However, most relevant works focus only on cooperative [...]

PhD Speaking Qualifier
PhD Student
Robotics Institute,
Carnegie Mellon University

Robust Off-road Wheel Odometry with Slip Estimation

NSH 4305

Abstract: Wheel odometry is not often used in state estimation for off-road vehicles due to frequent wheel slippage, varying wheel radii, and the 3D motion of the vehicle not fitting with the 2D nature of integrated wheel odometry. This paper proposes a novel 3D preintegration of wheel encoder measurements on manifold. Our method additionally estimates [...]

PhD Speaking Qualifier
PhD Student
Robotics Institute,
Carnegie Mellon University

Enhancing Model Performance and Interpretability with Causal Inference as a Feature Selection Algorithm

NSH 1305

Abstract: Causal inference focuses on uncovering cause-effect relationships from data, diverging from conventional machine learning which primarily relies on correlation analysis. By identifying these causal relationships, causal inference improves feature selection for predictive models, leading to predictions that are more accurate, interpretable, and robust. This approach proves especially effective with interventional data, such as randomized [...]

PhD Speaking Qualifier
PhD Student
Robotics Institute,
Carnegie Mellon University

Recent Progress in Graph-Search Methods for Multi-Robot-Arm Motion Planning

NSH 4305

Abstract: An exciting frontier in robotic manipulation is the use of multiple arms at once. However, planning concurrent motions is a challenging task using current methods. A major obstacle is the high-dimensional state space of this planning problem, which renders many traditional motion planning algorithms impractical. This opens the door for alternatives to the common [...]

PhD Speaking Qualifier
PhD Student
Robotics Institute,
Carnegie Mellon University

Strategy and Skill Learning for Physics-based Table Tennis Animation

Abstract: Recent advancements in physics-based character animation leverage deep learning to generate agile and natural motion, enabling characters to execute movements such as backflips, boxing, and tennis. However, reproducing the selection and use of diverse motor skills in dynamic environments to solve complex tasks, as humans do, still remains a challenge. We present a strategy [...]