PhD Thesis Defense
Robotics Institute,
Carnegie Mellon University

Visual Dataset Pipeline: From Curation to Long-Tail Learning

NSH 4305

Abstract: Computer vision models have proven to be tremendously capable of recognizing and detecting several real-world objects: cars, people, pets. These models are only possible due to a meticulous pipeline where a task and application is first conceived followed by an appropriate dataset curation that collects and labels all necessary data. Commonly, studies are focused [...]

PhD Thesis Defense
Robotics Institute,
Carnegie Mellon University

Optimization of Small Unmanned Ground Vehicle Design using Reconfigurability, Mobility, and Complexity

Abstract: Unmanned ground vehicles are being deployed in increasingly diverse and complex environments. With modern developments in sensing and planning, the field of ground vehicle mobility presents rich possibilities for mechanical innovations that may be especially relevant for unmanned systems. In particular, reconfigurability may enable vehicles to traverse a wider set of terrains with greater [...]

PhD Thesis Defense
Robotics Institute,
Carnegie Mellon University

Towards Reconstructing Non-rigidity from Single Camera

GHC 6501

Abstract: In this talk we will discuss how to infer 3D from images captured by a single camera, without assuming the target scenes / objects being static. The non-static setting makes our problem ill-posed and challenging to solve, but is vital in practical applications where target-of-interest is non-static. To solve ill-posed problems, the current trend [...]

PhD Thesis Defense
PhD Student
Robotics Institute,
Carnegie Mellon University

Large Scale Dense 3D Reconstruction via Sparse Representations

NSH 4305

Abstract: Dense 3D scene reconstruction is in high demand today for view synthesis, navigation, and autonomous driving. A practical reconstruction system inputs multi-view scans of the target using RGB-D cameras, LiDARs, or monocular cameras, computes sensor poses, and outputs scene reconstructions. These algorithms are computationally expensive and memory-intensive due to the presence of 3D data. [...]

PhD Thesis Defense
PhD Student
Robotics Institute,
Carnegie Mellon University

From Reinforcement Learning to Robot Learning: Leveraging Prior Data and Shared Evaluation

NSH 4305

Abstract: Unlike most machine learning applications, robotics involves physical constraints that make off-the-shelf learning challenging. Difficulties in large-scale data collection and training present a major roadblock to applying today’s data-intensive algorithms. Robot learning has an additional roadblock in evaluation: every physical space is different, making results across labs inconsistent. Two common assumptions of the robot [...]

PhD Thesis Defense
PhD Student
Robotics Institute,
Carnegie Mellon University

Building 4D Models of Objects and Scenes from Monocular Videos

NSH 4305

Abstract: We explore how to infer the time-varying 3D structures of generic, deformable objects, and dynamic scenes from monocular videos. A solution to this problem is essential for virtual reality and robotics applications. However, inferring 4D structures given 2D observations is challenging due to its under-constrained nature. In a casual setup where there is neither [...]

PhD Thesis Defense
PhD Student
Robotics Institute,
Carnegie Mellon University

Learning via Visual-Tactile Interaction

NSH 1305

Abstract: Humans learn by interacting with their surroundings using all of their senses. The first of these senses to develop is touch, and it is the first way that young humans explore their environment, learn about objects, and tune their cost functions (via pain or treats). Yet, robots are often denied this highly informative and [...]

PhD Thesis Defense
PhD Student
Robotics Institute,
Carnegie Mellon University

Redefining the Perception-Action Interface: Visual Action Representations for Contact-Centric Manipulation

GHC 6501

Abstract:  In robotics, understanding the link between perception and action is pivotal. Typically, perception systems process sensory data into state representations like  segmentations and bounding boxes, which a planner uses to plan actions. However, this state estimation approach can fail in environments with partial observability, and in cases with challenging object properties like transparency and deformability.  [...]

PhD Thesis Defense
PhD Student
Robotics Institute,
Carnegie Mellon University

Multi-Human 3D Reconstruction from Monocular Videos

NSH 4305

Abstract: We study the problem of multi-human 3D reconstruction from videos captured in the wild. Human movements are dynamic, and accurately reconstructing them in various settings is crucial for developing immersive social telepresence, assistive humanoid robots, and augmented reality systems. However, creating such a system requires addressing fundamental issues with previous works regarding the data [...]

PhD Thesis Defense
PhD Student
Robotics Institute,
Carnegie Mellon University

How I Learned to Love Blobs: The Power of Gaussian Representations in Differentiable Rendering and Optimization

NSH 3305

Abstract: In this thesis, we explore the use of Gaussian Representations in multiple application areas of computer vision and robotics. In particular, we design a ray-based differentiable renderer for 3D Gaussians that can be used to solve multiple classic computer vision problems in a unified manner. For example, we can reconstruct 3D shapes from color, [...]