Carnegie Mellon University
Understanding and Mitigating Biases in Evaluation
Abstract: There are many problems in real life that involve collecting and aggregating evaluation from people, such as hiring, peer grading and conference peer review. In this thesis, we focus on three sources of biases that arise in such problems, and propose methods to mitigate them. First, we study human bias, that is, the bias [...]
Carnegie Mellon University
MSR Thesis Talk: Manan Shah
ZOOM Link: https://www.google.com/url?q=https://cmu.zoom.us/j/93845075967?pwd%3DbndGc3NvaUVDVFFTTDZvektrNWJqdz09&sa=D&source=calendar&ust=1623592142330000&usg=AOvVaw1xfNPT5c59CQGKzR2bw5sO ID: 93845075967 Passcode: 159459 Title: 3D SLAM for Powered Lower Limb Prosthesis Abstract: During locomotion, humans use visual feedback to adjust their leg movement when navigating the environment. This natural behavior is lost, however, for lower-limb amputees, as current control strategies of prosthetic legs do not typically consider environment perception. With [...]
Carnegie Mellon University
MSR Thesis Talk: Dennis Melamed
Title: Learnable Spatio-Temporal Map Embeddings for Deep Inertial Localization Abstract: Pedestrian localization systems often fuse inertial odometry with map information via hand-defined methods to reduce odometry drift, but such methods are sensitive to noise and struggle to generalize across odometry sources. To address the robustness problem in map utilization, we propose a system that forms a [...]
Learning to Perceive Videos for Embodiment
Abstract: Video understanding has achieved tremendous success in computer vision tasks, such as action recognition, visual tracking, and visual representation learning. Recently, this success has gradually been converted into facilitating robots and embodied agents to interact with the environments. In this talk, I am going to introduce our recent efforts on extracting self-supervisory signals and [...]
Carnegie Mellon University
Self-Learning of Structured Visual Representations
Abstract: Most computer vision models in deployment today are not learning. Instead, they are in a "test" mode, where they will behave the same way perpetually, until they are replaced by newer models. This is a problem, because it means the models may perform poorly as soon as their "test" environment becomes different from their [...]
Carnegie Mellon University
Resource-Constrained Learning and Inference for Visual Perception
Abstract: Real-world applications usually require computer vision algorithms to meet certain resource constraints. In this talk, I will present evaluation methods and principled solutions for both training and testing. First, I will talk about a formal setting for studying training under the non-asymptotic, resource-constrained regime, i.e., budgeted training. We analyze the following problem: "given a [...]
Carnegie Mellon University
Towards Safe and Resilient Autonomy in Multi-Robot Systems
Abstract: Autonomous systems such as robotic systems are envisioned to co-exist with humans in our daily lives, from household service to large-scale warehouse logistics, agricultural monitoring, and smart city. Reliable interactions among robots and humans require provably correct guarantees about safety and performance when designing robot behaviors. While traditional approaches for safety and performance analysis [...]
Carnegie Mellon University
Provably Constant-Time Motion Planning
Abstract: In many robotic applications, including logistics and manufacturing, robots often operate in semi-structured environments and perform highly repetitive manipulation tasks. Additionally, large parts of these environments are static most of the time. Fast and reliable motion planning is one of the key elements that ensure efficient operations in such environments. A very common example [...]
MSR Thesis Talk: Kshitij Goel
Title: Rapid Subsurface Exploration with Multiple Aerial Robots Abstract: This thesis develops a robotic exploration framework that allows for rapid and communication-efficient mapping of unknown environments with a team of aerial robots. Aerial robots can provide rapid and agile mobility in diverse environments where ground mobility is either severely constrained or impossible. However, high-speed flight [...]
Carnegie Mellon University
MSR Thesis Talk: Albert Ye
Title: BusEdge: Efficient Live Video Analytics for Transit Buses via Edge Computing Abstract: Many vehicles like transit buses are now routinely fitted with cameras. These live visual data are invaluable to achieve real-time traffic monitoring, but it is intractable to handle such a gigantic amount of data either locally or in the cloud due to [...]
Carnegie Mellon University
Planning to Optimize and Learn Reward in Navigation Tasks in Structured Environments with Time Constraints
Abstract: Planning problems in which an agent must perform tasks for reward by navigating its environment while constrained by time and location have a wide variety of applications in robotics. Many real-world environments in which such planning problems apply, such as office buildings or city streets, are very structured. They consist of passages with notable [...]
Carnegie Mellon University
MSR Thesis Talk: Yunze Man
Title: Multi-Echo 3D Object Detection Abstract: LiDAR sensors can be used to obtain a wide range of measurement signals other than a simple 3D point cloud, and those signals can be leveraged to improve perception tasks like 3D object detection. A single laser pulse can be partially reflected by multiple objects along its path, resulting [...]
Carnegie Mellon University
Krishna Uppala – MSR Thesis Talk
Title: Exemplar free video retrieval. Abstract: Video retrieval of activities has a wide range of applications. In the traditional mode of operation, a collection of example videos describing the activities are given and the retrieval technique identifies other samples in a dataset that semantically match the examples provided. However, retrieval using a collection of example [...]
Ruixuan Liu – MSR Thesis Talk
Title: Data-efficient Behavior Prediction for Safe Human-Robot Collaboration. Abstract: Predicting human behavior is critical to facilitate safe and efficient human-robot collaboration (HRC). However, human behavior is difficult to predict due to the scarcity of human motion data. This work explores using online adaptation, an online approach, and data augmentation, an offline approach, to deal with the [...]
MSR Thesis Talk: Vidhi Jain
Title: Explainability in navigation policies Abstract: Today's autonomous agents have improved performance with learning and planning algorithms, but the applicability of such agents in the human-inhabited world is confined. Humans find it hard to explain the model's decision-making and thus, may not trust it as a teammate. While working with a machine learning model that [...]
Carnegie Mellon University
MSR Thesis Talk: Aditya Sripada
Title: Turning Behavior of Running Systems induced by Leg Placement Abstract: Compared to legged robots, animals and humans can perform much faster and larger turns, even when they run at high speeds. Such rapid turns require the body of a runner to reorient dynamically and in synchrony with its redirection during stance. While it is [...]
Carnegie Mellon University
MSR Thesis Talk: Rohit Jena
Title: Learning Mental Models of Experts in a Simulated Search and Rescue Scenario Abstract: Search and Rescue is a task where the rescuers need to be cognitively agile, strategically consistent, and efficient to save as many trapped victims as possible. In a team scenario, the rescuers must additionally coordinate with each other based on [...]
MSR Thesis Talk: Mohamad Qadri
Title: Robotic Vision for 3D Modeling and Sizing in Agriculture Abstract: Obtaining accurate perceptual information is a critical component in agricultural robotics since there is a heavy need for interaction with the environment to perform tasks such as pruning, harvesting, and phenotyping. In this thesis, we tackle the problem of perception and 3D modeling in [...]
Carnegie Mellon University
MSR Thesis Talk: Alex Baikovitz
Title: Underground Representations for Robot Localization and Mapping Abstract: There has been exciting recent progress in using radar as a sensor for robot navigation given its increased robustness to varying environmental conditions. However, within these different radar perception systems, ground penetrating radar (GPR) remains under-explored. By measuring structures beneath the ground, GPR can provide stable features that [...]
Carnegie Mellon University
MSR Thesis Talk: Ankur Deka
Title: On combining Reinforcement Learning & Adversarial Training Abstract: Reinforcement Learning (RL) allows us to train an agent to excel at a given sequential decision-making task by optimizing for a reward signal. Adversarial training involves a joint optimization scheme where an agent and an adversary compete against each other. In this work, we explore some [...]
MSR Thesis Talk: Zhengyi Luo
Title: Physics-based object-aware ego-centric human pose estimation Abstract: We investigate the roles of body kinematics, dynamics, and objects for 3D human pose estimation using a head-mounted camera. Human kinematics models play a key role in encoding the natural range of human motion, while dynamics models can react to the spatial arrangement between humans and objects [...]
MSR Thesis Talk: Tarasha Khurana
Title: Detecting Invisible People Abstract: Monocular object detection and tracking have improved drastically in recent years, but rely on a key assumption: that objects are visible to the camera. Many offline tracking approaches reason about occluded objects post-hoc, by linking together tracklets after the object re-appears, making use of reidentification (ReID). However, online tracking [...]
Carnegie Mellon University
MSR Thesis Talk: Sujay Bajracharya
work, we address the problem of goal-directed cloth manipulation, a challenging task due to the deformability of cloth. Our insight is that optical flow, a technique normally used for motion estimation in video, can also provide an effective representation for corresponding cloth poses across observation and goal images. We introduce FabricFlowNet (FFN), a cloth manipulation [...]
Carnegie Mellon University
MSR Thesis Talk: Edward Chen
Title: Towards Practical Ultrasound AI Across Real-World Patient Diversity Abstract: In the case of high-tempo, traumatic scenarios on the battlefield, real-time ultrasound (US) imaging serves as an enabler for countless possible robotic interventions. Having the ability to automatically segment anatomical landmarks in the body, such as arteries, veins, ligaments, and veins, for percutaneous procedures remains [...]
Carnegie Mellon University
MSR Thesis Talk: Haochen Wang
Title: Audiovisual ontology and robust representations via cross-modal fusion Abstract: The shrill of an ambulance siren and flashing lights, the hum of an accelerating car — important events often come to us simultaneously through sight and sound. We first consider the problem of identifying these events from raw, unlabeled audiovisual data of agents interacting with [...]
Social Navigation with Pedestrian Groups
Abstract: Autonomous navigation in human crowds (i.e., social navigation) presents several challenges: The robot often needs to rely on its noisy sensors to identify and localize the pedestrians in human crowds; The robot needs plan efficient paths to reach its goals; The robot needs to do so in a safe and socially appropriate manner. In [...]
Carnegie Mellon University
MSR Thesis Talk: Viraj Parimi
Title: T-HTN: Timeline Based HTN Planning for Multi-Agent Robots Abstract: Planning in mission-critical systems like deep-space habitats with onboard robotic systems must be robust to unforeseen circumstances. Such systems are expected to complete a set of goals with different deadlines each day for routine maintenance while also accounting for emergencies. With the presence of [...]
Carnegie Mellon University
Robust and Scalable Perception For Autonomy
Abstract: Autonomous mobile robots have the potential to drastically improve the quality of our daily life. For example, self-driving vehicles could make transportation safer and more affordable. To navigate complex environments, such robots need a perception system that translates raw sensory data to high-level understanding. This thesis focuses on two fundamental challenges in learning such [...]
Carnegie Mellon University
MSR Thesis Talk: Yiming Zuo
Title: Towards Self-supervised Object Discovery and Tracking Abstract: Object discovery and multiple object tracking (MOT) are two highly interrelated tasks that are known to be fundamental problems in computer vision, and are crucial for video understanding. Most existing methods rely on supervised training with human annotations, which is laborious and expensive. In this thesis, [...]
Carnegie Mellon University
MSR Thesis Talk: Qiao Gu
Title: Towards Object-generic 6D Pose Estimation Abstract: Pose estimation is a basic module in many robot manipulation pipelines. Estimating the pose of objects in the environment can be useful for grasping, motion planning, or manipulation. However, current state-of-the-art methods for pose estimation either rely on large annotated training sets or simulated data. Further, the long [...]
Carnegie Mellon University
MSR Thesis Talk: Divam Gupta
Title: End-to-End Deep Stereo Layout Estimation Abstract: Accurate layout estimation is crucial for planning and navigation in robotics applications, such as self-driving. In this paper, we introduce the Stereo Bird's Eye ViewNetwork (SBEVNet), a novel supervised end-to-end framework for estimation of bird's eye view layout from a pair of stereo images. Although our network [...]
Carnegie Mellon University
Michael Tasota – MSR Thesis Talk
Title: Design of a Multimodal System for Social Emotional Learning in Early Childhood Classrooms Abstract: As the prevalence of mobile and touch-based devices continues to expand in society, so too does its impact on young children. With educational technologies also on the rise, young children benefit most from those technologies that are designed to [...]
Carnegie Mellon University
MSR Thesis Talk: Aaron Huang
Title: End-to-End Methods for Autonomous Driving in Simulation Abstract: Fully autonomous driving is considered one of the grand challenges of modern technology and a variety of approaches have emerged for creating and evaluating autonomous driving agents. The self-driving industry typically adopts a modular software architecture and uses large fleets of autonomous vehicles for data [...]
Carnegie Mellon University
MSR Thesis Talk
Title: Retrieval-based Novel Activity Detection in Untrimmed Videos Abstract: Accurately detecting activities in untrimmed videos is a challenging task as systems need to handle variance in object scales, multiple viewpoints, and multiple types of activities. Furthermore, in a real-world scenario, activity detectors are often required to detect novel kinds of activities when the need [...]
Carnegie Mellon University
Semantic Mapping for Autonomous Navigation and Exploration
Abstract: The last two decades have seen enormous progress in the sensors and algorithms for 3D perception, giving robots the means to build accurate spatial maps and localize themselves in them in real time. The geometric information in these maps is invaluable for navigation while avoiding obstacles, but insufficient, by itself, for robots to robustly [...]
Carnegie Mellon University
MSR Thesis Talk: Howard Coffin
Title: Mutual Information Maps for Single and Multi-Target Ergodic Search Abstract: This thesis addresses use of multi-agent systems to perform autonomous search for moving targets. Target search has many applications, including search and rescue and surveillance, but most robotic systems used in these situations require human operators. Recent works have used ergodic search methods to [...]
Carnegie Mellon University
MSR Thesis Talk: Wanwen Chen
Title: Ultrasound-based Needle Tracking and Lateral Manipulation Planning for Common Needle Steering Abstract: Many medical interventions require inserting a needle towards the desired site. Robotic needle insertion can potentially provide reliable needle access independent of practitioner experience and availability. However, when the needle is inserted into heterogeneous tissue, the complicated needle-tissue interaction introduces multiple challenges [...]
Carnegie Mellon University
MSR Thesis Talk: Rohit Garg
Title: Coordinated Aerial Exploration of Subterranean Voids Abstract: Subterranean exploration has been thrust into the spotlight by the recent DARPA Subterranean Challenge. Teams are tasked with developing a multi-agent system that can rapidly navigate through unknown underground environments while sending back maps and other semantic information. A system with such capabilities has a wide [...]
Carnegie Mellon University
MSR Thesis Talk: Harvey Shi
Title: The design of a compact laser scanner and an integrated simulation environment for smart manufacturing Abstract: Advanced manufacturing is on the rise with the notable progress in the additive and hybrid manufacturing field. However, most of the existing solutions are open loop without sensory feedback because of the lack of a small form factor [...]
Carnegie Mellon University
MSR Thesis Talk: Daqian Cheng
Zoom link: https://cmu.zoom.us/j/99726348928?pwd=azlicW9qZk1Nc3BkQXBLT2d6djVBQT09 Meeting ID: 997 2634 8928 Passcode: 136122 Title: SLAM with Laser Profilers for High Definition Mapping in Confined Spaces Abstract: Three-dimensional reconstruction in confined spaces is important for the manufacturing of aircraft wings, the inspection of narrow pipes, the examination of turbine blades, etc. It is also challenging because confined spaces tend [...]
Carnegie Mellon University
MSR Thesis Talk: Quint Hester
Title: Graph Neural Networks for Wind Farm Yaw Optimization Abstract: In order to mitigate the effects of climate change, it is important to maximize the power output of our wind energy resources. Wind farm control is typically greedy, with all turbines facing directly into the wind. Due to wake interactions, this leads to reductions [...]
Carnegie Mellon University
MSR Thesis Talk: Henry Zhang
Title: On Visual-Aided LiDAR-Inertial Odometry System in Challenging Subterranean Environments Abstract: Simultaneous Localization and Mapping (SLAM) is one of the fundamental components for autonomous robotic exploration. The goal of SLAM is to create an accurate map for the environment and provide robust state estimation for planning, control, and perception tasks. However, due to the [...]
Carnegie Mellon University
Efficient Robot Decision-Making for Achieving Multiple Independent Tasks
Abstract: We focus on robotics applications where a robot is required to accomplish a set of tasks that are partially observable and evolve independently of each other according to their dynamics. One such domain that we target in this work is decision-making for a robot waiter waiting tables at a restaurant. The robot waiter should [...]
MSR Thesis Talk: Ryan Darnley
Title: Communication Network Flow Control With LQR and Factor Graphs Abstract: Advances in the estimation and more recently the controls communities have leveraged the use of factor graphs to overcome problems of scalability and runtime in dynamical systems. In continuation of that work, we look to use factor graphs to perform LQR control on [...]
Carnegie Mellon University
MSR Thesis Talk: Shirsendu Halder
Title: Robust 3D reconstruction in noisy environments Abstract: Automated inspection in industrial manufacturing can minimize the total production cost of a part. Current inspection solutions often involve measuring a part manually, which interrupts the machining process. We present two non-contact real-time systems which integrate visual inspection in-line with CNC (computer numerical control) machines and ensures [...]
MSR Thesis Talk: Tejus Gupta
ZOOM Link: https://cmu.zoom.us/j/2388465851 Meeting ID: 238 846 5851 Google calendar invite link Title: Adaptive and Efficient Models for Intent Recognition Abstract: Assistive robots should have the ability to understand the intent of humans, predict their behavior, and plan to provide anticipatory assistance in complex real-life environments. In this thesis, we present adaptive and efficient algorithms for recognizing [...]
Carnegie Mellon University
MSR Thesis Talk: Siddharth Agrawal
Title: Learning to Imitate, Adapt and Communicate Abstract: For AI agents to co-exist with humans, they need to be able to learn from us humans, adapt to any perceived changes in our behavior, and communicate in a manner that is easily interpretable. In this work, we investigate the following 3 subproblems: Imitation Learning, Adaptation in Human-Agent [...]
Carnegie Mellon University
MSR Thesis Talk: Danny Vedova
Link: https://cmu.zoom.us/j/92810207130?pwd=blphbkovYVA2b1VoSmhuZ0h2bFRzZz09 Meeting ID: 928 1020 7130 Passcode: 721620 Title: Design and Analysis of a Control Pipeline for a Manipulator in Space Abstract: Many satellites are rapidly reaching the end of their lifespans, and risk de-orbiting if no action is taken. One common problem satellites face towards the end of their lifespans is that they are running [...]
Carnegie Mellon University
MSR Thesis Talk: Fan Jia
Zoom Link: https://cmu.zoom.us/j/9793644214?pwd=enpab3dPc0FBWVdqWDNrcVlabUJCZz09 Meeting ID: 979 364 4214 Passcode: 304453 Title: Trust-Aware Behavior Reflection for Robot Swarm Self-Healing Abstract: The deployment of robot swarms is influenced by real-world factors, such as motor issues, sensor failure, and wind disturbances. These factors cause the appearance of faulty robots. In a decentralized swarm, sharing incorrect information from faulty robots will lead to [...]
Carnegie Mellon University
MSR Thesis Talk: Tejas Zodage
Meeting Link: https://cmu.zoom.us/j/2427727031 Title: Point Cloud Registration as a Classification Problem Abstract: Point Cloud Registration(PCR) is an important step in fields such as robotic manipulation, augmented and virtual reality, SLAM, etc. In the context of computer vision, registration, in general, refers to the process of aligning data obtained from different frames, and as the [...]
Carnegie Mellon University
Decentralized Navigation of Quadrotor Teams in Uncertain Workspaces
Abstract: A fundamental requirement for realizing scalable and responsive real-world multi-robot systems for time-sensitive critical applications such as search and rescue or building clearance is a motion-planning and coordination framework that exhibits three essential properties. The first property is safety that encompasses aspects relating to kinodynamic feasibility and collision-avoidance. The second property is reliability that [...]
Control of Robots with Nonstationary Dynamics
Abstract: Robots can be constructed with fewer resources and less strict constraints than is generally believed. Soft robots can be constructed with very few parts and from a wide variety of materials. This makes them a potentially appealing choice for applications where there are resource constraints on system fabrication. However, soft robot dynamics are difficult [...]
Carnegie Mellon University
Bayesian Models for Science-Driven Robotic Exploration
Abstract Planetary rovers allow for science investigations at remote locations. They have traversed many kilometers and made major scientific discoveries. However, rovers spend a considerable amount of time awaiting instructions from mission control. The reason is that they are designed for highly supervised data collection, not for autonomous exploration. The exploration of farther worlds will [...]
Open Challenges in Sign Language Translation & Production
Abstract: Machine translation and computer vision have greatly benefited of the advances in deep learning. The large and diverse amount of textual and visual data have been used to train neural networks whether in a supervised or self-supervised manner. Nevertheless, the convergence of the two field in sign language translation and production is still poses [...]
The Search for Ancient Life on Mars Began with a Safe Landing
Abstract: Prior mars rover missions have all landed in flat and smooth regions, but for the Mars 2020 mission, which is seeking signs of ancient life, this was no longer acceptable. To maximize the variety of rock samples that will eventually be returned to earth for analysis, the Perseverance rover needed to land in a [...]
Meshlet Primitives for Dense RGB-D SLAM in Dynamic Environments
Abstract: Dense RGB-D SLAM has been well established as a method for achieving robust localization while providing high quality dense surface reconstruction. However, despite significant progress, dense RGB-D SLAM has remained difficult to achieve on computationally constrained platforms, such as those used on autonomous aerial vehicles. A significant limiting factor in the current state of [...]
3D Recognition with self-supervised learning and generic architectures
Abstract: Supervised learning relies on manual labeling which scales poorly with the number of tasks and data. Manual labeling is especially cumbersome for 3D recognition tasks such as detection and segmentation and thus most 3D datasets are surprisingly small compared to image or video datasets. 3D recognition methods are also fragmented based on the type [...]
Carnegie Mellon University
Heuristics for routing and scheduling of Spatio-temporal type problems in industrial environments
Abstract: Spatio-temporal problems are fairly common in industrial environments. In practice, these problems come with different characteristics and are often very hard to solve optimally. So, practitioners prefer to develop heuristics that exploit mathematical structure specific to the problem for obtaining good performance. In this thesis, we will present work on heuristics for 3 different [...]
Computational Light Transport with Interferometry
Abstract: Optical interferometry is the measurement of small, sub-wavelength distances by exploiting the wave nature of light. Due to its capability to resolve micron-scale displacements, it has found widespread applications in biomedical imaging, industrial fabrication, physics, and astrophysics. In this thesis, we introduce a set of techniques we call computational interferometry, that bring the benefits [...]
Rapid Adaptation for Robot Learning
Abstract: How can we train a robot to generalize to diverse environments? This question underscores the holy grail of robot learning research because it is difficult to supervise an agent for all possible situations it can encounter in the future. We posit that the only way to guarantee such a generalization is to continually learn and [...]
Carnegie Mellon University
3D Reconstruction using Differential Imaging
Abstract: 3D reconstruction has been at the core of many computer vision applications, including autonomous driving, visual inspection in manufacturing, and augmented and virtual reality (AR/VR). Despite the tremendous progress made over the years, there remain challenging open-research problems. This thesis addresses three such problems in 3D reconstruction. First, we address the problem of defocus [...]
Robotic Cave Exploration for Search, Science, and Survey
Abstract: Robotic cave exploration has the potential to create significant societal impact through facilitating search and rescue, in the fight against antibiotic resistance (science), and via mapping (survey). But many state-of-the-art approaches for active perception and autonomy in subterranean environments rely on disparate perceptual pipelines (e.g., pose estimation, occupancy modeling, hazard detection) that process the same underlying sensor data in [...]
Humans, hands, and horses: 3D reconstruction of articulated object categories using strong, weak, and self-supervision
Abstract: Reconstructing 3D objects from a single 2D image is a task that humans perform effortlessly, yet computer vision so far has only robustly solved 3D face reconstruction. In this talk we will see how we can extend the scope of monocular 3D reconstruction to more challenging, articulated categories such as human bodies, hands and [...]
Enabling Grounded Language Communication for Human-Robot Teaming
Abstract: The ability for robots to effectively understand natural language instructions and convey information about their observations and interactions with the physical world is highly dependent on the sophistication and fidelity of the robot’s representations of language, environment, and actions. As we progress towards more intelligent systems that perform a wider range of tasks in a [...]
Looking behind the Seen in Order to Anticipate
Abstract: Despite significant recent progress in computer vision and machine learning, personalized autonomous agents often still don’t participate robustly and safely across tasks in our environment. We think this is largely because they lack an ability to anticipate, which in turn is due to a missing understanding about what is happening behind the seen, i.e., [...]
Robots that Learn through Language
Abstract: Advances in perception have been integral to transitioning robots from machines restricted to factory automation to autonomous agents that operate robustly in unstructured environments. As our surrogates, robots enable people to explore the deepest depths of the ocean and distant regions of space, making discoveries that would otherwise be impossible. The age of robots [...]
Towards Reconstructing Any Object in 3D
Abstract: The world we live in is incredibly diverse, comprising of over 10k natural and man-made object categories. While the computer vision community has made impressive progress in classifying images from such diverse categories, the state-of-the-art 3D prediction systems are still limited to merely tens of object classes. A key reason for this stark difference [...]
Carnegie Mellon University
Beyond rigid objects: Data-driven Methods for Manipulation of Deformable Objects
Abstract: Manipulation of deformable objects challenges common assumptions made for rigid objects. Deformable objects have high intrinsic state representation and complex dynamics with high degrees of freedom, making it difficult for state estimation and planning. The completed work can be divided into two parts. In the first part, we explore reinforcement learning (RL) as a [...]
Carnegie Mellon University
Simulation, Perception, and Generation of Human Behavior
Abstract: Understanding and modeling human behavior is fundamental to almost any computer vision and robotics applications that involve humans. In this thesis, we take a holistic approach to human behavior modeling and tackle its three essential aspects --- simulation, perception, and generation. Throughout this thesis, we show how the three aspects are deeply connected and [...]
The Clinician’s AI Partner: Augmenting Clinician Capabilities Across the Spectrum of Healthcare
Abstract: Clinicians often work under highly demanding conditions to deliver complex care to patients. As our aging population grows and care becomes increasingly complex, physicians and nurses are now also experiencing feelings of burnout at unprecedented levels. In this talk, I will discuss possibilities for computer vision to function as a partner to clinicians, and to augment their capabilities, across [...]
The Unusual Effectiveness of Abstractions for Assistive AI
Abstract: Can we balance efficiency and reliability while designing assistive AI systems? What would such AI systems need to provide? In this talk I will present some of our recent work addressing these questions. In particular, I will show that a few fundamental principles of abstraction are surprisingly effective in designing efficient and reliable AI [...]
Reliable and Accessible Visual Recognition
Abstract: As visual recognition models are developed across diverse applications; we need the ability to reliably deploy our systems in a variety of environments. At the same time, visual models tend to be trained and evaluated on a static set of curated and annotated data which only represents a subset of the world. In this [...]
Fake It Till You Make It: Face analysis in the wild using synthetic data alone
Abstract: In this seminar I will demonstrate how synthetic data alone can be used to perform face-related computer vision in the wild. The community has long enjoyed the benefits of synthesizing training data with graphics, but the domain gap between real and synthetic data has remained a problem, especially for human faces. Researchers have tried [...]
Carnegie Mellon University
Structured Learning for Robust Robot Manipulation
Abstract: Robust and generalizable robots that can autonomously manipulate objects in semi-structured environments can bring material benefits to society. Data-driven learning approaches are crucial for enabling such systems by identifying and exploiting patterns in semi-structured environments, allowing robots to adapt to novel scenarios with minimal human supervision. However, despite significant prior work in learning for [...]
Robotics and Warehouse Automation at Berkshire Grey
Abstract: This talk tells the Berkshire Grey story, from its founding in 2013 to its IPO earlier this year — the first robotics IPO since iRobot over15 years ago. Berkshire Grey produces automated systems for e-commerce order fulfillment, parcel sortation, store replenishment, and related operations in warehouses, distribution centers, and in the back ends of [...]
An Experimental Design Perspective on Model-Based Reinforcement Learning
Abstract: In many practical applications of RL, it is expensive to observe state transitions from the environment. For example, in the problem of plasma control for nuclear fusion, computing the next state for a given state-action pair requires querying an expensive transition function which can lead to many hours of computer simulation or dollars of [...]
Learning Model Preconditions for Planning with Multiple Models
Abstract: Different models can provide differing levels of fidelity when a robot is planning. Analytical models are often fast to evaluate but only work in limited ranges of conditions. Meanwhile, physics simulators are effective at modeling complex interactions between objects but are typically more computationally expensive. Learning when to switch between the various models can [...]
Leveraging StyleGAN for Image Editing and Manipulation
Abstract: StyleGAN has recently been established as the state-of-the-art unconditional generator, synthesizing images of phenomenal realism and fidelity, particularly for human faces. With its rich semantic space, many works have attempted to understand and control StyleGAN’s latent representations with the goal of performing image manipulations. To perform manipulations on real images, however, one must learn to [...]
Resilient Exploration in SubT Environments: Team Explorer’s Approach and Lessons Learned in the Final Event
Abstract: Subterranean robot exploration is difficult with many mobility, communications, and navigation challenges that require an approach with a diverse set of systems, and reliable autonomy. While prior work has demonstrated partial successes in addressing the problem, here we convey a comprehensive approach to address the problem of subterranean exploration in a wide range of [...]
Simulation-based Planning for Pick-and-Place in Heavy Clutter using Non-prehensile Manipulation
Abstract: Robot manipulation in domestic households, industrial manufacturing and warehouses might require contact-rich interactions with objects in the environment. For pick-and-place style grasping tasks in cluttered scenes, it can be more economical for the robot to rely on non-prehensile actions vis-à-vis deliberate prehensile rearrangement. Non-prehensile actions also let the robot manipulate large and bulky objects [...]
Carnegie Mellon University
Relationships in instance segmentation and anomaly detection
Abstract: This thesis primarily covers work on two different tasks in computer vision: (1) anomaly detection and (2) instance segmentation. Anomaly detection is an underexplored unsupervised problem that has existed in many fields. On the other hand, instance (and panoptic) segmentation is a supervised problem that can leverage the powerful data and key developments from [...]
Next-Gen Video Communication
Abstract: Video communication connects our world. It is necessary in conducting business, educational and personal activities across different geographical locations. However, the quality of an average user’s video communication is dramatically worse than that of professionally created videos in news broadcasts, talk shows, and on YouTube. This is because professionally created videos are often captured with [...]
Carnegie Mellon University
Learning with Diverse Forms of Imperfect and Indirect Supervision
Abstract: High capacity Machine Learning (ML) models trained on large, annotated datasets have driven impressive advances in several fields including natural language processing and computer vision, in turn leading to impactful applications of ML in areas such as healthcare, e-commerce, and predictive maintenance. However, obtaining annotated datasets at the scale required for training such models [...]
MRSD Annual Poster Presentation
Four student teams from the MRSD program will use posters, videos, and hardware to show their project work on robots for room disinfection, search & rescue, increasing human capability via a third arm, and increased-efficiency factory-floor obstacle avoidance.
Carnegie Mellon University
3D Representation Learning for Perception and Prediction: A Modular Yet Highly Integrated Approach
Abstract: Modularized and cascaded autonomy stacks (object detection, then tracking and then trajectory prediction) have been widely adopted in many autonomous systems such as self-driving cars due to its interpretability. In this talk, I advocate the use of such a modular approach but improve its accuracy and robustness by developing different 3D representations for each [...]