PhD Speaking Qualifier
Robotics Institute,
Carnegie Mellon University

Task-Driven Modular Networks for Zero-Shot Compositional Learning

Zoom Link Abstract: One of the hallmarks of human intelligence is the ability to compose learned knowledge into novel concepts which can be recognized without a single training example. In contrast, current state-of-the-art methods require hundreds of training examples for each possible category to build reliable and accurate classifiers. To alleviate this striking difference in [...]

VASC Seminar
Qi Guo
PhD Student
Harvard University

Bio-inspired depth sensing using computational optics

Virtual Seminar:  https://cmu.zoom.us/j/249106600   Abstract:  Jumping spiders rely on accurate depth perception for predation and navigation. They accomplish depth perception, despite their tiny brains, by using specialized optics. Each principal eye includes a multitiered retina that simultaneously receives multiple images with different amounts of defocus, and distance is decoded from these images with seemingly little [...]

PhD Thesis Proposal
Robotics Institute,
Carnegie Mellon University

Active Vision: Autonomous Aerial Cinematography with Learned Artistic Decision-Making

Zoom Link Abstract: Aerial cinematography is revolutionizing industries that require live and dynamic camera viewpoints such as entertainment, sports, and security. Fundamentally, it is a tool with immense potential to improve human creativity, expressiveness, and sharing of experiences. However, safely piloting a drone while filming a moving target in the presence of obstacles is immensely [...]

PhD Thesis Proposal
Robotics Institute,
Carnegie Mellon University

Efficient Robot Decision-Making for Achieving Multiple Independent Tasks

Zoom Link Abstract: We focus on robotics applications where a robot is required to accomplish a set of tasks that are partially observable and evolve independently of each other according to their dynamics. One such domain that we target in this work is decision-making for a robot waiter waiting tables at a restaurant. The robot [...]

PhD Speaking Qualifier
PhD Student
Robotics Institute,
Carnegie Mellon University

Image to LiDAR Map Registration using Late Feature Projection

Zoom Link Abstract: Accurate localization is essential for autonomous operation in many problem domains. This is most often performed by comparing LiDAR scans collected in real-time to a HD point cloud based map. While this enables centimeter-level accuracy, it depends on an expensive LiDAR sensor at run time. Recently, efforts have been underway to reduce [...]

PhD Thesis Defense
Robotics Institute,
Carnegie Mellon University

Vision with Small Baselines

Zoom Link Abstract: 3D sensing with portable imaging systems is becoming more and more popular in computer vision applications such as autonomous driving, virtual reality, robotics manipulation and surveillance, due to the decreasing expense and size of RGB cameras. Despite the compactness and portability of the small baseline vision systems, it is well-known that the [...]

PhD Thesis Proposal
Robotics Institute,
Carnegie Mellon University

Provably Constant-time Motion Planning

Zoom Link Abstract: In manufacturing and warehouse scenarios, robots often perform recurring manipulation tasks in structured environments. Fast and reliable motion planning is one of the key elements that ensure efficient operations in such environments. A very common example scenario is of manipulators working at conveyor belts, where they have limited time to pick moving [...]

PhD Thesis Defense
Robotics Institute,
Carnegie Mellon University

Humans In Their Natural Habitat: Training AI to Understand People

Zoom Link Abstract: Computer vision has a great potential to help our daily lives by searching for lost keys, watering flowers or reminding us to take a pill. To succeed with such tasks, computer vision methods need to be trained from real and diverse examples of our daily dynamic scenes. First, we need to give [...]

VASC Seminar
Gemma Roig
Assistant Professor
Department of Computer Science, Goethe University Frankfurt

Task-specific Vision DNN Models and Their Relation for Explaining Different Areas of the Visual Cortex

Virtual VASC Seminar:  https://cmu.zoom.us/j/249106600   Abstract:  Deep Neural Networks (DNNs) are state-of-the-art models for many vision tasks. We propose an approach to assess the relationship between visual tasks and their task-specific models. Our method uses Representation Similarity Analysis (RSA), which is commonly used to find a correlation between neuronal responses from brain data and models. [...]

PhD Speaking Qualifier
Robotics Institute,
Carnegie Mellon University

A Theory of Fermat Paths for Non-line-of-sight Shape Reconstruction

Zoom Link Abstract: Traditionally, computer vision systems and algorithms, such as stereo vision, and shape from shading, have been developed to mimic human vision. As a consequence, a lot of these systems operate under constraints that we take for granted in human vision. An example of such a constraint is that the scene of interest [...]