VASC Seminar
Ronghang Hu
Research Scientist
Facebook Inc.

Reasoning over Text in Images for VQA and Captioning

Abstract: Text in images carries essential information for multimodal reasoning, such as VQA or image captioning. To enable machines to perceive and understand scene text and reason jointly with other modalities, 1) we collect the TextCaps dataset, which requires models to read and reason over text and visual content in the image to generate image [...]

RI Seminar
Avik De
Co-founder & CTO
Ghost Robotics

Design and control of insect-scale bees and dog-scale quadrupeds

Abstract: Enhanced robot autonomy---whether it be in the context of extended tether-free flight of a 100mg insect-scale flapping-wing micro aerial vehicle (FWMAV), or long inspection routes for a quadrupedal robot---is hindered by fundamental constraints in power and computation. With this motivation, I will discuss a few projects I have worked on to circumvent these issues in [...]

VASC Seminar
Jhony Kaesemodel Pontes
Research Scientist
Argo AI

Point Cloud Registration with or without Learning

Abstract: I will be presenting two of our recent works on 3D point cloud registration:   A scene flow method for non-rigid registration: I will discuss our current method to recover scene flow from point clouds. Scene flow is the three-dimensional (3D) motion field of a scene, and it provides information about the spatial arrangement [...]

RI Seminar
Cynthia Sung
Assistant Professor
Mechanical Engineering & Applied Mechanics, University of Pennsylvania

Dynamical Robots via Origami-Inspired Design

Abstract: Origami-inspired engineering produces structures with high strength-to-weight ratios and simultaneously lower manufacturing complexity. This reliable, customizable, cheap fabrication and component assembly technology is ideal for robotics applications in remote, rapid deployment scenarios that require platforms to be quickly produced, reconfigured, and deployed. Unfortunately, most examples of folded robots are appropriate only for small-scale, low-load [...]

VASC Seminar
Arsalan Mousavian
Senior Robotics Research Scientist
NVIDIA

Propelling Robot Manipulation of Unknown Objects using Learned Object Centric Models

Abstract: There is a growing interest in using data-driven methods to scale up manipulation capabilities of robots for handling a large variety of objects. Many of these methods are oblivious to the notion of objects and they learn monolithic policies from the whole scene in image space. As a result, they don’t generalize well to [...]

VASC Seminar
Phillip Isola
Assistant Professor
EECS, MIT

When and Why Does Contrastive Learning Work?

Abstract: Contrastive learning organizes data by pulling together related items and pushing apart everything else. These methods have become very popular but it's still not entirely clear when and why they work. I will share two ideas from our recent work. First, I will argue that contrastive learning is really about learning to forget. Different [...]

VASC Seminar
Ehsan Adeli
Clinical Assistant Professor
Stanford University

Anticipating the Future: forecasting the dynamics in multiple levels of abstraction

Abstract: A key navigational capability for autonomous agents is to predict the future locations, actions, and behaviors of other agents in the environment. This is particularly crucial for safety in the realm of autonomous vehicles and robots. However, many current approaches to navigation and control assume perfect perception and knowledge of the environment, even though [...]

VASC Seminar
Xiaolong Wang
Assistant Professor
UCSD

Learning to Perceive Videos for Embodiment

Abstract: Video understanding has achieved tremendous success in computer vision tasks, such as action recognition, visual tracking, and visual representation learning. Recently, this success has gradually been converted into facilitating robots and embodied agents to interact with the environments. In this talk, I am going to introduce our recent efforts on extracting self-supervisory signals and [...]

VASC Seminar
Xavier Giro Nieto
Associate Professor
Universitat Politecnica de Catalunya

Open Challenges in Sign Language Translation & Production

Abstract: Machine translation and computer vision have greatly benefited of the advances in deep learning. The large and diverse amount of textual and visual data have been used to train neural networks whether in a supervised or self-supervised manner. Nevertheless, the convergence of the two field in sign language translation and production is still poses [...]

RI Seminar
Andrew E. Johnson
Principal Robotics Systems Engineer
NASA Jet Propulsion Laboratory, California Institute of Technology

The Search for Ancient Life on Mars Began with a Safe Landing

1305 Newell Simon Hall

Abstract: Prior mars rover missions have all landed in flat and smooth regions, but for the Mars 2020 mission, which is seeking signs of ancient life, this was no longer acceptable. To maximize the variety of rock samples that will eventually be returned to earth for analysis, the Perseverance rover needed to land in a [...]