MSR Thesis Talk - Zhaoyuan Fang - Robotics Institute Carnegie Mellon University

MSR Speaking Qualifier

Zhaoyuan Fang Robotics Institute,
Carnegie Mellon University
Monday, July 25
4:00 pm to 5:00 pm
NSH 4305

Title: Features in Extra Dimensions: Spatial and Temporal Scene Representations

Abstract:
Computer vision models have made great progress in featurizing pixels of images. However, an image is only a projection of the actual 3D scene: occlusions and perspective distortions exist. To arrive at a better representation of the scene itself, extra dimensions are needed to learn spatial or temporal priors.

In this thesis, we propose two methods that introduce extra dimensions for modelling the scene space and time. The first method lifts features from the image plane onto the bird’s eye view (BEV) plane for perception in autonomous driving. Features over the scene space enable our models to handle occlusion better, producing accurate BEV semantic representations. The second method introduces extra dimensions for modelling time, for better geometry-free point tracking. We track points through partial or full occlusions, using components that drive the current state-of-the-art in flow and object tracking, such as learned temporal priors, iterative optimization, and appearance updates. Features allocated over timesteps enable our models to track over long horizons and through occlusions, outperforming previous feature-matching and optical flow methods.

Committee:
Katerina Fragkiadaki (advisor)
Shubham Tulsiani
Adam W. Harley

Zoom: https://cmu.zoom.us/j/6663324632