MSR Thesis Talk: Khiem Vuong - Robotics Institute Carnegie Mellon University
Loading Events

MSR Thesis Defense

July

13
Thu
Khiem Vuong PhD Student Robotics Institute,
Carnegie Mellon University
Thursday, July 13
12:30 pm to 2:00 pm
GHC 8102
MSR Thesis Talk: Khiem Vuong
Title: Scaling up Camera Calibration and Amodal 3D Object Reconstruction for Smart Cities

Abstract:
Smart cities integrate thousands of outdoor cameras to enhance urban infrastructure, but their automated analysis potential remains untapped due to various challenges. Firstly, the lack of accurate camera calibration information, such as its intrinsics parameters and external orientation, restricts the measurement of real-world distances from the captured video. To address this issue, we propose a scalable framework leveraging publicly available street-level imagery and map data to automatically reconstruct a metric 3D model of the surrounding scene, allowing for accurate calibration of in-the-wild traffic cameras around the world.

Secondly, the presence of occlusions poses significant challenges in object understanding. For example, objects in the scene may be partially occluded by other static or dynamic objects, truncated by the camera’s field of view, or be self-occluded, i.e., the camera facing side of the object is occluded by the opposing side of the object. We present a holistic approach to handle such occlusions for amodal 3D shape reconstruction. The approach starts by learning occlusion categories with human supervision. Then, these learned categories are exploited in a novel framework that uses a mixed representation (keypoints, segmentations and shape basis) for objects to automatically generate a large physically realistic dataset of occlusions using freely available time-lapse imagery from traffic cameras. This dataset provides strong 2D and 3D self-supervision to a network that jointly learns amodal 2D keypoints and segmentations, which are then optimized to reconstruct 3D shapes under constraints provided by occlusion categories. Our system demonstrates significant improvements in amodal 3D reconstruction of heavily occluded objects captured at any time of the day from traffic, hand-held, and in-vehicle cameras, thus enhancing the potential of smart cities to utilize outdoor cameras for effective urban planning.

Committee:
Prof. Srinivasa G. Narasimhan (advisor)
Prof. Shubham Tulsiani
Yufei Ye

Date: Thursday, July 13, 2023
Time: 12:30 PM to 2:00 PM (ET)
Location: GHC 8102
Meeting ID: 987 3554 3332
Passcode: 843349