MSR Thesis Talk: Muyang Li - Robotics Institute Carnegie Mellon University
Loading Events

MSR Thesis Defense

April

21
Fri
Muyang Li MSR Student Robotics Institute,
Carnegie Mellon University
Friday, April 21
10:00 am to 11:00 am
NSH 4305
MSR Thesis Talk: Muyang Li

Title: Efficient Spatially Sparse Inference for Conditional GANs and Diffusion Models

Abstract:

During image editing, existing deep generative models tend to re-synthesize the entire output from scratch, including the unedited regions. This leads to a significant waste of computation, especially for minor editing operations. In this work, we present Spatially Sparse Inference (SSI), a general-purpose technique that selectively performs computation for edited regions and accelerates various generative models, including both conditional GANs and diffusion models. Our key observation is that users tend to gradually change the input image. This motivates us to cache and reuse the feature maps of the original image. Given an edited image, we sparsely apply the convolutional filters to the edited regions while reusing the cached features for the unedited areas. Based on our algorithm, we further propose Sparse Incremental Generative Engine (SIGE) to convert the computation reduction to latency reduction on off-the-shelf hardware. With about 1%-area edits, our method reduces the computation of DDPM by 7.5×, Stable Diffusion by 8.2×, and GauGAN by 18× while preserving the visual fidelity. With SIGE, we accelerate the inference time of DDPM by 3.0× on NVIDIA RTX 3090 and 6.6× on Apple M1 Pro CPU, Stable Diffusion by 7.2× on 3090, and GauGAN by 5.6× on 3090 and 14× on M1 Pro CPU.

Committee:

Prof. Jun-Yan Zhu (advisor)

Prof. Song Han, MIT

Prof. Tianqi Chen

Jinkun Cao