Research Collection School Of Computing and Information Systems

RIGID: Recurrent GAN inversion and editing of real face videos and beyond

Publication Type

Journal Article

Publication Date

1-2025

Abstract

GAN inversion is essential for harnessing the editability of GANs in real images, yet existing methods that invert video frames individually often yield temporally inconsistent results. To address this issue, we present a unified recurrent framework, Recurrent vIdeo GAN Inversion and eDiting (RIGID), designed to enforce temporally coherent GAN inversion and facial editing in real videos explicitly and simultaneously. Our approach models temporal relations between current and previous frames in three ways: (1) by maximizing inversion fidelity and consistency through learning a temporally compensated latent code and spatial features, (2) by disentangling high-frequency incoherent noises from the latent space, and (3) by introducing an in-between frame composition constraint to eliminate inconsistency after attribute manipulation, ensuring that each frame is a direct composite of its neighbors. Compared to existing video- and attribute-specific works, RIGID eliminates the need for expensive re-training of the model, resulting in approximately 60 faster performance. Furthermore, RIGID can be easily extended to other face domains, showcasing its versatility and adaptability. Extensive experiments demonstrate that RIGID outperforms state-of-the-art methods in inversion and editing tasks both qualitatively and quantitatively.

Discipline

Artificial Intelligence and Robotics

Research Areas

Intelligent Systems and Optimization

Publication

International Journal of Computer Vision

Volume

133

Issue

First Page

3437

Last Page

3455

ISSN

0920-5691

Identifier

10.1007/s11263-024-02329-8

Publisher

Springer

Citation

XU, Yangyang; HE, Shengfeng; WONG, Kwan-Yee; and LUO, Ping. RIGID: Recurrent GAN inversion and editing of real face videos and beyond. (2025). International Journal of Computer Vision. 133, (6), 3437-3455.
Available at: https://ink.library.smu.edu.sg/sis_research/11108

Additional URL

https://doi.org/10.1007/s11263-024-02329-8

This document is currently not available here.

COinS

Research Collection School Of Computing and Information Systems

RIGID: Recurrent GAN inversion and editing of real face videos and beyond

Publication Type

Publication Date

Abstract

Discipline

Research Areas

Publication

Volume

Issue

First Page

Last Page

ISSN

Identifier

Publisher

Citation

Additional URL

Search

Links

Browse

Links

Research Collection School Of Computing and Information Systems

RIGID: Recurrent GAN inversion and editing of real face videos and beyond

Author

Publication Type

Publication Date

Abstract

Discipline

Research Areas

Publication

Volume

Issue

First Page

Last Page

ISSN

Identifier

Publisher

Citation

Additional URL

Share

Search

Links

Browse

Links