Publication Type

Journal Article

Version

publishedVersion

Publication Date

11-2022

Abstract

Photorealistic multiview face synthesis from a single image is a challenging problem. Existing works mainly learn a texture mapping model from the source to the target faces. However, they rarely consider the geometric constraints on the internal deformation arising from pose variations, which causes a high level of uncertainty in face pose modeling, and hence, produces inferior results for large pose variations. Moreover, current methods typically suffer from undesired facial details loss due to the adoption of the de-facto standard encoder-decoder architecture without any skip connections (SCs). In this article, we directly learn and exploit geometric constraints and propose a fully deformable network to simultaneously model the deformations of both landmarks and faces for face synthesis. Specifically, our model consists of two parts: a deformable landmark learning network (DLLN) and a gated deformable face synthesis network (GDFSN). The DLLN converts an initial reference landmark to an individual-specific target landmark as delicate pose guidance for face rotation. The GDFSN adopts a dual-stream structure, with one stream estimating the deformation of two views in the form of convolution offsets according to the source pose and the converted target pose, and the other leveraging the predicted deformation offsets to create the target face. In this way, individual-aware pose changes are explicitly modeled in the face generator to cope with geometric transformation, by adaptively focusing on pertinent regions of the source face. To compensate for offset estimation errors, we introduce a soft-gating mechanism for adaptive fusion between deformable features and primitive features. Additionally, a pose-aligned SC (PASC) is tailored to propagate low-level input features to the appropriate positions in the output features for further enhancing the facial details and identity preservation. Extensive experiments on six benchmarks show that our approach performs favorably against the state-of-the-arts, especially with large pose changes. Code is available at https://github.com/cschengxu/FDFace.

Keywords

Faces, Face recognition, Strain, Deformable models, Electronic mail, Convolution, Standards, Deformable convolution, gating, multiview face synthesis, pose-invariant face recognition

Discipline

Information Security

Research Areas

Information Systems and Management

Publication

IEEE Transactions on Neural Networks and Learning Systems

Volume

Issue

First Page

8854

Last Page

8868

ISSN

2162-2388

Identifier

10.1109/TNNLS.2022.3216018

Publisher

IEEE

Citation

XU, Cheng; LI, Keke; LUO, Xuandi; XU, Xuemiao; HE, Shengfeng; and ZHANG, Kun. Fully deformable network for multiview face image synthesis. (2022). IEEE Transactions on Neural Networks and Learning Systems. 35, (7), 8854-8868.
Available at: https://ink.library.smu.edu.sg/sis_research/7861

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.

Additional URL

https://doi.org/10.1109/TNNLS.2022.3216018

Download

Included in

Information Security Commons

COinS

Research Collection School Of Computing and Information Systems

Fully deformable network for multiview face image synthesis

Publication Type

Version

Publication Date

Abstract

Keywords

Discipline

Research Areas

Publication

Volume

Issue

First Page

Last Page

ISSN

Identifier

Publisher

Citation

Creative Commons License

Additional URL

Included in

Search

Links

Browse

Links

Research Collection School Of Computing and Information Systems

Fully deformable network for multiview face image synthesis

Author

Publication Type

Version

Publication Date

Abstract

Keywords

Discipline

Research Areas

Publication

Volume

Issue

First Page

Last Page

ISSN

Identifier

Publisher

Citation

Creative Commons License

Additional URL

Included in

Share

Search

Links

Browse

Links