Publication Type
Conference Proceeding Article
Version
publishedVersion
Publication Date
6-2021
Abstract
Humans can communicate emotions through a plethora of facial expressions, each with its own intensity, nuances and ambiguities. The generation of such variety by means of conditional GANs is limited to the expressions encoded in the used label system. These limitations are caused either due to burdensome labeling demand or the confounded label space. On the other hand, learning from inexpensive and intuitive basic categorical emotion labels leads to limited emotion variability. In this paper, we propose a novel GAN-based framework which learns an expressive and interpretable conditional space (usable as a label space) of emotions, instead of conditioning on handcrafted labels. Our framework only uses the categorical labels of basic emotions to jointly learn the conditional space as well as the emotion manipulation. Such learning can benefit from the image variability within discrete labels, especially when the intrinsic labels reside beyond the discrete space of the defined. Our experiments demonstrate the effectiveness of the proposed framework, by allowing us to control and generate a gamut of complex and compound emotions, while using only the basic categorical emotion labels during training.
Discipline
Databases and Information Systems | Graphics and Human Computer Interfaces
Research Areas
Data Science and Engineering
Publication
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021 June 19-25
First Page
568
Last Page
577
Publisher
IEEE Computer Society
City or Country
Virtual
Citation
D'APOLITO, S.; PAUNDEL, D.P.; HUANG, Zhiwu; VERGARA, A.R.; and VAN, Gool L..
GANmut: learning interpretable conditional space for a gamut of emotions. (2021). Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021 June 19-25. 568-577.
Available at: https://ink.library.smu.edu.sg/sis_research/6409
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Included in
Databases and Information Systems Commons, Graphics and Human Computer Interfaces Commons