Publication Type
Conference Proceeding Article
Version
publishedVersion
Publication Date
2-2023
Abstract
Layout generation plays a crucial role in graphic design intelligence. One important characteristic of the graphic layouts is that they usually follow certain design principles. For example, the principle of repetition emphasizes the reuse of similar visual elements throughout the design. To generate a layout, previous works mainly attempt at predicting the absolute value of bounding box for each element, where such target representation has hidden the information of higher-order design operations like repetition (e.g. copy the size of the previously generated element). In this paper, we introduce a novel action schema to encode these operations for better modeling the generation process. Instead of predicting the bounding box values, our approach autoregressively outputs the intermediate action sequence, which can then be deterministically converted to the final layout. We achieve state-of-the-art performances on three datasets. Both automatic and human evaluations show that our approach generates high-quality and diverse layouts. Furthermore, we revisit the commonly used evaluation metric FID adapted in this task, and observe that previous works use different settings to train the feature extractor for obtaining real/generated data distribution, which leads to inconsistent conclusions. We conduct an in-depth analysis on this metric and settle for a more robust and reliable evaluation setting.
Keywords
Deep Generative Models & Autoencoders, Computational Photography, Image & Video Synthesis, Deep Neural Network Algorithms, valuation and Analysis (Machine Learning)
Discipline
Artificial Intelligence and Robotics | Theory and Algorithms
Research Areas
Information Systems and Management
Publication
Proceedings of the 37th AAAI Conference on Artificial Intelligence, Washington, DC, February 7-14
Volume
37
First Page
10762
Last Page
10770
ISBN
9781577358800
Identifier
10.1609/aaai.v37i9.26277
Publisher
AAAI Press
City or Country
Washington
Citation
YANG, Huiting; HUANG, Danqing; LIN, Chin-Yew; and HE, Shengfeng.
Layout generation as intermediate action sequence prediction. (2023). Proceedings of the 37th AAAI Conference on Artificial Intelligence, Washington, DC, February 7-14. 37, 10762-10770.
Available at: https://ink.library.smu.edu.sg/sis_research/8088
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://doi.org/10.1609/aaai.v37i9.26277