Publication Type

Conference Proceeding Article

Version

publishedVersion

Publication Date

2-2023

Abstract

Layout generation plays a crucial role in graphic design intelligence. One important characteristic of graphic layouts is that they usually follow certain design principles. For example, the principle of repetition emphasizes the reuse of similar visual elements throughout the design. To generate a layout, previous works mainly attempt to predict the absolute bounding box values of each element, a target representation that hides higher-order design operations such as repetition (e.g., copying the size of the previously generated element). In this paper, we introduce a novel action schema that encodes these operations to better model the generation process. Instead of predicting the bounding box values, our approach autoregressively outputs an intermediate action sequence, which can then be deterministically converted to the final layout. We achieve state-of-the-art performance on three datasets. Both automatic and human evaluations show that our approach generates high-quality and diverse layouts. Furthermore, we revisit the commonly used evaluation metric FID as adapted for this task, and observe that previous works use different settings to train the feature extractor for obtaining the real and generated data distributions, which leads to inconsistent conclusions. We conduct an in-depth analysis of this metric and settle on a more robust and reliable evaluation setting.
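
The idea of decoding an intermediate action sequence into a layout can be illustrated with a minimal sketch. The action names `Place` and `CopySize`, their fields, and the `decode` function are hypothetical and chosen only for illustration; they are not the action schema defined in the paper, and the model that would predict the actions autoregressively is omitted.

```python
from dataclasses import dataclass
from typing import List, Tuple, Union

# A bounding box as (x, y, width, height); this representation is an assumption.
Box = Tuple[float, float, float, float]


@dataclass
class Place:
    """Hypothetical action: give absolute values for a new element."""
    x: float
    y: float
    w: float
    h: float


@dataclass
class CopySize:
    """Hypothetical action: new position, size copied from the previous element."""
    x: float
    y: float


Action = Union[Place, CopySize]


def decode(actions: List[Action]) -> List[Box]:
    """Deterministically convert an action sequence into bounding boxes."""
    layout: List[Box] = []
    for action in actions:
        if isinstance(action, Place):
            layout.append((action.x, action.y, action.w, action.h))
        elif isinstance(action, CopySize):
            if not layout:
                raise ValueError("CopySize requires a previously generated element")
            _, _, prev_w, prev_h = layout[-1]
            layout.append((action.x, action.y, prev_w, prev_h))
        else:
            raise TypeError(f"unknown action: {action!r}")
    return layout


# Three elements of the same size: only the first states its size explicitly.
actions = [Place(0.1, 0.1, 0.3, 0.2), CopySize(0.5, 0.1), CopySize(0.1, 0.4)]
layout = decode(actions)
```

In this sketch the repetition is explicit in the sequence itself: the second and third elements never state a width or height, so the reuse of the first element's size is part of the prediction target rather than something hidden in absolute values.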

Keywords

Deep Generative Models & Autoencoders, Computational Photography, Image & Video Synthesis, Deep Neural Network Algorithms, Evaluation and Analysis (Machine Learning)

Discipline

Artificial Intelligence and Robotics | Theory and Algorithms

Research Areas

Information Systems and Management

Publication

Proceedings of the 37th AAAI Conference on Artificial Intelligence, Washington, DC, February 7-14

Volume

37

First Page

10762

Last Page

10770

ISBN

9781577358800

Identifier

10.1609/aaai.v37i9.26277

Publisher

AAAI Press

City or Country

Washington

Citation

YANG, Huiting; HUANG, Danqing; LIN, Chin-Yew; and HE, Shengfeng. Layout generation as intermediate action sequence prediction. (2023). Proceedings of the 37th AAAI Conference on Artificial Intelligence, Washington, DC, February 7-14. 37, 10762-10770.
Available at: https://ink.library.smu.edu.sg/sis_research/8088

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.

Additional URL

https://doi.org/10.1609/aaai.v37i9.26277

Included in

Artificial Intelligence and Robotics Commons, Theory and Algorithms Commons

Research Collection School Of Computing and Information Systems

Layout generation as intermediate action sequence prediction
