Publication Type
Conference Proceeding Article
Version
publishedVersion
Publication Date
5-2024
Abstract
Fashion analysis refers to the process of examining and evaluating trends, styles, and elements within the fashion industry to understand and interpret its current state, generating fashion reports. It is traditionally performed by fashion professionals based on their expertise and experience, which requires high labour cost and may also produce biased results for relying heavily on a small group of people. In this paper, to tackle the Fashion Report Generation (FashionReGen) task, we propose an intelligent Fashion Analyzing and Reporting system based the advanced Large Language Models (LLMs), debbed as GPT-FAR. Specifically, it tries to deliver FashionReGen based on effective catwalk analysis, the proposed GPT-FAR system is equipped with several key procedures, namely, catwalk understanding, collective organization and analysis, and report generation. By posing and exploring such an open-ended, complex and domain-specific task of FashionReGen, it is able to test the general capability of LLMs in fashion domain. It also inspires the explorations of more high-level tasks with industrial significance in other domains. Video illustration and more materials of GPT-FAR can be found in https://github.com/CompFashion/FashionReGen.
Keywords
Fashion Report Generation, GPT, Large Language Model, Multimodal Understanding and Generation
Discipline
Artificial Intelligence and Robotics | Graphics and Human Computer Interfaces
Research Areas
Intelligent Systems and Optimization
Areas of Excellence
Digital transformation
Publication
WWW '24: The ACM Web Conference 2024, Singapore, May 13-17
First Page
991
Last Page
994
Identifier
10.1145/3589335.3651232
Publisher
ACM
City or Country
New York
Citation
DING, Yujuan; MA, Yunshan; FAN, Wenqi; YAO, Yige; CHUA, Tat‑Seng; and LI, Qing.
FashionReGen: LLM‑empowered fashion report generation. (2024). WWW '24: The ACM Web Conference 2024, Singapore, May 13-17. 991-994.
Available at: https://ink.library.smu.edu.sg/sis_research/10913
Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://doi.org/10.1145/3589335.3651232
Included in
Artificial Intelligence and Robotics Commons, Graphics and Human Computer Interfaces Commons