Research Collection School Of Computing and Information Systems

Personalized fashion recommendation with visual explanations based on multimodal attention network: Towards visually explainable recommendation

Publication Type

Conference Proceeding Article

Version

publishedVersion

Publication Date

7-2019

Abstract

Fashion recommendation has attracted increasing attention from both industry and academic communities. This paper proposes a novel neural architecture for fashion recommendation based on both image region-level features and user review information. Our basic intuition is that: for a fashion image, not all the regions are equally important for the users, i.e., people usually care about a few parts of the fashion image. To model such human sense, we learn an attention model over many pre-segmented image regions, based on which we can understand where a user is really interested in on the image, and correspondingly, represent the image in a more accurate manner. In addition, by discovering such fine-grained visual preference, we can visually explain a recommendation by highlighting some regions of its image. For better learning the attention model, we also introduce user review information as a weak supervision signal to collect more comprehensive user preference. In our final framework, the visual and textual features are seamlessly coupled by a multimodal attention network. Based on this architecture, we can not only provide accurate recommendation, but also can accompany each recommended item with novel visual explanations. We conduct extensive experiments to demonstrate the superiority of our proposed model in terms of Top-N recommendation, and also we build a collectively labeled dataset for evaluating our provided visual explanations in a quantitative manner.

Discipline

Databases and Information Systems | Graphics and Human Computer Interfaces | OS and Networks

Research Areas

Data Science and Engineering

Publication

Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, Paris, France, July 21-25

First Page

765

Last Page

774

ISBN

9781450361729

Identifier

10.1145/3331184.3331254

Publisher

ACM

City or Country

Paris, France

Citation

CHEN, Xu; CHEN, Hanxiong; XU, Hongteng; ZHANG, Yongfeng; CAO, Yixin; QIN, Zheng; and ZHA, Hongyuan. Personalized fashion recommendation with visual explanations based on multimodal attention network: Towards visually explainable recommendation. (2019). Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, Paris, France, July 21-25. 765-774.
Available at: https://ink.library.smu.edu.sg/sis_research/7463

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.

Additional URL

http://doi.org/10.1145/3331184.3331254

Download

Included in

Databases and Information Systems Commons, Graphics and Human Computer Interfaces Commons, OS and Networks Commons

COinS

Research Collection School Of Computing and Information Systems

Personalized fashion recommendation with visual explanations based on multimodal attention network: Towards visually explainable recommendation

Publication Type

Version

Publication Date

Abstract

Discipline

Research Areas

Publication

First Page

Last Page

ISBN

Identifier

Publisher

City or Country

Citation

Creative Commons License

Additional URL

Included in

Search

Links

Browse

Links

Research Collection School Of Computing and Information Systems

Personalized fashion recommendation with visual explanations based on multimodal attention network: Towards visually explainable recommendation

Author

Publication Type

Version

Publication Date

Abstract

Discipline

Research Areas

Publication

First Page

Last Page

ISBN

Identifier

Publisher

City or Country

Citation

Creative Commons License

Additional URL

Included in

Share

Search

Links

Browse

Links