Publication Type
Conference Proceeding Article
Version
publishedVersion
Publication Date
4-2025
Abstract
Existing neural multi-objective combinatorial optimization (MOCO) methods still exhibit an optimality gap since they fail to fully exploit the intrinsic features of problem instances. A significant factor contributing to this shortfall is their reliance solely on graph-modal information. To overcome this, we propose a novel graph-image multimodal fusion (GIMF) framework that enhances neural MOCO methods by integrating graph and image information of the problem instances. Our GIMF framework comprises three key components: (1) a constructed coordinate image to better represent the spatial structure of the problem instance, (2) a problem-size adaptive resolution strategy during the image construction process to improve the cross-size generalization of the model, and (3) a multimodal fusion mechanism with modality-specific bottlenecks to efficiently couple graph and image information. We demonstrate the versatility of our GIMF by implementing it with two state-of-the-art neural MOCO backbones. Experimental results on classic MOCO problems show that our GIMF significantly outperforms state-of-the-art neural MOCO methods and exhibits superior generalization capability.
Discipline
Artificial Intelligence and Robotics
Research Areas
Intelligent Systems and Optimization
Areas of Excellence
Sustainability
Publication
Proceedings of the 13th International Conference on Learning Representations, Singapore, 2025 April 24
First Page
8355
Last Page
8374
ISBN
9798331320850
Publisher
ICLR
City or Country
Singapore
Citation
CHEN, Jinbiao; WANG, Jiahai; CAO, Zhiguang; and WU, Yaoxin.
Neural multi-objective combinatorial optimization via graph-image multimodal fusion. (2025). Proceedings of the 13th International Conference on Learning Representations, Singapore, 2025 April 24. 8355-8374.
Available at: https://ink.library.smu.edu.sg/sis_research/10554
Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://openreview.net/forum?id=4sJ2FYE65U