Publication Type
Conference Proceeding Article
Version
acceptedVersion
Publication Date
12-2025
Abstract
Combinatorial optimization (CO) problems, central to decision-making scenarios such as logistics and manufacturing, are traditionally solved with problem-specific algorithms that require significant domain expertise. While large language models (LLMs) have shown promise in automating CO problem solving, existing approaches rely on intermediate steps such as code generation or solver invocation, limiting their generality and accessibility. This paper introduces a novel framework that empowers LLMs to serve as end-to-end CO solvers by directly mapping natural language problem descriptions to solutions. We propose a two-stage training strategy: supervised fine-tuning (SFT) first instills solution-generation patterns learned from domain-specific solvers, after which a feasibility-and-optimality-aware reinforcement learning (FOARL) process explicitly mitigates constraint violations and refines solution quality. Evaluation across seven NP-hard CO problems shows that our method achieves a high feasibility rate and reduces the average optimality gap to 1.03-8.20% by tuning a 7B-parameter LLM, surpassing general-purpose LLMs (e.g., GPT-4o), reasoning models (e.g., DeepSeek-R1), and domain-specific heuristics. Our method establishes a unified, language-based pipeline for CO that requires neither extensive code execution nor manual architectural adjustments for different problems, offering a general, language-driven alternative to traditional solver design while maintaining relative feasibility guarantees.
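The FOARL stage described above rewards solutions that satisfy constraints and approach optimality. The short Python sketch below illustrates one plausible shape of such a reward on a small travelling-salesman instance; the function names, the feasibility check, and the gap-to-reward mapping are assumptions made for exposition, not the authors' implementation.

def tour_length(tour, dist):
    """Total length of a closed tour given a distance matrix."""
    return sum(dist[tour[i]][tour[(i + 1) % len(tour)]] for i in range(len(tour)))

def foarl_style_reward(tour, dist, best_known_length):
    """Return a reward in [0, 1]: 0 for infeasible tours, otherwise higher
    as the tour's length approaches the best-known length (a hypothetical
    stand-in for the paper's feasibility-and-optimality-aware reward)."""
    n = len(dist)
    # Feasibility check: the tour must visit every city exactly once.
    if sorted(tour) != list(range(n)):
        return 0.0
    # Optimality gap relative to the best-known solution.
    gap = (tour_length(tour, dist) - best_known_length) / best_known_length
    # Map the gap to a bounded reward; a smaller gap yields a larger reward.
    return 1.0 / (1.0 + max(gap, 0.0))

# Example: a 4-city instance whose optimal closed tour has length 4.
dist = [
    [0, 1, 2, 1],
    [1, 0, 1, 2],
    [2, 1, 0, 1],
    [1, 2, 1, 0],
]
print(foarl_style_reward([0, 1, 2, 3], dist, best_known_length=4))  # feasible, optimal -> 1.0
print(foarl_style_reward([0, 1, 1, 3], dist, best_known_length=4))  # repeats a city -> 0.0

In practice such a reward would be computed on solutions decoded from the LLM's natural-language output; the zero reward for infeasible solutions is what drives the constraint-violation mitigation the abstract describes.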
Keywords
combinatorial optimization, large language models, end-to-end solvers, supervised fine-tuning, reinforcement learning, constraint satisfaction, feasibility-aware training, natural language to solution, NP-hard problems, language-driven optimization
Discipline
Artificial Intelligence and Robotics
Research Areas
Intelligent Systems and Optimization
Publication
Proceedings of the 39th Conference on Neural Information Processing Systems (NeurIPS 2025)
City or Country
San Diego, US
Citation
JIANG, Xia; WU, Yaoxin; LI, Minshuo; CAO, Zhiguang; and ZHANG, Yingqian.
Large language models as end-to-end combinatorial optimization solvers. (2025). Proceedings of the 39th Conference on Neural Information Processing Systems (NeurIPS 2025).
Available at: https://ink.library.smu.edu.sg/sis_research/10566
Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.