Publication Type
Conference Proceeding Article
Version
publishedVersion
Publication Date
7-2025
Abstract
Large Language Models (LLMs) have emerged as a transformative AI paradigm, profoundly influencing broad aspects of daily life. Despite their remarkable performance, LLMs exhibit a fundamental limitation: hallucination—the tendency to produce misleading outputs that appear plausible. This inherent unreliability poses significant risks, particularly in high-stakes domains where trustworthiness is essential. On the other hand, Formal Methods (FMs), which share foundations with symbolic AI, provide mathematically rigorous techniques for modeling, specifying, reasoning, and verifying the correctness of systems. These methods have been widely employed in mission-critical domains such as aerospace, defense, and cybersecurity. However, the broader adoption of FMs remains constrained by significant challenges, including steep learning curves, limited scalability, and difficulties in adapting to the dynamic requirements of daily applications. To build trustworthy AI agents, we argue that the integration of LLMs and FMs is necessary to overcome the limitations of both paradigms. LLMs offer adaptability and human-like reasoning but lack formal guarantees of correctness and reliability. FMs provide rigor but need enhanced accessibility and automation to support broader adoption from LLMs.
Discipline
Software Engineering
Research Areas
Software and Cyber-Physical Systems
Areas of Excellence
Digital transformation
Publication
Proceedings of the 42nd International Conference on Machine Learning, Vancouver, Canada, 2025 July 13-19
First Page
1
Last Page
19
City or Country
Canada
Citation
ZHANG, Yedi; CAI, Yufan; ZUO, Xinyue; LUAN, Xiaokun; WANG, Kailong; HOU, Zhe; ZHANG, Yifan; WEI, Zhiyuan; SUN, Meng; Jun SUN; SUN, Jing; and DONG, Jin Song.
Position: Trustworthy AI agents require the integration of large language models and formal methods. (2025). Proceedings of the 42nd International Conference on Machine Learning, Vancouver, Canada, 2025 July 13-19. 1-19.
Available at: https://ink.library.smu.edu.sg/sis_research/10282
Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.