Research Collection School Of Computing and Information Systems

Don’t just say “I don’t know”! Self-aligning Large Language Models for responding to unknown questions with explanations

Yang DENG, Singapore Management UniversityFollow
Yong ZHAO
Moxin LI
See-Kiong NG
Tat-Seng CHUA

Publication Type

Conference Proceeding Article

Version

acceptedVersion

Publication Date

11-2024

Abstract

Despite the remarkable abilities of Large Language Models (LLMs) to answer questions, they often display a considerable level of overconfidence even when the question does not have a definitive answer. To avoid providing hallucinated answers to these unknown questions, existing studies typically investigate approaches to refusing to answer these questions. In this work, we propose a novel and scalable self-alignment method to utilize the LLM itself to enhance its response-ability to different types of unknown questions, being capable of not only refusing to answer but also providing explanation to the unanswerability of unknown questions. Specifically, the Self-Align method first employ a two-stage class-aware self-augmentation approach to generate a large amount of unknown question-response data. Then we conduct disparity-driven self-curation to select qualified data for fine-tuning the LLM itself for aligning the responses to unknown questions as desired. Experimental results on two datasets across four types of unknown questions validate the superiority of the Self-Align method over existing baselines in terms of three types of task formulation.

Keywords

Large Language Models, LLMs, Unknown question response, Self-Align method

Discipline

Artificial Intelligence and Robotics | Computer Sciences

Research Areas

Data Science and Engineering; Intelligent Systems and Optimization

Areas of Excellence

Digital transformation

Publication

Proceedings of the 19th Conference on Empirical Methods in Natural Language Processing (EMNLP 2024) : Miami, Florida, USA, November 12-16

First Page

13652

Last Page

13673

Publisher

Association for Computational Linguistics

City or Country

USA

Citation

DENG, Yang; ZHAO, Yong; LI, Moxin; NG, See-Kiong; and CHUA, Tat-Seng. Don’t just say “I don’t know”! Self-aligning Large Language Models for responding to unknown questions with explanations. (2024). Proceedings of the 19th Conference on Empirical Methods in Natural Language Processing (EMNLP 2024) : Miami, Florida, USA, November 12-16. 13652-13673.
Available at: https://ink.library.smu.edu.sg/sis_research/9614

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.

Download

Included in

Artificial Intelligence and Robotics Commons

COinS

Research Collection School Of Computing and Information Systems

Don’t just say “I don’t know”! Self-aligning Large Language Models for responding to unknown questions with explanations

Publication Type

Version

Publication Date

Abstract

Keywords

Discipline

Research Areas

Areas of Excellence

Publication

First Page

Last Page

Publisher

City or Country

Citation

Creative Commons License

Included in

Search

Links

Browse

Links

Research Collection School Of Computing and Information Systems

Don’t just say “I don’t know”! Self-aligning Large Language Models for responding to unknown questions with explanations

Author

Publication Type

Version

Publication Date

Abstract

Keywords

Discipline

Research Areas

Areas of Excellence

Publication

First Page

Last Page

Publisher

City or Country

Citation

Creative Commons License

Included in

Share

Search

Links

Browse

Links