Publication Type

Conference Proceeding Article

Version

acceptedVersion

Publication Date

12-2023

Abstract

Large language models (LLMs) have demonstrated great potential for domain-specific applications, such as the law domain. However, recent disputes over GPT-4’s law evaluation raise questions concerning their performance in real-world legal tasks. To systematically investigate their competency in the law, we design practical baseline solutions based on LLMs and test on the task of legal judgment prediction. In our solutions, LLMs can work alone to answer open questions or coordinate with an information retrieval (IR) system to learn from similar cases or solve simplified multi-choice questions. We show that similar cases and multi-choice options, namely label candidates, included in prompts can help LLMs recall domain knowledge that is critical for expertise legal reasoning. We additionally present an intriguing paradox wherein an IR system surpasses the performance of LLM+IR due to limited gains acquired by weaker LLMs from powerful IR systems. In such cases, the role of LLMs becomes redundant. Our evaluation pipeline can be easily extended into other tasks to facilitate evaluations in other domains. Code is available at https://github.com/srhthu/ LM-CompEval-Legal

Discipline

Databases and Information Systems | Programming Languages and Compilers

Research Areas

Data Science and Engineering

Publication

Findings of the Association for Computational Linguistics: EMNLP 2023, Singapore, December 6-10

First Page

7337

Last Page

7348

Identifier

10.18653/v1/2023.findings-emnlp.490

Publisher

Association for Computational Linguistics

City or Country

Singapore

Citation

SHUI, Ruihao; CAO, Yixin; WANG, Xiang; and CHUA, Tat-Seng. A comprehensive evaluation of large language models on legal judgment prediction. (2023). Findings of the Association for Computational Linguistics: EMNLP 2023, Singapore, December 6-10. 7337-7348.
Available at: https://ink.library.smu.edu.sg/sis_research/8396

Copyright Owner and License

Authors

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.

Additional URL

https://doi.org/10.18653/v1/2023.findings-emnlp.490

Download

Included in

Databases and Information Systems Commons, Programming Languages and Compilers Commons

COinS

Research Collection School Of Computing and Information Systems

A comprehensive evaluation of large language models on legal judgment prediction

Publication Type

Version

Publication Date

Abstract

Discipline

Research Areas

Publication

First Page

Last Page

Identifier

Publisher

City or Country

Citation

Copyright Owner and License

Creative Commons License

Additional URL

Included in

Search

Links

Browse

Links

Research Collection School Of Computing and Information Systems

A comprehensive evaluation of large language models on legal judgment prediction

Author

Publication Type

Version

Publication Date

Abstract

Discipline

Research Areas

Publication

First Page

Last Page

Identifier

Publisher

City or Country

Citation

Copyright Owner and License

Creative Commons License

Additional URL

Included in

Share

Search

Links

Browse

Links