Publication Type
Conference Proceeding Article
Version
acceptedVersion
Publication Date
5-2024
Abstract
Question-answering (QA) retrieval is the task of retrieving the most relevant answer to a given question from a collection of answers. Various approaches to QA retrieval have been developed recently. One successful and popular model is Contextualized Late Interaction over BERT (ColBERT), a transformer-based approach that adopts a query-document scoring mechanism that retains the granularity of transformer matching, whilst improving on efficiency. However, one key limitation is that it requires further fine-tuning for new query or collection types. In this work, we explore and propose several non-parametric retrieval augmentation methods based on explicit signals of term importance that improve over ColBERT's baseline performance. In particular, we consider the QA retrieval task in the context of StackExchange question-answering forum, verifying the effectiveness of our methods in this setting.
Keywords
Information retrieval, Retrieval models, Retrieval ranking, Question-answering, Neural information retrieval, Term importance, Weighted late interaction
Discipline
Artificial Intelligence and Robotics | Numerical Analysis and Scientific Computing
Research Areas
Data Science and Engineering
Publication
Proceedings of the ACM Web Conference 2024 (WWW 2024) : Singapore, May 13-17
First Page
601
Last Page
604
Identifier
10.1145/3589335.3651568
Publisher
Association for Computing Machinery
City or Country
Singapore
Citation
TAN, Bryan Zhi Yang and LAUW, Hady W..
Term importance for transformer-based QA retrieval : A case study of StackExchange. (2024). Proceedings of the ACM Web Conference 2024 (WWW 2024) : Singapore, May 13-17. 601-604.
Available at: https://ink.library.smu.edu.sg/sis_research/9855
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://doi.org/10.1145/3589335.3651568
Included in
Artificial Intelligence and Robotics Commons, Numerical Analysis and Scientific Computing Commons
Comments
PDF provided by faculty