Publication Type

Conference Proceeding Article

Version

acceptedVersion

Publication Date

2-2023

Abstract

The word-in-context (WiC) task aims to determine whether a target word's occurrences in two sentences share the same sense. In this paper, we propose a Contrastive Learning WiC (CLWiC) framework to improve the learning of sentence/word representations and the classification of target word senses in a sentence pair when performing WiC on low-resource languages. In representation learning, CLWiC strengthens a pre-trained language model's ability to cope with low-resource languages using both unsupervised and supervised contrastive learning. The WiC classifier learning stage further fine-tunes the language model with a WiC classification loss under two classifier architecture options, SGBERT and WiSBERT, which use a single encoder and a dual encoder, respectively, to encode a WiC task instance. We evaluate models developed under the CLWiC framework on a new WiC dataset constructed for Singlish, a low-resource English creole spoken in Singapore, as well as the standard English WiC benchmark dataset. Our experiments show that CLWiC-based models using both unsupervised and supervised contrastive learning outperform those not using contrastive learning, and this performance gap is more substantial on the Singlish dataset than on the English dataset. Unsupervised contrastive learning appears to improve WiC performance more than supervised contrastive learning. Finally, we show that a joint learning strategy achieves the best WiC performance.
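To make the framework's two ingredients concrete, the sketch below illustrates (1) a single-encoder WiC classifier in the spirit of the SGBERT option, which encodes the sentence pair jointly, and (2) an unsupervised contrastive loss of the SimCSE style that the abstract's representation-learning stage alludes to. This is not the authors' implementation: the class name `SingleEncoderWiC`, the [CLS]-token pooling, the base checkpoint, and the temperature value are all illustrative assumptions.

```python
# Minimal sketch of a single-encoder WiC classifier and a SimCSE-style
# unsupervised contrastive loss. Illustrative only; design choices here
# (pooling, head, hyperparameters) are assumptions, not the paper's code.
import torch
import torch.nn as nn
from transformers import AutoModel, AutoTokenizer

class SingleEncoderWiC(nn.Module):
    def __init__(self, model_name="bert-base-uncased"):
        super().__init__()
        self.encoder = AutoModel.from_pretrained(model_name)
        hidden = self.encoder.config.hidden_size
        self.classifier = nn.Linear(hidden, 2)  # same sense vs. different sense

    def forward(self, input_ids, attention_mask):
        # Single encoder: both sentences are encoded jointly as "s1 [SEP] s2".
        out = self.encoder(input_ids=input_ids, attention_mask=attention_mask)
        cls = out.last_hidden_state[:, 0]  # [CLS] pooling (an assumption)
        return self.classifier(cls)

def unsup_contrastive_loss(z1, z2, temperature=0.05):
    # SimCSE-style InfoNCE: two dropout views of the same sentence
    # (z1[i], z2[i]) are positives; other in-batch pairs are negatives.
    z1 = nn.functional.normalize(z1, dim=-1)
    z2 = nn.functional.normalize(z2, dim=-1)
    sim = z1 @ z2.t() / temperature                     # (batch, batch)
    targets = torch.arange(z1.size(0), device=z1.device)
    return nn.functional.cross_entropy(sim, targets)

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = SingleEncoderWiC()
batch = tokenizer("He sat on the river bank.",
                  "She deposited cash at the bank.",
                  return_tensors="pt")
logits = model(batch["input_ids"], batch["attention_mask"])
print(logits.shape)  # torch.Size([1, 2])
```

A dual-encoder variant in the spirit of WiSBERT would instead encode each sentence separately and classify from the pair of pooled embeddings; the supervised counterpart of the loss above would draw positives from labeled same-sense pairs rather than dropout views.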

Discipline

Databases and Information Systems | Programming Languages and Compilers

Research Areas

Data Science and Engineering

Publication

Proceedings of the 37th AAAI Workshop on Knowledge Augmented Methods for Natural Language Processing, Washington, DC, February 7-14, 2023

First Page

1

Last Page

8

City or Country

Washington, DC
