Publication Type
Conference Proceeding Article
Version
publishedVersion
Publication Date
11-2018
Abstract
Joint representation learning of words and entities benefits many NLP tasks, but has not been well explored in cross-lingual settings. In this paper, we propose a novel method for joint representation learning of cross-lingual words and entities. It captures mutually complementary knowledge and enables cross-lingual inferences among knowledge bases and texts. Our method does not require parallel corpora, and automatically generates comparable data via distant supervision using multi-lingual knowledge bases. We utilize two types of regularizers to align cross-lingual words and entities, and design knowledge attention and cross-lingual attention to further reduce noise. We conducted a series of experiments on three tasks: word translation, entity relatedness, and cross-lingual entity linking. The results, both qualitatively and quantitatively, demonstrate the significance of our method.
Discipline
Databases and Information Systems | Graphics and Human Computer Interfaces
Research Areas
Data Science and Engineering
Publication
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31 - November 4
First Page
227
Last Page
237
Identifier
10.18653/v1/D18-1021
Publisher
Association for Computational Linguistics
City or Country
Brussels, Belgium
Citation
CAO, Yixin; HOU, Lei; LI, Juanzi; LIU, Zhiyuan; LI, Chengjiang; CHEN, Xu; and DONG, Tiansi.
Joint representation learning of cross-lingual words and entities via attentive distant supervision. (2018). Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31 - November 4. 227-237.
Available at: https://ink.library.smu.edu.sg/sis_research/7465
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Additional URL
http://doi.org/10.18653/v1/D18-1021
Included in
Databases and Information Systems Commons, Graphics and Human Computer Interfaces Commons