Publication Type
Journal Article
Version
publishedVersion
Publication Date
1-2023
Abstract
Contextual path retrieval (CPR) refers to the task of finding contextual path(s) between a pair of entities in a knowledge graph that explains the connection between them in a given context. For this novel retrieval task, we propose the Embedding-based Contextual Path Retrieval (ECPR) framework. ECPR is based on a three-component structure that includes a context encoder and path encoder that encode query context and path, respectively, and a path ranker that assigns a ranking score to each candidate path to determine the one that should be the contextual path. For context encoding, we propose two novel context encoding methods, i.e., context-fused entity embeddings and contextualized embeddings. For path encoding, we propose PathVAE, an inductive embedding approach to generate path representations. Finally, we explore two path-ranking approaches. In our evaluation, we construct a synthetic dataset from Wikipedia and two real datasets of Wikinews articles constructed through crowdsourcing. Our experiments show that methods based on ECPR framework outperform baseline methods, and that our two proposed context encoders yield significantly better performance than baselines. We also analyze a few case studies to show the distinct features of ECPR-based methods.
Keywords
Knowledge base, embedding learning, information retrieval, reasoning
Discipline
Databases and Information Systems
Research Areas
Data Science and Engineering
Publication
ACM Transactions on Information Systems
Volume
41
Issue
1
First Page
1
Last Page
38
ISSN
1046-8188
Identifier
10.1145/3502720
Publisher
Association for Computing Machinery (ACM)
Citation
LO, Pei-chi and LIM, Ee-peng.
Contextual Path Retrieval: A contextual entity relation embedding-based approach. (2023). ACM Transactions on Information Systems. 41, (1), 1-38.
Available at: https://ink.library.smu.edu.sg/sis_research/7780
Copyright Owner and License
Publisher
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://doi.org/10.1145/3502720