Publication Type
Conference Proceeding Article
Version
publishedVersion
Publication Date
8-2021
Abstract
We present InferWiki, a Knowledge Graph Completion (KGC) dataset that improves upon existing benchmarks in inferential ability, assumptions, and patterns. First, each testing sample is predictable with supportive data in the training set. To ensure it, we propose to utilize rule-guided train/test generation, instead of conventional random split. Second, InferWiki initiates the evaluation following the open-world assumption and improves the inferential difficulty of the closed-world assumption, by providing manually annotated negative and unknown triples. Third, we include various inference patterns (e.g., reasoning path length and types) for comprehensive evaluation. In experiments, we curate two settings of InferWiki varying in sizes and structures, and apply the construction process on CoDEx as comparative datasets. The results and empirical analyses demonstrate the necessity and high-quality of InferWiki. Nevertheless, the performance gap among various inferential assumptions and patterns presents the difficulty and inspires future research direction. Our datasets can be found in https://github. com/TaoMiner/inferwiki.
Discipline
Databases and Information Systems | Graphics and Human Computer Interfaces
Research Areas
Data Science and Engineering
Publication
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, Virtual Conference, 2021 August 1-6
First Page
6855
Last Page
6865
ISBN
9781954085527
Publisher
Association for Computational Linguistics (ACL)
City or Country
Virtual Conference
Citation
CAO, Yixin; JI, Xiang; LV, Xin; LI, Juanzi; WEN, Yonggang; and ZHANG, Hanwang.
Are missing links predictable? An inferential benchmark for knowledge graph completion. (2021). Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, Virtual Conference, 2021 August 1-6. 6855-6865.
Available at: https://ink.library.smu.edu.sg/sis_research/7316
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://aclanthology.org/2021.acl-long.534.pdf
Included in
Databases and Information Systems Commons, Graphics and Human Computer Interfaces Commons