Publication Type
Conference Proceeding Article
Version
publishedVersion
Publication Date
11-2021
Abstract
Multi-hop reasoning has been widely studied in recent years to obtain more interpretable link prediction. However, our experiments show that many paths given by these models are actually unreasonable, yet little work has been done on evaluating their interpretability. In this paper, we propose a unified framework to quantitatively evaluate the interpretability of multi-hop reasoning models so as to advance their development. Specifically, we define three evaluation metrics, namely path recall, local interpretability, and global interpretability, and design an approximate strategy to calculate these metrics using the interpretability scores of rules. We manually annotate all possible rules and establish a benchmark. In experiments, we verify the effectiveness of our benchmark. Besides, we run nine representative baselines on our benchmark, and the experimental results show that the interpretability of current multi-hop reasoning models is unsatisfactory, falling 51.7% below the upper bound given by our benchmark. Moreover, rule-based models outperform multi-hop reasoning models in terms of both performance and interpretability, which points to a direction for future research, i.e., how to better incorporate rule information into multi-hop reasoning models. We will publish our code and datasets upon acceptance.
Discipline
Databases and Information Systems
Research Areas
Data Science and Engineering
Publication
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, Virtual Conference, November 7-11
First Page
8899
Last Page
8911
ISBN
9781955917094
Identifier
10.18653/v1/2021.emnlp-main.700
Publisher
Association for Computational Linguistics (ACL)
City or Country
Virtual, Punta Cana
Citation
LV, Xin; CAO, Yixin; HOU, Lei; LI, Juanzi; LIU, Zhiyuan; ZHANG, Yichi; and DAI, Zelin.
Is multi-hop reasoning really explainable? Towards benchmarking reasoning interpretability. (2021). Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, Virtual Conference, November 7-11. 8899-8911.
Available at: https://ink.library.smu.edu.sg/sis_research/7317
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Additional URL
http://doi.org/10.18653/v1/2021.emnlp-main.700