Publication Type
Journal Article
Version
publishedVersion
Publication Date
6-2023
Abstract
At present, the entity and relation joint extraction task has attracted more and more scholars' attention in the field of natural language processing (NLP). However, most of their methods rely on NLP tools to construct dependency trees to obtain sentence structure information. The adjacency matrix constructed by the dependency tree can convey syntactic information. Dependency trees obtained through NLP tools are too dependent on the tools and may not be very accurate in contextual semantic description. At the same time, a large amount of irrelevant information will cause redundancy. This paper presents a novel end-to-end entity and relation joint extraction based on the multi-head attention graph convolutional network model (MAGCN), which does not rely on external tools. MAGCN generates an adjacency matrix through a multi-head attention mechanism to form an attention graph convolutional network model, uses head selection to identify multiple relations, and effectively improve the prediction result of overlapping relations. The authors extensively experiment and prove the method's effectiveness on three public datasets: NYT, WebNLG, and CoNLL04. The results show that the authors' method outperforms the state-of-the-art research results for the task of entities and relation extraction.
Keywords
Information retrieval, Natural language processing
Discipline
Artificial Intelligence and Robotics | Programming Languages and Compilers
Research Areas
Intelligent Systems and Optimization
Areas of Excellence
Digital transformation
Publication
CAAI Transactions on Intelligence Technology
Volume
8
Issue
2
First Page
468
Last Page
477
ISSN
2468-6557
Identifier
10.1049/cit2.12086
Publisher
Wiley
Citation
TAO, Zhihua; OUYANG, Chunping; LIU, Yongbin; CHUNG, Tonglee; and CAO, Yixin.
Multi-head attention graph convolutional network model: End-to-end entity and relation joint extraction based on multi-head attention graph convolutional network. (2023). CAAI Transactions on Intelligence Technology. 8, (2), 468-477.
Available at: https://ink.library.smu.edu.sg/sis_research/9347
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://doi.org/10.1049/cit2.12086
Included in
Artificial Intelligence and Robotics Commons, Programming Languages and Compilers Commons