Publication Type

Conference Proceeding Article

Version

publishedVersion

Publication Date

8-2023

Abstract

Connectivity across documents often exhibits a hierarchical network structure. Hyperbolic Graph Neural Networks (HGNNs) have shown promise in preserving network hierarchy. However, they do not model the notion of topics, thus document representations lack semantic interpretability. On the other hand, a corpus of documents usually has high variability in degrees of topic specificity. For example, some documents contain general content (e.g., sports), while others focus on specific themes (e.g., basketball and swimming). Topic models indeed model latent topics for semantic interpretability, but most assume a flat topic structure and ignore such semantic hierarchy. Given these two challenges, we propose a Hyperbolic Graph Topic Modeling Network to integrate both network hierarchy across linked documents and semantic hierarchy within texts into a unified HGNN framework. Specifically, we construct a two-layer document graph. Intra- and cross-layer encoding captures network hierarchy. We design a topic tree for text decoding to preserve semantic hierarchy and learn interpretable topics. Supervised and unsupervised experiments verify the effectiveness of our model.

Keywords

Hyperbolic graph neural networks, Text mining, Topic modeling

Discipline

Graphics and Human Computer Interfaces | OS and Networks

Research Areas

Data Science and Engineering

Publication

Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Long Beach CA, August 6-10

First Page

3206

Last Page

3216

ISBN

9798400701030

Identifier

10.1145/3580305.3599384

Publisher

ACM

City or Country

New York, USA

Additional URL

https://doi.org/10.1145/3580305.3599384

Share

COinS