Publication Type
Conference Proceeding Article
Version
publishedVersion
Publication Date
2-2020
Abstract
Oftentimes documents are linked to one another in a network structure,e.g., academic papers cite other papers, Web pages link to other pages. In this paper we propose a holistic topic model to learn meaningful and unified low-dimensional representations for networked documents that seek to preserve both textual content and network structure. On the basis of reconstructing not only the input document but also its adjacent neighbors, we develop two neural encoder architectures. Adjacent-Encoder, or AdjEnc, induces competition among documents for topic propagation, and reconstruction among neighbors for semantic capture. Adjacent-Encoder-X, or AdjEnc-X, extends this to also encode the network structure in addition to document content. We evaluate our models on real-world document networks quantitatively and qualitatively, outperforming comparable baselines comprehensively
Keywords
Encoder architecture, Low-dimensional representation, Network structures, Semantic capture, Textual content, Topic Modeling, Artificial intelligence
Discipline
Artificial Intelligence and Robotics | Databases and Information Systems
Research Areas
Data Science and Engineering
Publication
Proceedings of the 34th AAAI Conference on Artificial Intelligence 2020, New York, February 7-12
First Page
6737
Last Page
6745
ISBN
9781577358350
Identifier
10.1609/aaai.v34i04.6152
Publisher
AAAI Press
City or Country
Menlo Park, CA
Embargo Period
5-15-2020
Citation
ZHANG, Ce and LAUW, Hady W..
Topic modeling on document networks with adjacent-encoder. (2020). Proceedings of the 34th AAAI Conference on Artificial Intelligence 2020, New York, February 7-12. 6737-6745.
Available at: https://ink.library.smu.edu.sg/sis_research/5124
Copyright Owner and License
Publisher
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://doi.org/10.1609/aaai.v34i04.6152