Publication Type
Conference Proceeding Article
Version
acceptedVersion
Publication Date
9-2021
Abstract
Semantic interpretability and visual expressivity are important objectives in exploratory analysis of text. On the one hand, while some documents may have explicit categories, we could develop a better understanding of a corpus by studying its finer-grained structures, which may be latent. By inferring latent topics and discovering keywords associated with each topic, one obtains a semantic interpretation of the corpus. One the other hand, by visualizing documents, latent topics, and category labels on the same plot, one gains a bird’s eye view of the relationships among documents, topics, and various categories. Semantic visualization is a class of methods that unify both topic modeling and visualization. In this paper, we propose a novel semantic visualization model for networked documents that incorporates partial labels. We introduce coordinate-based label distribution and label-dependent topic distribution to visualize documents, topics, and labels in a semi-supervised way. We further derive three variants for singly-labeled, multi-labeled, and hierarchically-labeled documents. The focus on semi-supervision that employs variants of labeling structures is particularly novel. Experiments verify the efficacy of our model against baselines.
Keywords
Dimensionality reduction, Generative models, Semantic visualization, Topic modeling
Discipline
Databases and Information Systems | Graphics and Human Computer Interfaces
Research Areas
Data Science and Engineering
Publication
Machine Learning and Knowledge Discovery in Databases. Research Track: European Conference, ECML PKDD 2021, Bilbao, Spain, September 13-17: Proceedings
Volume
12977
First Page
762
Last Page
778
ISBN
9783030865238
Identifier
10.1007/978-3-030-86523-8_46
Publisher
Springer
City or Country
Cham
Embargo Period
12-13-2021
Citation
ZHANG, Delvin Ce and LAUW, Hady W..
Semi-supervised semantic visualization for networked documents. (2021). Machine Learning and Knowledge Discovery in Databases. Research Track: European Conference, ECML PKDD 2021, Bilbao, Spain, September 13-17: Proceedings. 12977, 762-778.
Available at: https://ink.library.smu.edu.sg/sis_research/6428
Copyright Owner and License
Authors
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://doi.org/10.1007/978-3-030-86523-8_46
Included in
Databases and Information Systems Commons, Graphics and Human Computer Interfaces Commons