Publication Type
Conference Proceeding Article
Version
publishedVersion
Publication Date
7-2016
Abstract
Word cloud is a visualization form for text that is recognized for its aesthetic, social, and analytical values. Here, we are concerned with deepening its analytical value for visual comparison of documents. To aid comparative analysis of two or more documents, users need to be able to perceive similarities and differences among documents through their word clouds. However, as we are dealing with text, approaches that treat words independently may impede accurate discernment of similarities among word clouds containing different words of related meanings. We therefore motivate the principle of displaying related words in a coherent manner, and propose to realize it through modeling the latent aspects of words. Our WORD FLOCK solution brings together latent variable analysis for embedding and aspect modeling, and calibrated layout algorithm within a synchronized word cloud generation framework. We present the quantitative and qualitative results on real-life text corpora, showcasing how the word clouds are useful in preserving the information content of documents so as to allow more accurate visual comparison of documents
Discipline
Databases and Information Systems | Numerical Analysis and Scientific Computing
Research Areas
Data Science and Engineering
Publication
Proceedings of the 25th International Joint Conference on Artificial Intelligence IJCAI 2016: New York, July 9-15
First Page
2536
Last Page
2543
Publisher
AAAI Press
City or Country
Palo Alto, CA
Citation
LE, Tuan M. V. and LAUW, Hady W..
Word clouds with latent variable analysis for visual comparison of documents. (2016). Proceedings of the 25th International Joint Conference on Artificial Intelligence IJCAI 2016: New York, July 9-15. 2536-2543.
Available at: https://ink.library.smu.edu.sg/sis_research/3357
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
http://www.ijcai.org/Proceedings/16/Papers/361.pdf
Included in
Databases and Information Systems Commons, Numerical Analysis and Scientific Computing Commons