Publication Type
Conference Proceeding Article
Version
publishedVersion
Publication Date
6-2025
Abstract
Community detection in heterogeneous information networks (HINs) poses significant challenges due to the diversity of entity types and the complexity of their interrelations. While traditional algorithms may perform adequately in some scenarios, many struggle with the high memory usage and computational demands of large-scale HINs. To address these challenges, we introduce a novel framework, SCAR, which efficiently uncovers community structures in HINs without requiring network materialization. SCAR leverages insights from meta-paths to interpret multi-relational data through compact vertex-based sketches, significantly reducing computational overhead and materialization overhead. We propose a sketch-based technique for estimating changes in modularity, improving both the precision and speed in community detection. Our extensive evaluations on diverse real-world datasets provide detailed comparative metrics, demonstrating that SCAR outperforms several state-of-the-art methods, including Gdy, Louvain, Leiden, Infomap, Walktrap, and Networkit, in execution time and memory consumption while maintaining competitive accuracy. Overall, SCAR offers a robust and scalable solution for revealing community structures in large HINs, with applications across various domains, including social networks, academic collaboration networks, and e-commerce platforms.
Keywords
Graph Algorithms, Community Detection
Discipline
Graphics and Human Computer Interfaces | OS and Networks | Theory and Algorithms
Research Areas
Intelligent Systems and Optimization
Areas of Excellence
Digital transformation
Publication
Proceedings of the ACM on Management of Data, Volume 3, Issue 3, Berlin, Germany, June 22-27
First Page
1
Last Page
27
Identifier
10.1145/3725276
Publisher
ACM
City or Country
New York
Citation
JIANG, Jiaxin; YAO, Siyuan; CHEN, Yuhang; HE, Bingsheng; NIU, Yudong; LI, Yuchen; SUN, Shixuan; and LIU, Yongchao.
Community detection in heterogeneous information networks without materialization. (2025). Proceedings of the ACM on Management of Data, Volume 3, Issue 3, Berlin, Germany, June 22-27. 1-27.
Available at: https://ink.library.smu.edu.sg/sis_research/10406
Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://doi.org/10.1145/3725276
Included in
Graphics and Human Computer Interfaces Commons, OS and Networks Commons, Theory and Algorithms Commons