Research Collection School Of Computing and Information Systems

Top-K Aggregation Queries over Large Networks

Publication Type

Conference Paper

Version

submittedVersion

Publication Date

3-2010

Abstract

Searching and mining large graphs today is critical to a variety of application domains, ranging from personalized recommendation in social networks, to searches for functional associations in biological pathways. In these domains, there is a need to perform aggregation operations on large-scale networks. Unfortunately the existing implementation of aggregation operations on relational databases does not guarantee superior performance in network space, especially when it involves edge traversals and joins of gigantic tables. In this paper, we investigate the neighborhood aggregation queries: Find nodes that have top-k highest aggregate values over their h-hop neighbors. While these basic queries are common in a wide range of search and recommendation tasks, surprisingly they have not been studied systematically. We developed a Local Neighborhood Aggregation framework, called LONA, to answer them efficiently. LONA exploits two properties unique in network space: First, the aggregate value for the neighboring nodes should be similar in most cases; Second, given the distribution of attribute values, it is possible to estimate the upper-bound value of aggregates. These two properties inspire the development of novel pruning techniques, forward pruning using differential index and backward pruning using partial distribution. Empirical results show that LONA could outperform the baseline algorithm up to 10 times in real-life large networks.

Discipline

Databases and Information Systems | Numerical Analysis and Scientific Computing

Publication

26th International Conference on Data Engineering Conference (ICDE 2010), March 1-6, 2010, Long Beach: Proceedings

ISBN

9781424454440

Identifier

10.1109/ICDE.2010.5447863

Publisher

IEEE

City or Country

Long Beach, CA

Citation

Yan, Xifeng; He, Bin; ZHU, Feida; and Han, Jiawei. Top-K Aggregation Queries over Large Networks. (2010). 26th International Conference on Data Engineering Conference (ICDE 2010), March 1-6, 2010, Long Beach: Proceedings.
Available at: https://ink.library.smu.edu.sg/sis_research/508

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.

Additional URL

http://doi.ieeecomputersociety.org/10.1109/ICDE.2010.5447863

Download

Included in

Databases and Information Systems Commons, Numerical Analysis and Scientific Computing Commons

COinS

Research Collection School Of Computing and Information Systems

Top-K Aggregation Queries over Large Networks

Publication Type

Version

Publication Date

Abstract

Discipline

Publication

ISBN

Identifier

Publisher

City or Country

Citation

Creative Commons License

Additional URL

Included in

Search

Links

Browse

Links

Research Collection School Of Computing and Information Systems

Top-K Aggregation Queries over Large Networks

Author

Publication Type

Version

Publication Date

Abstract

Discipline

Publication

ISBN

Identifier

Publisher

City or Country

Citation

Creative Commons License

Additional URL

Included in

Share

Search

Links

Browse

Links