Publication Type
Conference Proceeding Article
Version
acceptedVersion
Publication Date
8-2008
Abstract
Data mining tasks in P2P are bound by issues like scalability, peer dynamism, asynchronism, and data privacy preservation. These challenges pose difficulties for deploying conventional machine learning techniques in P2P networks, which may be hard to achieve classification accuracies comparable to regular centralized solutions. We recently investigated the classification problem in P2P networks and proposed a novel P2P classification approach by cascading Reduced Support Vector Machines (RSVM). Although promising results were obtained, the existing solution has some drawback of redundancy in both communication and computation. In this paper, we present a new approach to over the limitation of the previous approach. The new method can effectively reduce the redundancy and thus significantly improve the efficiency of communication and computation, meanwhile it still maintains good classification accuracies comparable to both the centralized solution and the previously proposed P2P solution. Experimental results demonstrate the feasibility and effectiveness of the new P2P classification solution.
Discipline
Computer Sciences | Databases and Information Systems
Research Areas
Data Science and Engineering
Publication
VLDB Workshop on Peer-to-Peer Computing 2008, August 23, Auckland, 23 August: Proceedings
First Page
13
Last Page
25
Publisher
VLDB Endowment
City or Country
Saratoga, CA
Citation
ANG, Hock Hee; GOPALKRISHNAN, Vikvekanand; HOI, Steven C. H.; NG, Wee Keong; and DATTA, Anwitaman.
Classification in P2P Networks by Bagging Cascade RSVMs. (2008). VLDB Workshop on Peer-to-Peer Computing 2008, August 23, Auckland, 23 August: Proceedings. 13-25.
Available at: https://ink.library.smu.edu.sg/sis_research/2406
Copyright Owner and License
Authors
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.