Publication Type
Conference Proceeding Article
Version
publishedVersion
Publication Date
9-2009
Abstract
Distributed classification aims to learn with accuracy comparable to that of centralized approaches but at far lesser communication and computation costs. By nature, P2P networks provide an excellent environment for performing a distributed classification task due to the high availability of shared resources, such as bandwidth, storage space, and rich computational power. However, learning in P2P networks is faced with many challenging issues; viz., scalability, peer dynamism, asynchronism and fault-tolerance. In this paper, we address these challenges by presenting CEMPaR—a communication-efficient framework based on cascading SVMs that exploits the characteristics of DHT-based lookup protocols. CEMPaR is designed to be robust to parameters such as the number of peers in the network, imbalanced data sizes and class distribution while incurring extremely low communication cost yet maintaining accuracy comparable to the best-in-the-class approaches. Feasibility and effectiveness of our approach are demonstrated with extensive experimental studies on real and synthetic datasets.
Discipline
Computer Sciences | Databases and Information Systems | OS and Networks
Publication
Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2009, Bled, Slovenia, September 7-11, 2009, Proceedings, Part I
Volume
5781
First Page
83
Last Page
98
ISBN
9783642041792
Identifier
10.1007/978-3-642-04180-8_23
Publisher
Springer Verlag
City or Country
Berlin
Citation
ANG, Hock Hee; Gopalkrishnan, Vivekanand; NG, Wee Keong; and HOI, Steven C. H..
Communication-efficient Classification in P2P Networks. (2009). Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2009, Bled, Slovenia, September 7-11, 2009, Proceedings, Part I. 5781, 83-98.
Available at: https://ink.library.smu.edu.sg/sis_research/2374
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
http://dx.doi.org/10.1007/978-3-642-04180-8_23