Conference Proceeding Article
Subspace clustering is an unsupervised learning problem that aims at grouping data points into multiple “clusters” so that data points in a single cluster lie approximately on a low-dimensional linear subspace. It is originally motivated by 3D motion segmentation in computer vision, but has recently been generically applied to a wide range of statistical machine learning problems, which often involves sensitive datasets about human subjects. This raises a dire concern for data privacy. In this work, we build on the framework of differential privacy and present two provably private subspace clustering algorithms. We demonstrate via both theory and experiments that one of the presented methods enjoys formal privacy and utility guarantees; the other one asymptotically preserves differential privacy while having good performance in practice. Along the course of the proof, we also obtain two new provable guarantees for the agnostic subspace clustering and the graph connectivity problem which might be of independent interests.
Theory and Algorithms
NIPS'15 Proceedings of the 28th International Conference on Neural Information Processing Systems
WANG, Yining; WANG, Yu-Xiang; and SINGH, Aarti.
Differentially Private Subspace Clustering. (2015). NIPS'15 Proceedings of the 28th International Conference on Neural Information Processing Systems. 1000-1008. Research Collection School Of Information Systems.
Available at: http://ink.library.smu.edu.sg/sis_research/3469
Creative Commons License
This work is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 4.0 License.