Publication Type

Conference Proceeding Article

Publication Date

3-2015

Abstract

Non-independent reasoning (NIR) allows the information about one record in the data to be learnt from the information of other records in the data. Most posterior/prior based privacy criteria consider NIR as a privacy violation and require to smooth the distribution of published data to avoid sensitive NIR. The drawback of this approach is that it limits the utility of learning statistical relationships. The differential privacy criterion considers NIR as a non-privacy violation, therefore, enables learning statistical relationships, but at the cost of potential disclosures through NIR. A question is whether it is possible to (1) allow learning statistical relationships, yet (2) prevent sensitive NIR about an individual. We present a data perturbation and sampling method to achieve both (1) and (2). The enabling mechanism is a new privacy criterion that distinguishes the two types of NIR in (1) and (2) with the help of the law of large numbers. In particular, the record sampling effectively prevents the sensitive disclosure in (2) while having less effect on the statistical learning in (1).

Keywords

Data privacy, Differential privacy

Discipline

Databases and Information Systems | Information Security | Theory and Algorithms

Research Areas

Data Management and Analytics

Publication

Proceedings of the 18th International Conference on Extending Database Technology (EDBT): March 23-27, 2015, Brussels, Belgium

First Page

469

Last Page

480

ISBN

9783893180677

Identifier

10.5441/002/edbt.2015.41

Publisher

OpenProceedings

City or Country

Brussels

Creative Commons License

Creative Commons Attribution-Noncommercial-No Derivative Works 4.0 License
This work is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 4.0 License.

Additional URL

http://doi.org/10.5441/002/edbt.2015.41