Differentially private query learning: From data publishing to model publishing

Publication Type:
Conference Proceeding
Citation:
Proceedings - 2017 IEEE International Conference on Big Data, Big Data 2017, 2017, 2018-January pp. 1117 - 1122
Issue Date:
2017-07-01
Filename Description Size
08258037.pdfPublished version282.27 kB
Adobe PDF
Full metadata record
© 2017 IEEE. As one of the most influential privacy definitions, differential privacy provides a rigorous and provable privacy guarantee for data publishing. However, the curator has to release a large number of queries in a batch or a synthetic dataset in the Big Data era. Two challenges need to be tackled: one is how to decrease the correlation between large sets of queries, while the other is how to predict on fresh queries. This paper transfers the data publishing problem to a machine learning problem, in which queries are considered as training samples and a prediction model will be released rather than query results or synthetic datasets. When the model is published, it can be used to answer current submitted queries and predict results for fresh queries from the public. Compared with the traditional method, the proposed prediction model enhances the accuracy of query results for non-interactive publishing. We prove that learning model can successfully retain the utility of published queries while preserving privacy.
Please use this identifier to cite or link to this item: