TY - GEN
T1 - Efficient and accurate strategies for differentially-private sliding window queries
AU - Cao, Jianneng
AU - Xiao, Qian
AU - Ghinita, Gabriel
AU - Li, Ninghui
AU - Bertino, Elisa
AU - Tan, Kian Lee
PY - 2013
Y1 - 2013
N2 - Regularly releasing aggregate statistics about data streams in a privacy-preserving way not only serves valuable commercial and social purposes, but also protects the privacy of individuals. This problem has already been studied under differential privacy, but only for the case of a single continuous query that covers the entire time span, e.g., counting the number of tuples seen so far in the stream. However, most real-world applications are window-based; that is, they are interested in statistical information about streaming data within a window, rather than over the whole unbounded stream. Furthermore, a Data Stream Management System (DSMS) may need to answer numerous correlated aggregate queries simultaneously, rather than a single one. To cope with these requirements, we study how to release differentially private answers for a set of sliding window aggregate queries. We propose two solutions, each consisting of query sampling and composition. We first selectively sample a subset of representative sliding window queries from the set of all submitted queries. The representative queries are answered by adding Laplace noise in a way that satisfies differential privacy. For each non-representative query, we compose its answer from the query results of the representatives. The experimental evaluation shows that our solutions are efficient and effective.
AB - Regularly releasing aggregate statistics about data streams in a privacy-preserving way not only serves valuable commercial and social purposes, but also protects the privacy of individuals. This problem has already been studied under differential privacy, but only for the case of a single continuous query that covers the entire time span, e.g., counting the number of tuples seen so far in the stream. However, most real-world applications are window-based; that is, they are interested in statistical information about streaming data within a window, rather than over the whole unbounded stream. Furthermore, a Data Stream Management System (DSMS) may need to answer numerous correlated aggregate queries simultaneously, rather than a single one. To cope with these requirements, we study how to release differentially private answers for a set of sliding window aggregate queries. We propose two solutions, each consisting of query sampling and composition. We first selectively sample a subset of representative sliding window queries from the set of all submitted queries. The representative queries are answered by adding Laplace noise in a way that satisfies differential privacy. For each non-representative query, we compose its answer from the query results of the representatives. The experimental evaluation shows that our solutions are efficient and effective.
UR - http://www.scopus.com/inward/record.url?scp=84876786176&partnerID=8YFLogxK
U2 - 10.1145/2452376.2452400
DO - 10.1145/2452376.2452400
M3 - Conference contribution
AN - SCOPUS:84876786176
SN - 9781450315975
T3 - ACM International Conference Proceeding Series
SP - 191
EP - 202
BT - Advances in Database Technology - EDBT 2013
T2 - 16th International Conference on Extending Database Technology, EDBT 2013
Y2 - 18 March 2013 through 22 March 2013
ER -