TY - GEN
T1 - Query optimization for differentially private data management systems
AU - Peng, Shangfu
AU - Yang, Yin
AU - Zhang, Zhenjie
AU - Winslett, Marianne
AU - Yu, Yong
PY - 2013
Y1 - 2013
N2 - Differential privacy (DP) enables publishing statistical query results over sensitive data, with rigorous privacy guarantees, and very conservative assumptions about the adversary's background knowledge. This paper focuses on the interactive DP framework, which processes incoming queries on the fly, each of which consumes a portion of the user-specified privacy budget. Existing systems process each query independently, which often leads to considerable privacy budget waste. Motivated by this, we propose Pioneer, a query optimizer for an interactive, DP-compliant DBMS. For each new query, Pioneer creates an execution plan that combines past query results and new results from the underlying data. When a query has multiple semantically equivalent plans, Pioneer automatically selects one with minimal privacy budget consumption. Extensive experiments confirm that Pioneer achieves significant savings of the privacy budget, and can answer many more queries than existing systems for a fixed total budget, with comparable result accuracy.
AB - Differential privacy (DP) enables publishing statistical query results over sensitive data, with rigorous privacy guarantees, and very conservative assumptions about the adversary's background knowledge. This paper focuses on the interactive DP framework, which processes incoming queries on the fly, each of which consumes a portion of the user-specified privacy budget. Existing systems process each query independently, which often leads to considerable privacy budget waste. Motivated by this, we propose Pioneer, a query optimizer for an interactive, DP-compliant DBMS. For each new query, Pioneer creates an execution plan that combines past query results and new results from the underlying data. When a query has multiple semantically equivalent plans, Pioneer automatically selects one with minimal privacy budget consumption. Extensive experiments confirm that Pioneer achieves significant savings of the privacy budget, and can answer many more queries than existing systems for a fixed total budget, with comparable result accuracy.
UR - http://www.scopus.com/inward/record.url?scp=84881342820&partnerID=8YFLogxK
U2 - 10.1109/ICDE.2013.6544900
DO - 10.1109/ICDE.2013.6544900
M3 - Conference contribution
AN - SCOPUS:84881342820
SN - 9781467349086
T3 - Proceedings - International Conference on Data Engineering
SP - 1093
EP - 1104
BT - ICDE 2013 - 29th International Conference on Data Engineering
T2 - 29th International Conference on Data Engineering, ICDE 2013
Y2 - 8 April 2013 through 11 April 2013
ER -