TY - JOUR
T1 - Differentially-Private Publication of Origin-Destination Matrices with Intermediate Stops
AU - Shaham, Sina
AU - Ghinita, Gabriel
AU - Shahabi, Cyrus
N1 - Publisher Copyright:
© 2022 Copyright held by the owner/author(s).
PY - 2022
Y1 - 2022
N2 - Conventional origin-destination (OD) matrices record the count of trips between pairs of start and end locations, and have been extensively used in transportation, traffic planning, etc. More recently, due to use case scenarios such as COVID-19 pandemic spread modeling, it is increasingly important to also record intermediate points along an individual’s path, rather than only the trip start and end points. This can be achieved by using a multi-dimensional frequency matrix over a data space partitioning at the desired level of granularity. However, serious privacy constraints occur when releasing OD matrix data, and especially when adding multiple intermediate points, which makes individual trajectories more distinguishable to an attacker. To address this threat, we propose a technique for privacy-preserving publication of multi-dimensional OD matrices that achieves differential privacy (DP), the de-facto standard in private data release. We propose a family of approaches that factor in important data properties such as data density and homogeneity in order to build OD matrices that provide provable protection guarantees while preserving query accuracy. Extensive experiments on real and synthetic datasets show that the proposed approaches clearly outperform existing state-of-the-art.
AB - Conventional origin-destination (OD) matrices record the count of trips between pairs of start and end locations, and have been extensively used in transportation, traffic planning, etc. More recently, due to use case scenarios such as COVID-19 pandemic spread modeling, it is increasingly important to also record intermediate points along an individual’s path, rather than only the trip start and end points. This can be achieved by using a multi-dimensional frequency matrix over a data space partitioning at the desired level of granularity. However, serious privacy constraints occur when releasing OD matrix data, and especially when adding multiple intermediate points, which makes individual trajectories more distinguishable to an attacker. To address this threat, we propose a technique for privacy-preserving publication of multi-dimensional OD matrices that achieves differential privacy (DP), the de-facto standard in private data release. We propose a family of approaches that factor in important data properties such as data density and homogeneity in order to build OD matrices that provide provable protection guarantees while preserving query accuracy. Extensive experiments on real and synthetic datasets show that the proposed approaches clearly outperform existing state-of-the-art.
UR - http://www.scopus.com/inward/record.url?scp=105003136613&partnerID=8YFLogxK
U2 - 10.48786/edbt.2022.04
DO - 10.48786/edbt.2022.04
M3 - Conference article
AN - SCOPUS:105003136613
SN - 2367-2005
SP - 131
EP - 142
JO - Advances in Database Technology - EDBT
JF - Advances in Database Technology - EDBT
T2 - 25th International Conference on Extending Database Technology, EDBT 2022
Y2 - 29 March 2022 through 1 April 2022
ER -