Hybrid differentially-private string matching

Fang Yu Rao, Gabriel Ghinita, Elisa Bertino

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Private record linkage enables multiple parties to exchange records of similar entities without disclosing their own data sets. Several techniques solve private matching for numerical attributes, but they cannot handle well string-typed data. We propose a protocol for differentially-private linkage of records with string-valued attributes. String-valued data are partitioned into groups, and each partition is represented by a synopsis. Strings are first embedded into a metric space which preserves the relative distance among them, and then they are translated to multi-dimensional points. We propose two partitioning algorithms: the first one uses solely Lipschitz embedding coordinates, whereas the second one also leverages string length information. To the best of our knowledge, this is the first efficient and provably secure solution to the private string matching problem. Extensive experimental results on a real dataset demonstrate that our solution is efficient and accurate. We also show that our solution can be adapted to alternative protection models, e.g., output-constrained differential privacy.

Original languageEnglish
Title of host publicationProceedings - 2018 IEEE 38th International Conference on Distributed Computing Systems, ICDCS 2018
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages478-488
Number of pages11
ISBN (Electronic)9781538668719
DOIs
Publication statusPublished - 19 Jul 2018
Externally publishedYes
Event38th IEEE International Conference on Distributed Computing Systems, ICDCS 2018 - Vienna, Austria
Duration: 2 Jul 20185 Jul 2018

Publication series

NameProceedings - International Conference on Distributed Computing Systems
Volume2018-July

Conference

Conference38th IEEE International Conference on Distributed Computing Systems, ICDCS 2018
Country/TerritoryAustria
CityVienna
Period2/07/185/07/18

Keywords

  • Differential Privacy
  • Lipschitz Embedding
  • String Matching

Fingerprint

Dive into the research topics of 'Hybrid differentially-private string matching'. Together they form a unique fingerprint.

Cite this