TY - GEN
T1 - Privacy preserving schema and data matching
AU - Scannapieco, Monica
AU - Figotin, Ilya
AU - Bertino, Elisa
AU - Elmagarmid, Ahmed K.
PY - 2007
Y1 - 2007
N2 - In many business scenarios, record matching is performed across different data sources with the aim of identifying common information shared among these sources. However such need is often in contrast with privacy requirements concerning the data stored by the sources. In this paper, we propose a protocol for record matching that preserves privacy both at the data level and at the schema level. Specifically, if two sources need to identify their common data, by running the protocol they can compute the matching of their datasets without sharing their data in clear and only sharing the result of the matching. The protocol uses a third party, and maps records into a vector space in order to preserve their privacy. Experimental results show the efficiency of the matching protocol in terms of precision and recall as well as the good computational performance.
AB - In many business scenarios, record matching is performed across different data sources with the aim of identifying common information shared among these sources. However such need is often in contrast with privacy requirements concerning the data stored by the sources. In this paper, we propose a protocol for record matching that preserves privacy both at the data level and at the schema level. Specifically, if two sources need to identify their common data, by running the protocol they can compute the matching of their datasets without sharing their data in clear and only sharing the result of the matching. The protocol uses a third party, and maps records into a vector space in order to preserve their privacy. Experimental results show the efficiency of the matching protocol in terms of precision and recall as well as the good computational performance.
KW - Privacy
KW - Record matching
UR - http://www.scopus.com/inward/record.url?scp=35448932873&partnerID=8YFLogxK
U2 - 10.1145/1247480.1247553
DO - 10.1145/1247480.1247553
M3 - Conference contribution
AN - SCOPUS:35448932873
SN - 1595936866
SN - 9781595936868
T3 - Proceedings of the ACM SIGMOD International Conference on Management of Data
SP - 653
EP - 664
BT - SIGMOD 2007
T2 - SIGMOD 2007: ACM SIGMOD International Conference on Management of Data
Y2 - 12 June 2007 through 14 June 2007
ER -