TY - GEN
T1 - Preserving privacy and fairness in peer-to-peer data integration
AU - Elmeleegy, Hazem
AU - Ouzzani, Mourad
AU - Elmagarmid, Ahmed
AU - Abusalah, Ahmad
PY - 2010
Y1 - 2010
N2 - Peer-to-peer data integration - a.k.a. Peer Data Management Systems (PDMSs) - promises to extend the classical data integration approach to the Internet scale. Unfortunately, some challenges remain before realizing this promise. One of the biggest challenges is preserving the privacy of the exchanged data while passing through several intermediate peers. Another challenge is protecting the mappings used for data translation. Protecting the privacy without being unfair to any of the peers is yet a third challenge. This paper presents a novel query answering protocol in PDMSs to address these challenges. The protocol employs a technique based on noise selection and insertion to protect the query results, and a commutative encryption-based technique to protect the mappings and ensure fairness among peers. An extensive security analysis of the protocol shows that it is resilient to several possible types of attacks. We implemented the protocol within an established PDMS: the Hyperion system. We conducted an experimental study using real data from the healthcare domain. The results show that our protocol manages to achieve its privacy and fairness goals, while maintaining query processing time at the interactive level.
AB - Peer-to-peer data integration - a.k.a. Peer Data Management Systems (PDMSs) - promises to extend the classical data integration approach to the Internet scale. Unfortunately, some challenges remain before realizing this promise. One of the biggest challenges is preserving the privacy of the exchanged data while passing through several intermediate peers. Another challenge is protecting the mappings used for data translation. Protecting the privacy without being unfair to any of the peers is yet a third challenge. This paper presents a novel query answering protocol in PDMSs to address these challenges. The protocol employs a technique based on noise selection and insertion to protect the query results, and a commutative encryption-based technique to protect the mappings and ensure fairness among peers. An extensive security analysis of the protocol shows that it is resilient to several possible types of attacks. We implemented the protocol within an established PDMS: the Hyperion system. We conducted an experimental study using real data from the healthcare domain. The results show that our protocol manages to achieve its privacy and fairness goals, while maintaining query processing time at the interactive level.
KW - fairness
KW - mappings
KW - peer data management systems
KW - peer-to-peer data integration
KW - privacy
UR - http://www.scopus.com/inward/record.url?scp=77954732461&partnerID=8YFLogxK
U2 - 10.1145/1807167.1807249
DO - 10.1145/1807167.1807249
M3 - Conference contribution
AN - SCOPUS:77954732461
SN - 9781450300322
T3 - Proceedings of the ACM SIGMOD International Conference on Management of Data
SP - 759
EP - 770
BT - Proceedings of the 2010 International Conference on Management of Data, SIGMOD '10
T2 - 2010 International Conference on Management of Data, SIGMOD '10
Y2 - 6 June 2010 through 11 June 2010
ER -