NADEEF/ER: Generic and interactive entity resolution

Ahmed Elmagarmid, Ihab F. Ilyas, Mourad Ouzzani, Jorge Arnulfo Quiané-Ruiz, Nan Tang, Si Yin

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

26 Citations (Scopus)

Abstract

Entity resolution (ER), the process of identifying and eventually merging records that refer to the same real-world entities, is an important and long-standing problem. We present Nadeef/Er, a generic and interactive entity resolution system, which is built as an extension over our open-source generalized data cleaning system Nadeef. Nadeef/Er provides a rich programming interface for manipulating entities, which allows generic, efficient and extensible ER. In this demo, users will have the opportunity to experience the following features: (1) Easy specification - Users can easily define ER rules with a browser-based specification, which will then be automatically transformed to various functions, treated as black-boxes by Nadeef; (2) Generality and extensibility - Users can customize their ER rules by refining and fine-tuning the above functions to achieve both effective and efficient ER solutions; (3) Interactivity - We also extended the existing Nadeef dashboard with summarization and clustering techniques to facilitate understanding problems faced by the ER process as well as to allow users to influence resolution decisions.

Original languageEnglish
Title of host publicationSIGMOD 2014 - Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data
PublisherAssociation for Computing Machinery
Pages1071-1074
Number of pages4
ISBN (Print)9781450323765
DOIs
Publication statusPublished - 2014
Event2014 ACM SIGMOD International Conference on Management of Data, SIGMOD 2014 - Snowbird, UT, United States
Duration: 22 Jun 201427 Jun 2014

Publication series

NameProceedings of the ACM SIGMOD International Conference on Management of Data
ISSN (Print)0730-8078

Conference

Conference2014 ACM SIGMOD International Conference on Management of Data, SIGMOD 2014
Country/TerritoryUnited States
CitySnowbird, UT
Period22/06/1427/06/14

Keywords

  • Entity resolution
  • Generic
  • Interactive
  • NADEEF

Fingerprint

Dive into the research topics of 'NADEEF/ER: Generic and interactive entity resolution'. Together they form a unique fingerprint.

Cite this