DATA CLEANING

Ihab Francis Ilyas Kaldas (Inventor), George Beskales (Inventor), Ahmed Elmagarmid (Inventor)

Research output: Patent

Abstract

A computer-implemented method comprising partitioning data representing an input instance of a database including multiple tuples into multiple fragments of tuples, detecting tuples which violate a data quality specification in respective ones of the fragments, selecting a data cleaning asset on the basis of characteristics of errors in detected tuples for a fragment and based on declared asset capabilities, assigning a selected data cleaning asset to the fragment, the selected data cleaning asset to provide a set of candidate corrections for the detected tuples in the fragment, providing data representing an output instance of the database in which detected tuples are replaced with selected candidate corrections.

Original languageEnglish
Patent numberUS2013275393
IPCG06F 17/ 30 A I
Priority date12/04/12
Publication statusPublished - 17 Oct 2013

Fingerprint

Dive into the research topics of 'DATA CLEANING'. Together they form a unique fingerprint.

Cite this