The data analytics group at the Qatar Computing Research Institute

George Beskales*, Ihab F. Ilyas, Paolo Papotti, Gautam Das, Felix Naumann, Jorge Quiane-Ruiz, Ahmed K. Elmagarmid, Mourad Ouzzani, Nan Tang

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

1 Citation (Scopus)

Abstract

The Qatar Computing Research Institute (QCRI), a member of Qatar Foundation for Education, Science and Community Development, started its activities in early 2011. QCRI is focusing on tackling large-scale computing challenges that address national priorities for growth and development and that have global impact in computing research. DA@QCRI has built expertise focusing on three core data management challenges: extracting data from its natural digital habitat, integrating a large and evolving number of sources, and robust cleaning to assure data quality and validation. Cleaning data requires collecting and maintaining a massive amount of metadata, such as data violations, lineage of data changes, and possible data repairs. In addition, users need to understand better the current health of the data and the data cleaning process through summarization or samples of data errors before they can effectively guide any data cleaning process. Providing a scalable data cleaning solution requires efficient methods to generate, maintain, and access such metadata.

Original languageEnglish
Pages (from-to)33-38
Number of pages6
JournalSIGMOD Record
Volume41
Issue number4
DOIs
Publication statusPublished - Jan 2013

Fingerprint

Dive into the research topics of 'The data analytics group at the Qatar Computing Research Institute'. Together they form a unique fingerprint.

Cite this