TY - GEN
T1 - A demo of the data civilizer system
AU - Fernandez, Raul Castro
AU - Deng, Dong
AU - Mansour, Essam
AU - Qahtan, Abdulhakim A.
AU - Tao, Wenbo
AU - Abedjan, Ziawasch
AU - Elmagarmid, Ahmed
AU - Ilyas, Ihab F.
AU - Madden, Samuel
AU - Ouzzani, Mourad
AU - Stonebraker, Michael
AU - Tang, Nan
N1 - Publisher Copyright:
© 2017 ACM.
PY - 2017/5/9
Y1 - 2017/5/9
N2 - Finding relevant data for a specific task from the numerous data sources available in any organization is a daunting task. This is not only because of the number of possible data sources where the data of interest resides, but also due to the data being scattered all over the enterprise and being typically dirty and inconsistent. In practice, data scientists are routinely reporting that the majority (more than 80%) of their effort is spent finding, cleaning, integrating, and accessing data of interest to a task at hand. We propose to demonstrate Data Civilizer to ease the pain faced in analyzing data "in the wild". Data Civilizer is an end-to-end big data management system with components for data discovery, data integration and stitching, data cleaning, and querying data from a large variety of storage engines, running in large enterprises.
AB - Finding relevant data for a specific task from the numerous data sources available in any organization is a daunting task. This is not only because of the number of possible data sources where the data of interest resides, but also due to the data being scattered all over the enterprise and being typically dirty and inconsistent. In practice, data scientists are routinely reporting that the majority (more than 80%) of their effort is spent finding, cleaning, integrating, and accessing data of interest to a task at hand. We propose to demonstrate Data Civilizer to ease the pain faced in analyzing data "in the wild". Data Civilizer is an end-to-end big data management system with components for data discovery, data integration and stitching, data cleaning, and querying data from a large variety of storage engines, running in large enterprises.
UR - http://www.scopus.com/inward/record.url?scp=85021185176&partnerID=8YFLogxK
U2 - 10.1145/3035918.3058740
DO - 10.1145/3035918.3058740
M3 - Conference contribution
AN - SCOPUS:85021185176
T3 - Proceedings of the ACM SIGMOD International Conference on Management of Data
SP - 1639
EP - 1642
BT - SIGMOD 2017 - Proceedings of the 2017 ACM International Conference on Management of Data
PB - Association for Computing Machinery
T2 - 2017 ACM SIGMOD International Conference on Management of Data, SIGMOD 2017
Y2 - 14 May 2017 through 19 May 2017
ER -