TY - GEN
T1 - FAHES
T2 - 34th IEEE International Conference on Data Engineering, ICDE 2018
AU - Qahtan, Abdulhakim
AU - Elmagarmid, Ahmed
AU - Ouzzani, Mourad
AU - Tang, Nan
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2018/10/24
Y1 - 2018/10/24
N2 - It is well established that missing values, if not dealt with properly, may lead to poor data analytics models, misleading conclusions, and limitation in the generalization of findings. A key challenge in detecting these missing values is when they manifest themselves in a form that is otherwise valid, making it hard to distinguish them from other legitimate values. We propose to demonstrate FAHES, a system for detecting different types of disguised missing values (DMVs) which often occur in real world data. FAHES consists of several components, namely a profiler to generate rules for detecting repeated patterns, an outlier detection module, and a module to detect values that are used repeatedly in random records. Using several real world datasets, we will demonstrate how FAHES can easily catch DMVs.
AB - It is well established that missing values, if not dealt with properly, may lead to poor data analytics models, misleading conclusions, and limitation in the generalization of findings. A key challenge in detecting these missing values is when they manifest themselves in a form that is otherwise valid, making it hard to distinguish them from other legitimate values. We propose to demonstrate FAHES, a system for detecting different types of disguised missing values (DMVs) which often occur in real world data. FAHES consists of several components, namely a profiler to generate rules for detecting repeated patterns, an outlier detection module, and a module to detect values that are used repeatedly in random records. Using several real world datasets, we will demonstrate how FAHES can easily catch DMVs.
KW - Disguised missing values
UR - http://www.scopus.com/inward/record.url?scp=85051518156&partnerID=8YFLogxK
U2 - 10.1109/ICDE.2018.00188
DO - 10.1109/ICDE.2018.00188
M3 - Conference contribution
AN - SCOPUS:85051518156
T3 - Proceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018
SP - 1609
EP - 1612
BT - Proceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 16 April 2018 through 19 April 2018
ER -