TY - GEN
T1 - Are They Likely to Complain on Phish or Spam? A Prediction Model
AU - Al-Hussaini, Sarah
AU - Al-Thani, Dena
AU - Yang, Yin
N1 - Publisher Copyright:
© 2020 IEEE.
PY - 2020/11/5
Y1 - 2020/11/5
N2 - Customers are the core of businesses. Specifically, telecommunication companies, customer satisfaction is considered to be a commercial necessity and therefore a priority. High rates of customer satisfaction increase both retention and attraction rates. As a result, telecommunication companies are always seeking new means to achieve these objectives. A large volume of calls is received in a typical call center from customers complaining about phishing or spam attacks daily. It is difficult to identify the purpose of the call manually. In this work, we expand on previous efforts to focus more on impacted phone spam or phish consumers. The study focuses on both mediums of communication, phone calls and messages. A historical sample of customers' complaints dataset was used, and several machine learning classification algorithms were applied to analyze the calls. These are Logistic Regression, XGBoost, Gradient Boosting, Random Forest, CatBoost, KNN, and SVM. The predictive model can identify whether an individual is likely to complain about a spam or phish attack. The performance of the baseline classifier achieves an accuracy of 63.4 % that is based on CatBoost. Moreover, the model identifies consumers' demographics. The findings show that people of age 45 are more likely to complain and that males are less likely to complain.
AB - Customers are the core of businesses. Specifically, telecommunication companies, customer satisfaction is considered to be a commercial necessity and therefore a priority. High rates of customer satisfaction increase both retention and attraction rates. As a result, telecommunication companies are always seeking new means to achieve these objectives. A large volume of calls is received in a typical call center from customers complaining about phishing or spam attacks daily. It is difficult to identify the purpose of the call manually. In this work, we expand on previous efforts to focus more on impacted phone spam or phish consumers. The study focuses on both mediums of communication, phone calls and messages. A historical sample of customers' complaints dataset was used, and several machine learning classification algorithms were applied to analyze the calls. These are Logistic Regression, XGBoost, Gradient Boosting, Random Forest, CatBoost, KNN, and SVM. The predictive model can identify whether an individual is likely to complain about a spam or phish attack. The performance of the baseline classifier achieves an accuracy of 63.4 % that is based on CatBoost. Moreover, the model identifies consumers' demographics. The findings show that people of age 45 are more likely to complain and that males are less likely to complain.
KW - Machine learning
KW - call center
KW - data processing
KW - phishing
KW - prediction methods
KW - spam
UR - http://www.scopus.com/inward/record.url?scp=85101654244&partnerID=8YFLogxK
U2 - 10.1109/BESC51023.2020.9348318
DO - 10.1109/BESC51023.2020.9348318
M3 - Conference contribution
AN - SCOPUS:85101654244
T3 - Proceedings of 2020 7th IEEE International Conference on Behavioural and Social Computing, BESC 2020
BT - Proceedings of 2020 7th IEEE International Conference on Behavioural and Social Computing, BESC 2020
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 7th IEEE International Conference on Behavioural and Social Computing, BESC 2020
Y2 - 5 November 2020 through 7 November 2020
ER -