TY - GEN
T1 - Things change
T2 - 2020 ACM CHI Conference on Human Factors in Computing Systems, CHI EA 2020
AU - Jung, Soon Gyo
AU - Salminen, Joni
AU - Chowdhury, Shammur A.
AU - Ramirez Robillos, Dianne
AU - Jansen, Bernard J.
N1 - Publisher Copyright:
© 2020 Owner/Author.
PY - 2020/4/25
Y1 - 2020/4/25
N2 - We address a recommendation task for next likely flight destination to customers of a major international airline company. We compare performance using historical flight data and an actual user evaluation. Using two years of historical flight data consisting of tens of millions of flights, an ensemble and a collaborative filtering approach obtained an accuracy of 47% and 20% using a test set of 100,000 customers, respectively, highlighting the challenge of the domain. We then evaluated our recommendations on 10,000 actual customers, with a 45-45-10 split among ensemble, collaborative filtering, and control group. The overall predictive power employed with real users was 23%, with the ensemble method having a predictive power of 19% and 30% for collaborative filtering. Results indicate that, in complex and shifting domains such as this one, one cannot rely solely on historical data for evaluating the impact of user recommendations. We discuss implications for recommendation systems and future research in this and related domains.
AB - We address a recommendation task for next likely flight destination to customers of a major international airline company. We compare performance using historical flight data and an actual user evaluation. Using two years of historical flight data consisting of tens of millions of flights, an ensemble and a collaborative filtering approach obtained an accuracy of 47% and 20% using a test set of 100,000 customers, respectively, highlighting the challenge of the domain. We then evaluated our recommendations on 10,000 actual customers, with a 45-45-10 split among ensemble, collaborative filtering, and control group. The overall predictive power employed with real users was 23%, with the ensemble method having a predictive power of 19% and 30% for collaborative filtering. Results indicate that, in complex and shifting domains such as this one, one cannot rely solely on historical data for evaluating the impact of user recommendations. We discuss implications for recommendation systems and future research in this and related domains.
KW - Algorithmic trade-off
KW - Prediction
KW - Recommendations
KW - User study
UR - http://www.scopus.com/inward/record.url?scp=85090204011&partnerID=8YFLogxK
U2 - 10.1145/3334480.3382945
DO - 10.1145/3334480.3382945
M3 - Conference contribution
AN - SCOPUS:85090204011
T3 - Conference on Human Factors in Computing Systems - Proceedings
BT - CHI EA 2020 - Extended Abstracts of the 2020 CHI Conference on Human Factors in Computing Systems
PB - Association for Computing Machinery
Y2 - 25 April 2020 through 30 April 2020
ER -