TY - JOUR
T1 - Your culture is in your password
T2 - An analysis of a demographically-diverse password dataset
AU - AlSabah, Mashael
AU - Oligeri, Gabriele
AU - Riley, Ryan
N1 - Publisher Copyright:
© 2018 Elsevier Ltd
PY - 2018/8
Y1 - 2018/8
N2 - A large number of studies on passwords make use of passwords leaked by attackers who compromised online services. Frequently, these leaks contain only the passwords themselves, or basic information such as usernames or email addresses. While metadata-rich leaks exist, they are often limited in the variety of demographics they cover. In this work, we analyze a meta-data rich data leak from a Middle Eastern bank with a demographically-diverse user base. We provide an analysis of passwords created by groups of people of different cultural backgrounds, some of which are under-represented in existing data leaks, e.g., Arab, Filipino, Indian, and Pakistani. The contributions provided by this work are many-fold. First, our results contribute to the existing body of knowledge regarding how users include personal information in their passwords. Second, we illustrate the differences that exist in how users from different cultural/linguistic backgrounds create passwords. Finally, we study the (empirical and theoretical) guessability of the dataset based on two attacker models, and show that a state of the art password strength estimator inflates the strength of passwords created by users from non-English speaking backgrounds. We improve its estimations by training it with contextually relevant information.
AB - A large number of studies on passwords make use of passwords leaked by attackers who compromised online services. Frequently, these leaks contain only the passwords themselves, or basic information such as usernames or email addresses. While metadata-rich leaks exist, they are often limited in the variety of demographics they cover. In this work, we analyze a meta-data rich data leak from a Middle Eastern bank with a demographically-diverse user base. We provide an analysis of passwords created by groups of people of different cultural backgrounds, some of which are under-represented in existing data leaks, e.g., Arab, Filipino, Indian, and Pakistani. The contributions provided by this work are many-fold. First, our results contribute to the existing body of knowledge regarding how users include personal information in their passwords. Second, we illustrate the differences that exist in how users from different cultural/linguistic backgrounds create passwords. Finally, we study the (empirical and theoretical) guessability of the dataset based on two attacker models, and show that a state of the art password strength estimator inflates the strength of passwords created by users from non-English speaking backgrounds. We improve its estimations by training it with contextually relevant information.
KW - Authentication
KW - Authorization
KW - Password security
KW - Passwords analytics
KW - Usable security
UR - http://www.scopus.com/inward/record.url?scp=85047259015&partnerID=8YFLogxK
U2 - 10.1016/j.cose.2018.03.014
DO - 10.1016/j.cose.2018.03.014
M3 - Article
AN - SCOPUS:85047259015
SN - 0167-4048
VL - 77
SP - 427
EP - 441
JO - Computers and Security
JF - Computers and Security
ER -