TY - GEN
T1 - CrisisViT
T2 - 20th Global Information Systems for Crisis Response and Management Conference, ISCRAM 2023
AU - Long, Zijun
AU - McCreadie, Richard
AU - Imran, Muhammad
N1 - Publisher Copyright:
© 2023 Information Systems for Crisis Response and Management, ISCRAM. All rights reserved.
PY - 2023
Y1 - 2023
N2 - In times of emergency, crisis response agencies need to quickly and accurately assess the situation on the ground in order to deploy relevant services and resources. However, authorities often have to make decisions based on limited information, as data on affected regions can be scarce until local response services can provide first-hand reports. Fortunately, the widespread availability of smartphones with high-quality cameras has made citizen journalism through social media a valuable source of information for crisis responders. However, analyzing the large volume of images posted by citizens requires more time and effort than is typically available. To address this issue, this paper proposes the use of state-of-the-art deep neural models for automatic image classification/tagging, specifically by adapting transformer-based architectures for crisis image classification (CrisisViT). We leverage the new Incidents1M crisis image dataset to develop a range of new transformer-based image classification models. Through experimentation over the standard Crisis image benchmark dataset, we demonstrate that the CrisisViT models significantly outperform previous approaches in emergency type, image relevance, humanitarian category, and damage severity classification. Additionally, we show that the new Incidents1M dataset can further augment the CrisisViT models resulting in an additional 1.25% absolute accuracy gain.
AB - In times of emergency, crisis response agencies need to quickly and accurately assess the situation on the ground in order to deploy relevant services and resources. However, authorities often have to make decisions based on limited information, as data on affected regions can be scarce until local response services can provide first-hand reports. Fortunately, the widespread availability of smartphones with high-quality cameras has made citizen journalism through social media a valuable source of information for crisis responders. However, analyzing the large volume of images posted by citizens requires more time and effort than is typically available. To address this issue, this paper proposes the use of state-of-the-art deep neural models for automatic image classification/tagging, specifically by adapting transformer-based architectures for crisis image classification (CrisisViT). We leverage the new Incidents1M crisis image dataset to develop a range of new transformer-based image classification models. Through experimentation over the standard Crisis image benchmark dataset, we demonstrate that the CrisisViT models significantly outperform previous approaches in emergency type, image relevance, humanitarian category, and damage severity classification. Additionally, we show that the new Incidents1M dataset can further augment the CrisisViT models resulting in an additional 1.25% absolute accuracy gain.
KW - Crisis Management
KW - Deep Learning
KW - Social Media Classification
KW - Supervised Learning
KW - Vision transformers
UR - http://www.scopus.com/inward/record.url?scp=85171756137&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85171756137
T3 - Proceedings of the International ISCRAM Conference
SP - 309
EP - 319
BT - Proceedings - 20th Global Information Systems for Crisis Response and Management Conference, ISCRAM 2023
PB - Information Systems for Crisis Response and Management, ISCRAM
Y2 - 28 May 2023 through 31 May 2023
ER -