TY - JOUR
T1 - MEDIC
T2 - a multi-task learning dataset for disaster image classification
AU - Alam, Firoj
AU - Alam, Tanvirul
AU - Hasan, Md Arid
AU - Hasnat, Abul
AU - Imran, Muhammad
AU - Ofli, Ferda
N1 - Publisher Copyright:
© 2022, The Author(s).
PY - 2023/1
Y1 - 2023/1
N2 - Recent research in disaster informatics demonstrates a practical and important use case of artificial intelligence to save human lives and suffering during natural disasters based on social media contents (text and images). While notable progress has been made using texts, research on exploiting the images remains relatively under-explored. To advance image-based approaches, we propose MEDIC (https://crisisnlp.qcri.org/meclic/index.html), which is the largest social media image classification dataset for humanitarian response consisting of 71,198 images to address four different tasks in a multi-task learning setup. This is the first dataset of its kind: social media images, disaster response, and multi-task learning research. An important property of this dataset is its high potential to facilitate research on multi-task learning, which recently receives much interest from the machine learning community and has shown remarkable results in terms of memory, inference speed, performance, and generalization capability. Therefore, the proposed dataset is an important resource for advancing image-based disaster management and multi-task machine learning research. We experiment with different deep learning architectures and report promising results, which are above the majority baselines for all tasks. Along with the dataset, we also release all relevant scripts (https://github.com/firojalam/medic).
AB - Recent research in disaster informatics demonstrates a practical and important use case of artificial intelligence to save human lives and suffering during natural disasters based on social media contents (text and images). While notable progress has been made using texts, research on exploiting the images remains relatively under-explored. To advance image-based approaches, we propose MEDIC (https://crisisnlp.qcri.org/meclic/index.html), which is the largest social media image classification dataset for humanitarian response consisting of 71,198 images to address four different tasks in a multi-task learning setup. This is the first dataset of its kind: social media images, disaster response, and multi-task learning research. An important property of this dataset is its high potential to facilitate research on multi-task learning, which recently receives much interest from the machine learning community and has shown remarkable results in terms of memory, inference speed, performance, and generalization capability. Therefore, the proposed dataset is an important resource for advancing image-based disaster management and multi-task machine learning research. We experiment with different deep learning architectures and report promising results, which are above the majority baselines for all tasks. Along with the dataset, we also release all relevant scripts (https://github.com/firojalam/medic).
KW - Crisis informatics
KW - Dataset
KW - Deep learning
KW - Image classification
KW - Multi-task learning
KW - Natural disasters
KW - Social media images
UR - http://www.scopus.com/inward/record.url?scp=85137462169&partnerID=8YFLogxK
U2 - 10.1007/s00521-022-07717-0
DO - 10.1007/s00521-022-07717-0
M3 - Article
AN - SCOPUS:85137462169
SN - 0941-0643
VL - 35
SP - 2609
EP - 2632
JO - Neural Computing and Applications
JF - Neural Computing and Applications
IS - 3
ER -