A hybrid domain adaptation approach for identifying crisis-relevant tweets

Reza Mazloom, Hongmin Li, Doina Caragea, Cornelia Caragea, Muhammad Imran

Research output: Contribution to journalArticlepeer-review

Abstract

Huge amounts of data generated on social media during emergency situations is regarded as a trove of critical information. The use of supervised machine learning techniques in the early stages of a crisis is challenged by the lack of labeled data for that event. Furthermore, supervised models trained on labeled data from a prior crisis may not produce accurate results, due to inherent crisis variations. To address these challenges, the authors propose a hybrid feature instance-parameter adaptation approach based on matrix factorization, k-nearest neighbors, and self-training. The proposed feature-instance adaptation selects a subset of the source crisis data that is representative for the target crisis data. The selected labeled source data, together with unlabeled target data, are used to learn self-training domain adaptation classifiers for the target crisis. Experimental results have shown that overall the hybrid domain adaptation classifiers perform better than the supervised classifiers learned from the original source data.
Original languageEnglish
JournalInternational Journal of Information Systems for Crisis Response and Management
Volume11
Issue number2
DOIs
Publication statusPublished - Jul 2019

Fingerprint

Dive into the research topics of 'A hybrid domain adaptation approach for identifying crisis-relevant tweets'. Together they form a unique fingerprint.

Cite this