TY - GEN
T1 - Towards a data-driven approach to identify crisis-related topics in social media streams
AU - Imran, Muhammad
AU - Castillo, Carlos
PY - 2015/5/18
Y1 - 2015/5/18
N2 - While categorizing any type of user-generated content online is a challenging problem, categorizing social media messages during a crisis situation adds an additional layer of complex- ity, due to the volume and variability of information, and to the fact that these messages must be classified as soon as they arrive. Current approaches involve the use of au- tomaticspecification, humanspecification, or a mixture of both. In these types of approaches, there are several reasons to keep the number of information categories small and up- dated, which we examine in this article. This means at the onset of a crisis an expert must select a handful of informa- tion categories into which information will be categorized. The next step, as the crisis unfolds, is to dynamically change the initial set as new information is posted online. In this paper, we propose an effective way to dynamically extract emerging, potentially interesting, new categories from social media data.
AB - While categorizing any type of user-generated content online is a challenging problem, categorizing social media messages during a crisis situation adds an additional layer of complex- ity, due to the volume and variability of information, and to the fact that these messages must be classified as soon as they arrive. Current approaches involve the use of au- tomaticspecification, humanspecification, or a mixture of both. In these types of approaches, there are several reasons to keep the number of information categories small and up- dated, which we examine in this article. This means at the onset of a crisis an expert must select a handful of informa- tion categories into which information will be categorized. The next step, as the crisis unfolds, is to dynamically change the initial set as new information is posted online. In this paper, we propose an effective way to dynamically extract emerging, potentially interesting, new categories from social media data.
KW - Information types
KW - Social media content analysis
KW - Stream classification
KW - Text classification
UR - http://www.scopus.com/inward/record.url?scp=84968552344&partnerID=8YFLogxK
U2 - 10.1145/2740908.2741729
DO - 10.1145/2740908.2741729
M3 - Conference contribution
AN - SCOPUS:84968552344
T3 - WWW 2015 Companion - Proceedings of the 24th International Conference on World Wide Web
SP - 1205
EP - 1210
BT - WWW 2015 Companion - Proceedings of the 24th International Conference on World Wide Web
PB - Association for Computing Machinery, Inc
T2 - 24th International Conference on World Wide Web, WWW 2015
Y2 - 18 May 2015 through 22 May 2015
ER -