Towards a data-driven approach to identify crisis-related topics in social media streams

Muhammad Imran, Carlos Castillo

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

39 Citations (Scopus)

Abstract

While categorizing any type of user-generated content online is a challenging problem, categorizing social media messages during a crisis situation adds an additional layer of complex- ity, due to the volume and variability of information, and to the fact that these messages must be classified as soon as they arrive. Current approaches involve the use of au- tomaticspecification, humanspecification, or a mixture of both. In these types of approaches, there are several reasons to keep the number of information categories small and up- dated, which we examine in this article. This means at the onset of a crisis an expert must select a handful of informa- tion categories into which information will be categorized. The next step, as the crisis unfolds, is to dynamically change the initial set as new information is posted online. In this paper, we propose an effective way to dynamically extract emerging, potentially interesting, new categories from social media data.

Original languageEnglish
Title of host publicationWWW 2015 Companion - Proceedings of the 24th International Conference on World Wide Web
PublisherAssociation for Computing Machinery, Inc
Pages1205-1210
Number of pages6
ISBN (Electronic)9781450334730
DOIs
Publication statusPublished - 18 May 2015
Event24th International Conference on World Wide Web, WWW 2015 - Florence, Italy
Duration: 18 May 201522 May 2015

Publication series

NameWWW 2015 Companion - Proceedings of the 24th International Conference on World Wide Web

Conference

Conference24th International Conference on World Wide Web, WWW 2015
Country/TerritoryItaly
CityFlorence
Period18/05/1522/05/15

Keywords

  • Information types
  • Social media content analysis
  • Stream classification
  • Text classification

Fingerprint

Dive into the research topics of 'Towards a data-driven approach to identify crisis-related topics in social media streams'. Together they form a unique fingerprint.

Cite this