
Mining urban events from the tweet stream through a probabilistic mixture model

Authors: Capdevila, Joan (CSIC); Cerquides, Jesús (CSIC); Torres, Jordi
Keywords: Event detection; Social networks; Variational inference; Probabilistic models
Issue Date: 2018
Publisher: Springer Nature
Citation: Data Mining and Knowledge Discovery 32: 764-786 (2018)
Abstract: The geographical identification of content in social networks has made it possible to bridge the gap between online social platforms and the physical world. Although vast amounts of data in such networks are due to breaking news or global occurrences, local events witnessed by users in situ are also present in these streams and are of great importance for many city entities. Nowadays, unsupervised machine learning techniques, such as Tweet-SCAN, are able to retrospectively detect these local events from tweets. However, these approaches have limited ability to reason about unseen observations in a principled way because they lack a proper probabilistic foundation. Probabilistic models have also been proposed for the task, but their event identification capabilities are far from those of Tweet-SCAN. In this paper, we identify two key factors which, when combined, boost the accuracy of such models. As a first key factor, we notice that the large amount of meaningless social data requires explicitly modeling non-event observations. Therefore, we propose to incorporate a background model that captures spatio-temporal fluctuations of non-event tweets. As a second key factor, we observe that the shortness of tweets hampers the application of traditional topic models. Thus, we integrate event detection and topic modeling, assigning topic proportions to events instead of to individual tweets. As a result, we propose Warble, a new probabilistic model and learning scheme for retrospective event detection that incorporates these two key factors. We evaluate Warble on a data set of tweets located in Barcelona during its festivities. The empirical results show that the model outperforms other state-of-the-art techniques in detecting various types of events while relying on a principled probabilistic framework that enables reasoning under uncertainty.
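The first key factor in the abstract, an explicit background component for non-event tweets, can be illustrated with a toy mixture. The sketch below is not the Warble model: it assumes isotropic Gaussian event components over (x, y, t) and a uniform background density over a fixed space-time volume, whereas the paper's background model captures spatio-temporal fluctuations. All names and numbers are hypothetical.

```python
import math

def gaussian_pdf(point, mean, std):
    """Isotropic Gaussian density; std is shared by all axes (an assumption)."""
    d = len(point)
    sq = sum((p - m) ** 2 for p, m in zip(point, mean))
    norm = (2.0 * math.pi * std ** 2) ** (d / 2.0)
    return math.exp(-sq / (2.0 * std ** 2)) / norm

def responsibilities(point, events, weights, background_volume):
    """Posterior probability of each component given one observation.

    events: list of (mean, std) pairs, one per hypothetical event component.
    weights: mixing weights; the last entry belongs to the background.
    background_volume: volume of the space-time region, so the background
        density is the constant 1 / background_volume (a simplification).
    """
    # zip stops after the event components; the background weight is used below.
    scores = [w * gaussian_pdf(point, mean, std)
              for w, (mean, std) in zip(weights, events)]
    scores.append(weights[-1] / background_volume)
    total = sum(scores)
    return [s / total for s in scores]

# Two hypothetical events and a background that holds most of the mass.
events = [((0.0, 0.0, 0.0), 1.0), ((10.0, 10.0, 5.0), 1.0)]
weights = [0.2, 0.2, 0.6]
volume = 20.0 * 20.0 * 10.0

near = responsibilities((0.1, -0.2, 0.0), events, weights, volume)
far = responsibilities((5.0, 5.0, 2.5), events, weights, volume)
```

In this toy setting, an observation close to the first event centre is attributed mostly to that event, while an observation far from both centres is absorbed by the background component instead of being forced into an event.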
Identifiers: doi: 10.1007/s10618-017-0541-y
issn: 1573-756X
Appears in Collections: (IIIA) Articles

Files in This Item:
File: accesoRestringido.pdf; Size: 15.38 kB; Format: Adobe PDF
WARNING: Items in Digital.CSIC are protected by copyright, with all rights reserved, unless otherwise indicated.