Please use this identifier to cite or link to this item: http://hdl.handle.net/10261/161333
Share/Export:
logo share SHARE logo core CORE BASE
Visualizar otros formatos: MARC | Dublin Core | RDF | ORE | MODS | METS | DIDL | DATACITE

Invite to open peer review
Title

Tweet-SCAN: An event discovery technique for geo-located tweets

AuthorsCapdevila, Joan CSIC ; Cerquides, Jesús CSIC ORCID ; Nin, Jordi; Torres, Jordi
KeywordsTwitter
Hierarchical Dirichlet Process (HDP)
Probabilistic topic models
Unsupervised learning
Event discovery
DBSCAN
Issue Date2017
PublisherElsevier
CitationPattern Recognition Letters 93: 58- 68 (2017)
AbstractTwitter has become one of the most popular Location-based Social Networks (LBSNs) that bridges physical and virtual worlds. Tweets, 140-character-long messages, are aimed to give answer to the What¿s happening? question. Occurrences and events in the real life (such as political protests, music concerts, natural disasters or terrorist acts) are usually reported through geo-located tweets by users on site. Uncovering event-related tweets from the rest is a challenging problem that necessarily requires exploiting different tweet features. With that in mind, we propose Tweet-SCAN, a novel event discovery technique based on the popular density-based clustering algorithm called DBSCAN. Tweet-SCAN takes into account four main features from a tweet, namely content, time, location and user to group together event-related tweets. The proposed technique models textual content through a probabilistic topic model called Hierarchical Dirichlet Process and introduces Jensen¿Shannon distance for the task of neighborhood identification in the textual dimension. As a matter of fact, we show Tweet-SCAN performance in two real data sets of geo-located tweets posted during Barcelona local festivities in 2014 and 2015, for which some of the events were identified by domain experts beforehand. Through these tagged data sets, we are able to assess Tweet-SCAN capabilities to discover events, justify using a textual component and highlight the effects of several parameters. © 2016 Elsevier B.V.
URIhttp://hdl.handle.net/10261/161333
DOI10.1016/j.patrec.2016.08.010
Identifiersdoi: 10.1016/j.patrec.2016.08.010
issn: 0167-8655
Appears in Collections:(IIIA) Artículos




Files in This Item:
File Description SizeFormat
accesoRestringido.pdf15,38 kBAdobe PDFThumbnail
View/Open
Show full item record

CORE Recommender

SCOPUSTM   
Citations

18
checked on May 21, 2024

WEB OF SCIENCETM
Citations

16
checked on Feb 28, 2024

Page view(s)

350
checked on May 22, 2024

Download(s)

106
checked on May 22, 2024

Google ScholarTM

Check

Altmetric

Altmetric


WARNING: Items in Digital.CSIC are protected by copyright, with all rights reserved, unless otherwise indicated.