English
español
Please use this identifier to cite or link to this item:
http://hdl.handle.net/10261/115926
Share/Impact:
Statistics |
![]() ![]() ![]() |
|
|
Visualizar otros formatos: MARC | Dublin Core | RDF | ORE | MODS | METS | DIDL | DATACITE | |||
|
Title: | Crowdsourcing dialect characterization through twitter |
Authors: | Gonçalves, Bruno; Sánchez, David ![]() |
Issue Date: | 19-Nov-2014 |
Publisher: | Public Library of Science |
Citation: | PLoS ONE 9(11): e112074 (2014) |
Abstract: | We perform a large-scale analysis of language diatopic variation using geotagged microblogging datasets. By collecting all Twitter messages written in Spanish over more than two years, we build a corpus from which a carefully selected list of concepts allows us to characterize Spanish varieties on a global scale. A cluster analysis proves the existence of well defined macroregions sharing common lexical properties. Remarkably enough, we find that Spanish language is split into two superdialects, namely, an urban speech used across major American and Spanish citites and a diverse form that encompasses rural areas and small towns. The latter can be further clustered into smaller varieties with a stronger regional character. Copyright: © 2014 Gonçalves, Sanchez. |
Description: | Data Availability: The authors confirm that all data underlying the findings are fully available without restriction. All relevant data are publicly available. Documentation on how to query the Twitter API can be found here: https://dev.twitter.com/overview/documentation. |
Publisher version (URL): | http://dx.doi.org/10.1371/journal.pone.0112074 |
URI: | http://hdl.handle.net/10261/115926 |
DOI: | 10.1371/journal.pone.0112074 |
Identifiers: | doi: 10.1371/journal.pone.0112074 issn: 1932-6203 |
Appears in Collections: | (IFISC) Artículos |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
crowdsourcing_dialect_Goncalves.pdf | 1,21 MB | Adobe PDF | ![]() View/Open |
Show full item record
Review this work
Review this work
Related articles:
WARNING: Items in Digital.CSIC are protected by copyright, with all rights reserved, unless otherwise indicated.