Please use this identifier to cite or link to this item: http://hdl.handle.net/10261/130393
Title: Expectation-Maximization Binary Clustering for Behavioural Annotation

Authors: Garriga, Joan; Palmer, John R. B.; Oltra, Aitana; Bartumeus, Frederic
Issue Date: 2016
Publisher: Public Library of Science
Citation: PLoS ONE 11(3): e0151984 (2016)
Abstract: The growing capacity to process and store animal tracks has spurred the development of new methods to segment animal trajectories into elementary units of movement. Key challenges for movement trajectory segmentation are to (i) minimize the need for supervision, (ii) reduce computational costs, (iii) minimize the need for prior assumptions (e.g. simple parametrizations), and (iv) capture biologically meaningful semantics, useful across a broad range of species. We introduce the Expectation-Maximization binary Clustering (EMbC), a general purpose, unsupervised approach to multivariate data clustering. The EMbC is a variant of the Expectation-Maximization Clustering (EMC), a clustering algorithm based on the maximum likelihood estimation of a Gaussian mixture model. This is an iterative algorithm with a closed-form step solution and hence a reasonable computational cost. The method looks for a good compromise between statistical soundness and ease and generality of use (by minimizing prior assumptions and favouring the semantic interpretation of the final clustering). Here we focus on the suitability of the EMbC algorithm for behavioural annotation of movement data. We show and discuss the EMbC outputs in both simulated trajectories and empirical movement trajectories including different species and different tracking methodologies. We use synthetic trajectories to assess the performance of EMbC compared to classic EMC and Hidden Markov Models. Empirical trajectories allow us to explore the robustness of the EMbC to data loss and data inaccuracies, and assess the relationship between EMbC output and expert label assignments. Additionally, we suggest a smoothing procedure to account for temporal correlations among labels, and a proper visualization of the output for movement trajectories. Our algorithm is available as an R-package with a set of complementary functions to ease the analysis.
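Note: the abstract describes classic EMC as maximum likelihood estimation of a Gaussian mixture model with closed-form iteration steps. As a rough illustration of that baseline only (not of EMbC itself, whose binary-partition variant is provided by the authors' R package), a minimal sketch for a two-component, one-dimensional mixture might look as follows. The function names, the initialisation, and the synthetic data are assumptions made for this example and do not come from the paper.

```python
import math
import random


def gaussian_pdf(x, mean, var):
    """Density of a univariate Gaussian with the given mean and variance."""
    return math.exp(-(x - mean) ** 2 / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)


def em_gmm_1d(data, n_iter=100):
    """Classic EM for a two-component 1D Gaussian mixture (illustrative, not EMbC)."""
    n = len(data)
    # Assumed crude initialisation: means at the data extremes,
    # a shared variance equal to the sample variance, equal weights.
    grand_mean = sum(data) / n
    var0 = sum((x - grand_mean) ** 2 for x in data) / n
    means = [min(data), max(data)]
    variances = [var0, var0]
    weights = [0.5, 0.5]
    resp = []
    for _ in range(n_iter):
        # E-step: posterior probability of each component for each point.
        resp = []
        for x in data:
            p = [weights[k] * gaussian_pdf(x, means[k], variances[k]) for k in range(2)]
            total = sum(p)
            resp.append([pk / total for pk in p])
        # M-step: closed-form weighted updates of weights, means and variances.
        for k in range(2):
            nk = sum(r[k] for r in resp)
            weights[k] = nk / n
            means[k] = sum(r[k] * x for r, x in zip(resp, data)) / nk
            var_k = sum(r[k] * (x - means[k]) ** 2 for r, x in zip(resp, data)) / nk
            variances[k] = max(var_k, 1e-6)  # guard against a collapsing component
    # Hard labels: the component with the larger posterior probability.
    labels = [0 if r[0] >= r[1] else 1 for r in resp]
    return weights, means, variances, labels


# Synthetic data (assumed for the example): two well-separated Gaussian clusters.
random.seed(0)
data = [random.gauss(0.0, 1.0) for _ in range(200)]
data += [random.gauss(6.0, 1.0) for _ in range(200)]
weights, means, variances, labels = em_gmm_1d(data)
```

Both steps are direct formulas with no inner optimisation, which is the sense in which each iteration has a closed-form solution and a modest computational cost.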
Description: 26 pages, 11 figures, 1 table.
Publisher version (URL): http://dx.doi.org/10.1371/journal.pone.0151984
URI: http://hdl.handle.net/10261/130393
DOI: 10.1371/journal.pone.0151984
ISSN: 1932-6203
Appears in Collections: (CEAB) Artículos

Files in This Item:
File: Fede 2016.PDF
Size: 3.28 MB
Format: Adobe PDF

PubMed Central citations: 33 (checked on May 10, 2023)
Scopus citations: 87 (checked on Jun 2, 2023)
Web of Science citations: 85 (checked on Jun 6, 2023)
Page view(s): 340 (checked on Jun 6, 2023)
Download(s): 239 (checked on Jun 6, 2023)


WARNING: Items in Digital.CSIC are protected by copyright, with all rights reserved, unless otherwise indicated.