Enabling network inference methods to handle missing data and outliers

Folch-Fortuny, Abel; Villaverde, A. F.; Ferrer, Alberto; Banga, Julio R.

Please use this identifier to cite or link to this item: http://hdl.handle.net/10261/304346

Title:	Enabling network inference methods to handle missing data and outliers
Authors:	Folch-Fortuny, Abel; Villaverde, A. F.; Ferrer, Alberto; Banga, Julio R.
Keywords:	Network inference; Missing data; Outlier detection; Projection to latent structures; Trimmed scores regression; Information theory; Mutual information
Publication date:	2015
Publisher:	BioMed Central
Citation:	BMC Bioinformatics 16: 283 (2015)
Abstract:	Background: The inference of complex networks from data is a challenging problem in biological sciences, as well as in a wide range of disciplines such as chemistry, technology, economics, or sociology. The quantity and quality of the data greatly affect the results. While many methodologies have been developed for this task, they seldom take into account issues such as missing data or outlier detection and correction, which need to be properly addressed before network inference. Results: Here we present an approach to (i) handle missing data and (ii) detect and correct outliers based on multivariate projection to latent structures. The method, called trimmed scores regression (TSR), enables network inference methods to analyse incomplete datasets by imputing the missing values coherently with the latent data structure. Furthermore, it substitutes the faulty values in a dataset by proper estimations. We provide an implementation of this approach, and show how it can be integrated with any network inference method as a preliminary data curation step. This functionality is demonstrated with a state-of-the-art network inference method based on mutual information distance and entropy reduction, MIDER. Conclusion: The methodology presented here enables network inference methods to analyse a large number of incomplete and faulty datasets that could not be reliably analysed so far. Our comparative studies show the superiority of TSR over other missing data approaches used by practitioners. Furthermore, the method allows for outlier detection and correction.
Description:	12 pages, 3 figures, 2 tables
Publisher version:	http://dx.doi.org/10.1186/s12859-015-0717-7
URI:	http://hdl.handle.net/10261/304346
DOI:	10.1186/s12859-015-0717-7
E-ISSN:	1471-2105
Appears in collections:	(IIM) Artículos

Files in this item:

File	Description	Size	Format
Enabling_network_OA_2015.pdf		852.56 kB	Adobe PDF

Scopus citations: 18 (checked on 1 May 2024)
Web of Science citations: 15 (checked on 25 Feb 2024)
Page views: 52 (checked on 2 May 2024)
Downloads: 10 (checked on 2 May 2024)
