English   español  
Por favor, use este identificador para citar o enlazar a este item: http://hdl.handle.net/10261/168556
logo share SHARE logo core CORE   Add this article to your Mendeley library MendeleyBASE

Visualizar otros formatos: MARC | Dublin Core | RDF | ORE | MODS | METS | DIDL
Exportar a otros formatos:

Identification of expression patterns in the progression of disease stages by integration of transcriptomic data

AutorAibar, Sara ; Abáigar, María ; Campos-Laborie, Francisco J.; Sanchez-Santos, Jose Manuel; Hernández, Jesús M. ; De Las Rivas, Javier
Palabras claveData integration
Expression pattern
Pattern recognition
Disease stage
Disease subtype
Disease progression
Gene expression
Gene signature
Expression profiling
Fecha de publicación2016
EditorBioMed Central
CitaciónBMC Bioinformatics 17(Supl.15): 11-21 (2016)
Resumen[Background]: In the study of complex diseases using genome-wide expression data from clinical samples, a difficult case is the identification and mapping of the gene signatures associated to the stages that occur in the progression of a disease. The stages usually correspond to different subtypes or classes of the disease, and the difficulty to identify them often comes from patient heterogeneity and sample variability that can hide the biomedical relevant changes that characterize each stage, making standard differential analysis inadequate or inefficient. [Results]: We propose a methodology to study diseases or disease stages ordered in a sequential manner (e.g. from early stages with good prognosis to more acute or serious stages associated to poor prognosis). The methodology is applied to diseases that have been studied obtaining genome-wide expression profiling of cohorts of patients at different stages. The approach allows searching for consistent expression patterns along the progression of the disease through two major steps: (i) identifying genes with increasing or decreasing trends in the progression of the disease; (ii) clustering the increasing/decreasing gene expression patterns using an unsupervised approach to reveal whether there are consistent patterns and find genes altered at specific disease stages. The first step is carried out using Gamma rank correlation to identify genes whose expression correlates with a categorical variable that represents the stages of the disease. The second step is done using a Self Organizing Map (SOM) to cluster the genes according to their progressive profiles and identify specific patterns. Both steps are done after normalization of the genomic data to allow the integration of multiple independent datasets. In order to validate the results and evaluate their consistency and biological relevance, the methodology is applied to datasets of three different diseases: myelodysplastic syndrome, colorectal cancer and Alzheimer's disease. A software script written in R, named genediseasePatterns, is provided to allow the use and application of the methodology. [Conclusion]: The method presented allows the analysis of the progression of complex and heterogeneous diseases that can be divided in pathological stages. It identifies gene groups whose expression patterns change along the advance of the disease, and it can be applied to different types of genomic data studying cohorts of patients in different states.
DescripciónFrom Statistical Methods for Omics Data Integration and Analysis 2015 - Valencia, Spain. 14-16 September 2015.
Versión del editorhttps://doi.org/10.1186/s12859-016-1290-4
Identificadoresdoi: 10.1186/s12859-016-1290-4
e-issn: 1471-2105
Aparece en las colecciones: (IBMCC) Artículos
Ficheros en este ítem:
Fichero Descripción Tamaño Formato  
identifidata.pdf1,05 MBAdobe PDFVista previa
Mostrar el registro completo

Artículos relacionados:

NOTA: Los ítems de Digital.CSIC están protegidos por copyright, con todos los derechos reservados, a menos que se indique lo contrario.