Please use this identifier to cite or link to this item: http://hdl.handle.net/10261/17470
Title

Biclustering of gene expression data by non-smooth non-negative matrix factorization

Authors: Carmona-Sáez, Pedro; Pascual-Marqui, Roberto D.; Tirado, Francisco; Carazo, José M.; Pascual-Montano, Alberto
Keywords: Datasets; nsNMF
Issue Date: 17-Feb-2006
Publisher: BioMed Central
Citation: BMC Bioinformatics 7:78 (2006)
Abstract: [Background] The widespread use of microarray technologies has enabled the generation and accumulation of gene expression datasets that contain expression levels of thousands of genes across tens or hundreds of different experimental conditions. One of the major challenges in the analysis of such datasets is to discover local structures composed of sets of genes that show coherent expression patterns across subsets of experimental conditions. These patterns may provide clues about the main biological processes associated with different physiological states.
[Results] In this work we present a methodology that clusters genes and conditions that are highly related within sub-portions of the data. Our approach is based on a new data mining technique, Non-smooth Non-Negative Matrix Factorization (nsNMF), which is able to identify localized patterns in large datasets. We assessed the potential of this methodology by analyzing several synthetic datasets as well as two large and heterogeneous sets of gene expression profiles. In all cases the method was able to identify localized features related to sets of genes that show consistent expression patterns across subsets of experimental conditions. The uncovered structures showed a clear biological meaning in terms of relationships among functional annotations of genes and the phenotypes or physiological states of the associated conditions.
[Conclusion] The proposed approach can be a useful tool for analyzing large and heterogeneous gene expression datasets. The method is able to identify complex relationships among genes and conditions that are difficult to identify with standard clustering algorithms.
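As an illustration only, and not the authors' implementation: nsNMF is usually formulated as V ≈ W S H, where V is the non-negative genes-by-conditions matrix and S = (1 − θ)I + (θ/k)11ᵀ is a smoothing matrix controlled by θ in [0, 1]. The sketch below assumes Frobenius-norm multiplicative updates, which is a simplification chosen for brevity. The function names (`smoothing_matrix`, `nsnmf`, `top_members`), the default parameters, and the top-N rule for reading a bicluster off a factor are all hypothetical.

```python
import numpy as np


def smoothing_matrix(k, theta):
    """Return S = (1 - theta) * I + (theta / k) * ones((k, k))."""
    return (1.0 - theta) * np.eye(k) + (theta / k) * np.ones((k, k))


def nsnmf(V, k, theta=0.5, n_iter=200, seed=0, eps=1e-9):
    """Approximate V (genes x conditions, non-negative) as W @ S @ H.

    Assumption: Frobenius-norm multiplicative updates; this is a
    simplified sketch, not the procedure described in the paper.
    """
    rng = np.random.default_rng(seed)
    n_genes, n_conds = V.shape
    W = rng.random((n_genes, k)) + eps
    H = rng.random((k, n_conds)) + eps
    S = smoothing_matrix(k, theta)
    for _ in range(n_iter):
        # Update H while treating W @ S as a fixed basis.
        WS = W @ S
        H *= (WS.T @ V) / (WS.T @ WS @ H + eps)
        # Update W while treating S @ H as fixed coefficients.
        SH = S @ H
        W *= (V @ SH.T) / (W @ SH @ SH.T + eps)
    return W, S, H


def top_members(W, H, factor, n_genes=10, n_conds=5):
    """Hypothetical bicluster read-out: the genes and conditions with
    the largest coefficients in one factor."""
    genes = np.argsort(W[:, factor])[::-1][:n_genes]
    conds = np.argsort(H[factor, :])[::-1][:n_conds]
    return genes, conds


if __name__ == "__main__":
    # Synthetic data with one planted block of co-expressed genes.
    V = np.random.default_rng(1).random((30, 20))
    V[:10, :5] += 5.0
    W, S, H = nsnmf(V, k=3, theta=0.5)
    print(top_members(W, H, factor=0))
```

With θ = 0 the smoothing matrix is the identity and the model reduces to plain NMF; larger θ smooths the product through S, which pushes W and H toward sparser, more localized factors.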
Description: 18 pages, 1 table, 5 figures, 1 additional file.
Publisher version (URL): http://dx.doi.org/10.1186/1471-2105-7-78
URI: http://hdl.handle.net/10261/17470
DOI: 10.1186/1471-2105-7-78
ISSN: 1471-2105
Appears in Collections: (CNB) Artículos




Files in This Item:
File: 1471-2105-7-78.pdf
Size: 624,54 kB
Format: Adobe PDF

PubMed Central citations: 56 (checked on Apr 21, 2024)
Scopus citations: 154 (checked on Apr 23, 2024)
Web of Science citations: 104 (checked on Feb 28, 2024)
Page views: 467 (checked on Apr 21, 2024)
Downloads: 226 (checked on Apr 21, 2024)

WARNING: Items in Digital.CSIC are protected by copyright, with all rights reserved, unless otherwise indicated.