Discovering semantic features in the literature: a foundation for building functional associations

Chagoyen, Mónica; Carmona-Sáez, Pedro; Shatkay, Hagit; Carazo, José M.; Pascual-Montano, Alberto

Please use this identifier to cite or link to this item: http://hdl.handle.net/10261/3458

DC Field	Value	Language
dc.contributor.author	Chagoyen, Mónica	-
dc.contributor.author	Carmona-Sáez, Pedro	-
dc.contributor.author	Shatkay, Hagit	-
dc.contributor.author	Carazo, José M.	-
dc.contributor.author	Pascual-Montano, Alberto	-
dc.date.accessioned	2008-04-07T12:27:09Z	-
dc.date.available	2008-04-07T12:27:09Z	-
dc.date.issued	2006-01-26	-
dc.identifier.citation	BMC Bioinformatics 2006, 7:41	en_US
dc.identifier.issn	1471-2105	-
dc.identifier.uri	http://hdl.handle.net/10261/3458	-
dc.description	This article is available from: http://www.biomedcentral.com/1471-2105/7/41	en_US
dc.description.abstract	[Background] Experimental techniques such as DNA microarray, serial analysis of gene expression (SAGE) and mass spectrometry proteomics, among others, are generating large amounts of data related to genes and proteins at different levels. As in any other experimental approach, it is necessary to analyze these data in the context of previously known information about the biological entities under study. The literature is a particularly valuable source of information for experiment validation and interpretation. Therefore, the development of automated text mining tools to assist in such interpretation is one of the main challenges in current bioinformatics research.	en_US
dc.description.abstract	[Results] We present a method to create literature profiles for large sets of genes or proteins based on common semantic features extracted from a corpus of relevant documents. These profiles can be used to establish pair-wise similarities among genes, utilized in gene/protein classification or can be even combined with experimental measurements. Semantic features can be used by researchers to facilitate the understanding of the commonalities indicated by experimental results. Our approach is based on non-negative matrix factorization (NMF), a machine-learning algorithm for data analysis, capable of identifying local patterns that characterize a subset of the data. The literature is thus used to establish putative relationships among subsets of genes or proteins and to provide coherent justification for this clustering into subsets. We demonstrate the utility of the method by applying it to two independent and vastly different sets of genes.	en_US
dc.description.abstract	[Conclusion] The presented method can create literature profiles from documents relevant to sets of genes. The representation of genes as additive linear combinations of semantic features allows for the exploration of functional associations as well as for clustering, suggesting a valuable methodology for the validation and interpretation of high-throughput experimental data.	en_US
dc.description.sponsorship	This work has been partially funded by Santander-UCM (grant PR27/05-13964), Comunidad Autonoma de Madrid (grant CAM GR/SAL/0653/2004), Comision Interministerial de Ciencia y Tecnologia (grants CICYT BFU2004-00217/BMC and GEN2003-20235-c05-05) and a collaborative grant between the Spanish Research Council and the National Research Council of Canada (CSIC-050402040003). PCS is the recipient of a grant from Comunidad Autonoma de Madrid. APM acknowledges the support of the Spanish Ramón y Cajal program. HS is supported by the Canadian NSERC Discovery Grant 298292-04.	en_US
dc.format.extent	1355397 bytes	-
dc.format.extent	590431 bytes	-
dc.format.extent	1105408 bytes	-
dc.format.extent	31862 bytes	-
dc.format.extent	24067 bytes	-
dc.format.mimetype	application/pdf	-
dc.format.mimetype	application/pdf	-
dc.format.mimetype	application/vnd.ms-excel	-
dc.format.mimetype	application/pdf	-
dc.format.mimetype	application/pdf	-
dc.language.iso	eng	en_US
dc.publisher	BioMed Central	en_US
dc.relation.isversionof	Publisher's version	-
dc.rights	openAccess	en_US
dc.title	Discovering semantic features in the literature: a foundation for building functional associations	en_US
dc.type	article	en_US
dc.identifier.doi	10.1186/1471-2105-7-41	-
dc.description.peerreviewed	Peer reviewed	en_US
dc.identifier.pmid	16438716	-
dc.type.coar	http://purl.org/coar/resource_type/c_6501	es_ES
item.cerifentitytype	Publications	-
item.openairecristype	http://purl.org/coar/resource_type/c_18cf	-
item.grantfulltext	open	-
item.openairetype	article	-
item.fulltext	With Fulltext	-
item.languageiso639-1	en	-
Appears in collections:	(CNB) Artículos
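
The [Results] abstract above describes representing genes as additive combinations of semantic features found by non-negative matrix factorization (NMF), and using those profiles to compute pair-wise similarities. The following is a minimal sketch of that idea, not the authors' implementation: the record does not state which NMF variant, term weighting, or similarity measure the paper uses, so the Lee-Seung multiplicative updates, the raw counts, and the cosine similarity here are assumptions, and the term-by-gene matrix is made-up toy data.

```python
import numpy as np


def nmf(V, k, n_iter=500, seed=0, eps=1e-9):
    """Factor a non-negative matrix V (terms x genes) as W @ H.

    Uses Lee-Seung multiplicative updates for the squared-error
    objective (an assumed variant; the record does not specify one).
    Columns of W are semantic features; columns of H are gene profiles.
    """
    rng = np.random.default_rng(seed)
    n_terms, n_genes = V.shape
    W = rng.random((n_terms, k)) + eps
    H = rng.random((k, n_genes)) + eps
    for _ in range(n_iter):
        # Each update keeps the factors non-negative by construction.
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H


def cosine_similarity(H):
    """Pair-wise cosine similarity between gene profiles (columns of H)."""
    norms = np.linalg.norm(H, axis=0, keepdims=True)
    unit = H / np.maximum(norms, 1e-12)
    return unit.T @ unit


# Hypothetical term-by-gene counts: rows are terms, columns are genes.
terms = ["kinase", "phosphorylation", "ribosome", "translation"]
genes = ["geneA", "geneB", "geneC", "geneD"]
V = np.array([
    [5, 4, 0, 0],
    [4, 5, 0, 1],
    [0, 0, 6, 5],
    [0, 1, 5, 6],
], dtype=float)

W, H = nmf(V, k=2)          # two semantic features
S = cosine_similarity(H)    # gene-by-gene similarity matrix

for j, gene in enumerate(genes):
    print(gene, np.round(H[:, j], 2))
```

In this toy setup, genes whose documents share vocabulary end up with similar columns in `H`, so `S` can feed a clustering step, while the largest entries in each column of `W` indicate which terms characterize each feature.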

Files in this item:

File	Description	Size	Format
1471-2105-7-41.pdf	Main	1.32 MB	Adobe PDF
1471-2105-7-41-s1.pdf	Additional file 1	576.59 kB	Adobe PDF
1471-2105-7-41-s2.xls	Additional file 2	1.08 MB	Microsoft Excel
1471-2105-7-41-s3.pdf	Additional file 3	31.12 kB	Adobe PDF
1471-2105-7-41-s4.pdf	Additional file 4	23.5 kB	Adobe PDF

Metric	Count	Checked on
PubMed Central citations	25	13 Apr 2024
Scopus citations	65	24 Apr 2024
Web of Science citations	58	25 Feb 2024
Page views	454	24 Apr 2024
Downloads	521	24 Apr 2024
