2024-03-29T15:51:40Zhttp://digital.csic.es/dspace-oai/requestoai:digital.csic.es:10261/1536602017-07-29T00:53:45Zcom_10261_11773com_10261_1col_10261_11774
00925njm 22002777a 4500
dc
Krallinger, Martin
author
Rojas, A. M.
author
Valencia, Alfonso
author
2009-03
High-throughput experimental techniques are generating large data collections with the aim of identifying novel entities involved in fundamental cellular processes as well as drawing a systematic picture of the relationships between individual components. Determining the accuracy of the resulting data and the selection of a subset of targets for more careful characterizations often requires relying on information provided by manually annotated data repositories. These repositories are incomplete and cover only a small fraction of the knowledge contained in the literature. We propose in this paper the use of text-mining technologies to extract, organize, and present information relevant for a particular biological topic. The aims of the resulting approach are (1) to enable topic-centric biological literature navigation, (2) to assist in the construction of manually revised data repositories, (3) to provide prioritization of biological entities for experimental studies, and (4) to enable human interpretation of large-scale experiments by providing direct links of bio-entities to relevant descriptions in the literature.
Annals of the New York Academy of Sciences 1158: 14-28 (2009)
0077-8923
http://hdl.handle.net/10261/153660
10.1111/j.1749-6632.2008.03750.x
1749-6632
http://dx.doi.org/10.13039/501100000780
Text mining
Systems biology
High-throughput evaluation
Text categorization
Creating Reference Datasets for Systems Biology Applications Using Text Mining