Por favor, use este identificador para citar o enlazar a este item: http://hdl.handle.net/10261/33667
COMPARTIR / EXPORTAR:
logo share SHARE logo core CORE BASE
Visualizar otros formatos: MARC | Dublin Core | RDF | ORE | MODS | METS | DIDL | DATACITE

Invitar a revisión por pares abierta
Campo DC Valor Lengua/Idioma
dc.contributor.authorTamames, Javier-
dc.contributor.authorLorenzo, Víctor de-
dc.date.accessioned2011-03-22T11:28:51Z-
dc.date.available2011-03-22T11:28:51Z-
dc.date.issued2010-06-01-
dc.identifier.citationBMC Bioinformatics 11: 294 (2010)es_ES
dc.identifier.issn1471-2105-
dc.identifier.urihttp://hdl.handle.net/10261/33667-
dc.description10 páginas, 2 figuras, 5 tablas.-- Metodología.es_ES
dc.description.abstract[Background]: For ecological studies, it is crucial to count on adequate descriptions of the environments and samples being studied. Such a description must be done in terms of their physicochemical characteristics, allowing a direct comparison between different environments that would be difficult to do otherwise. Also the characterization must include the precise geographical location, to make possible the study of geographical distributions and biogeographical patterns. Currently, there is no schema for annotating these environmental features, and these data have to be extracted from textual sources (published articles). So far, this had to be performed by manual inspection of the corresponding documents. To facilitate this task, we have developed EnvMine, a set of text-mining tools devoted to retrieve contextual information (physicochemical variables and geographical locations) from textual sources of any kind. [Results]: EnvMine is capable of retrieving the physicochemical variables cited in the text, by means of the accurate identification of their associated units of measurement. In this task, the system achieves a recall (percentage of items retrieved) of 92% with less than 1% error. Also a Bayesian classifier was tested for distinguishing parts of the text describing environmental characteristics from others dealing with, for instance, experimental settings. Regarding the identification of geographical locations, the system takes advantage of existing databases such as GeoNames to achieve 86% recall with 92% precision. The identification of a location includes also the determination of its exact coordinates (latitude and longitude), thus allowing the calculation of distance between the individual locations. [Conclusion]: EnvMine is a very efficient method for extracting contextual information from different text sources, like published articles or web pages. This tool can help in determining the precise location and physicochemical variables of sampling sites, thus facilitating the performance of ecological analyses. EnvMine can also help in the development of standards for the annotation of environmental features.es_ES
dc.description.sponsorshipThis work was supported by project generous grants from the Spanish Ministry of Science and Innovation (CONSOLIDER), the 7 th Framework programme of the European Union (Projects BACSINE and MICROME), and funds from Comunidad de Madrid, Spain.es_ES
dc.language.isoenges_ES
dc.publisherBioMed Centrales_ES
dc.relation.isversionofPublisher's version-
dc.rightsopenAccesses_ES
dc.titleEnvMine: A text-mining system for the automatic extraction of contextual informationes_ES
dc.typeartículoes_ES
dc.identifier.doi10.1186/1471-2105-11-294-
dc.description.peerreviewedPeer reviewedes_ES
dc.relation.publisherversionhttp://dx.doi.org/10.1186/1471-2105-11-294es_ES
dc.identifier.pmid20515448-
dc.type.coarhttp://purl.org/coar/resource_type/c_6501es_ES
item.openairetypeartículo-
item.cerifentitytypePublications-
item.languageiso639-1en-
item.grantfulltextopen-
item.openairecristypehttp://purl.org/coar/resource_type/c_18cf-
item.fulltextWith Fulltext-
Aparece en las colecciones: (CNB) Artículos
Ficheros en este ítem:
Fichero Descripción Tamaño Formato
1471-2105-11-294.pdf1,19 MBAdobe PDFVista previa
Visualizar/Abrir
Show simple item record

CORE Recommender

PubMed Central
Citations

7
checked on 03-ene-2024

SCOPUSTM   
Citations

19
checked on 23-mar-2024

WEB OF SCIENCETM
Citations

15
checked on 22-feb-2024

Page view(s)

385
checked on 28-mar-2024

Download(s)

273
checked on 28-mar-2024

Google ScholarTM

Check

Altmetric

Altmetric


Artículos relacionados:


NOTA: Los ítems de Digital.CSIC están protegidos por copyright, con todos los derechos reservados, a menos que se indique lo contrario.