Please use this identifier to cite or link to this item: http://hdl.handle.net/10261/1439
Share/Export:
logo share SHARE logo core CORE BASE
Visualizar otros formatos: MARC | Dublin Core | RDF | ORE | MODS | METS | DIDL | DATACITE
Title

A sentence sliding window approach to extract protein annotations from biomedical articles

AuthorsKrallinger, Martin; Padron, Maria; Valencia, Alfonso
Issue Date24-May-2005
PublisherBioMed Central
CitationBMC Bioinformatics 2005, 6(Suppl 1):S19
Abstract[Background] Within the emerging field of text mining and statistical natural language processing (NLP) applied to biomedical articles, a broad variety of techniques have been developed during the past years. Nevertheless, there is still a great ned of comparative assessment of the performance of the proposed methods and the development of common evaluation criteria. This issue was addressed by the Critical Assessment of Text Mining Methods in Molecular Biology (BioCreative) contest. The aim of this contest was to assess the performance of text mining systems applied to biomedical texts including tools which recognize named entities such as genes and proteins, and tools which automatically extract protein annotations.
[Results] The "sentence sliding window" approach proposed here was found to efficiently extract text fragments from full text articles containing annotations on proteins, providing the highest number of correctly predicted annotations. Moreover, the number of correct extractions of individual entities (i.e. proteins and GO terms) involved in the relationships used for the annotations was significantly higher than the correct extractions of the complete annotations (protein-function relations).
[Conclusion] We explored the use of averaging sentence sliding windows for information extraction, especially in a context where conventional training data is unavailable. The combination of our approach with more refined statistical estimators and machine learning techniques might be a way to improve annotation extraction for future biomedical text mining applications.
DescriptionFrom A critical assessment of text mining methods in molecular biology
URIhttp://hdl.handle.net/10261/1439
DOI10.1186/1471-2105-6-S1-S19
ISSN1471-2105
Appears in Collections:(CNB) Artículos

Files in This Item:
File Description SizeFormat
1471-2105-6-S1-S19.pdf373,2 kBAdobe PDFThumbnail
View/Open
Show full item record
Review this work

PubMed Central
Citations

6
checked on May 15, 2022

SCOPUSTM   
Citations

14
checked on May 16, 2022

WEB OF SCIENCETM
Citations

12
checked on May 14, 2022

Page view(s)

354
checked on May 16, 2022

Download(s)

181
checked on May 16, 2022

Google ScholarTM

Check

Altmetric

Dimensions


Related articles:


WARNING: Items in Digital.CSIC are protected by copyright, with all rights reserved, unless otherwise indicated.