Please use this identifier to cite or link to this item: http://hdl.handle.net/10261/23116
Title

Moara: a Java library for extracting and normalizing gene and protein mentions

Authors: Neves, Mariana L.; Carazo, José M.; Pascual-Montano, Alberto
Publication date: 26 March 2010
Publisher: BioMed Central
Citation: BMC Bioinformatics 11: 157 (2010)
Abstract: [Background] Gene/protein recognition and normalization are important preliminary steps for many biological text mining tasks, such as information retrieval, extraction of protein-protein interactions, and extraction of semantic information, among others. Although these problems have received considerable attention and effective solutions have been reported, easily integrated tools to perform these tasks are not readily available.
[Results] This study proposes a versatile and trainable Java library that implements gene/protein tagging and normalization steps based on machine learning approaches. The system has been trained for several model organisms and corpora but can be expanded to support new organisms and documents.
[Conclusions] Moara is a flexible, trainable and open-source system that is not specifically orientated to any organism and therefore does not require specific tuning of the algorithms or dictionaries utilized. Moara can be used as a stand-alone application or can be incorporated in the workflow of a more general text mining system.
Description: 13 pages, 4 figures, 2 tables.-- Software.
Publisher's version: http://dx.doi.org/10.1186/1471-2105-11-157
URI: http://hdl.handle.net/10261/23116
DOI: 10.1186/1471-2105-11-157
ISSN: 1471-2105
Appears in collections: (CNB) Artículos
Files in this item:
File: 1471-2105-11-157.pdf
Size: 3.22 MB
Format: Adobe PDF
NOTE: Items in Digital.CSIC are protected by copyright, with all rights reserved, unless otherwise indicated.