Por favor, use este identificador para citar o enlazar a este item: http://hdl.handle.net/10261/33676
COMPARTIR / EXPORTAR:
logo share SHARE logo core CORE BASE
Visualizar otros formatos: MARC | Dublin Core | RDF | ORE | MODS | METS | DIDL | DATACITE

Invitar a revisión por pares abierta
Título

High quality protein sequence alignment by combining structural profile prediction and profile alignment using SABERTOOTH

AutorTeichert, Florian; Minning, Jonas; Bastolla, Ugo CSIC ORCID; Porto, Markus
Fecha de publicación14-may-2010
EditorBioMed Central
CitaciónBMC Bioinformatics 11: 251 (2010)
Resumen[Background]: Protein alignments are an essential tool for many bioinformatics analyses. While sequence alignments are accurate for proteins of high sequence similarity, they become unreliable as they approach the so-called 'twilight zone' where sequence similarity gets indistinguishable from random. For such distant pairs, structure alignment is of much better quality. Nevertheless, sequence alignment is the only choice in the majority of cases where structural data is not available. This situation demands development of methods that extend the applicability of accurate sequence alignment to distantly related proteins. [Results]: We develop a sequence alignment method that combines the prediction of a structural profile based on the protein's sequence with the alignment of that profile using our recently published alignment tool SABERTOOTH. In particular, we predict the contact vector of protein structures using an artificial neural network based on position-specific scoring matrices generated by PSI-BLAST and align these predicted contact vectors. The resulting sequence alignments are assessed using two different tests: First, we assess the alignment quality by measuring the derived structural similarity for cases in which structures are available. In a second test, we quantify the ability of the significance score of the alignments to recognize structural and evolutionary relationships. As a benchmark we use a representative set of the SCOP (structural classification of proteins) database, with similarities ranging from closely related proteins at SCOP family level, to very distantly related proteins at SCOP fold level. Comparing these results with some prominent sequence alignment tools, we find that SABERTOOTH produces sequence alignments of better quality than those of Clustal W, T-Coffee, MUSCLE, and PSI-BLAST. HHpred, one of the most sophisticated and computationally expensive tools available, outperforms our alignment algorithm at family and superfamily levels, while the use of SABERTOOTH is advantageous for alignments at fold level. Our alignment scheme will profit from future improvements of structural profiles prediction. [Conclusions]: We present the automatic sequence alignment tool SABERTOOTH that computes pairwise sequence alignments of very high quality. SABERTOOTH is especially advantageous when applied to alignments of remotely related proteins. The source code is available at http://www.fkp.tu-darmstadt.de/sabertooth_project/ webcite, free for academic users upon request.
Descripción14 páginas, 3 figuras, 3 tablas.
Versión del editorhttp://dx.doi.org/10.1186/1471-2105-11-251
URIhttp://hdl.handle.net/10261/33676
DOI10.1186/1471-2105-11-251
ISSN1471-2105
Aparece en las colecciones: (CBM) Artículos




Ficheros en este ítem:
Fichero Descripción Tamaño Formato
1471-2105-11-251.pdf1,42 MBAdobe PDFVista previa
Visualizar/Abrir
Mostrar el registro completo

CORE Recommender

PubMed Central
Citations

6
checked on 13-abr-2024

SCOPUSTM   
Citations

18
checked on 24-abr-2024

WEB OF SCIENCETM
Citations

16
checked on 23-feb-2024

Page view(s)

348
checked on 24-abr-2024

Download(s)

254
checked on 24-abr-2024

Google ScholarTM

Check

Altmetric

Altmetric


Artículos relacionados:


NOTA: Los ítems de Digital.CSIC están protegidos por copyright, con todos los derechos reservados, a menos que se indique lo contrario.