English   español  
Please use this identifier to cite or link to this item: http://hdl.handle.net/10261/88836
Share/Impact:
Statistics
logo share SHARE logo core CORE   Add this article to your Mendeley library MendeleyBASE

Visualizar otros formatos: MARC | Dublin Core | RDF | ORE | MODS | METS | DIDL | DATACITE
Exportar a otros formatos:

Title

GET_HOMOLOGUES, a versatile software package for scalable and robust microbial pangenome analysis

AuthorsContreras-Moreira, Bruno CSIC ORCID ; Vinuesa, Pablo
Issue Date2013
PublisherAmerican Society for Microbiology
CitationContreras-Moreira B, Vinuesa P. GET_HOMOLOGUES, a versatile software package for scalable and robust microbial pangenome analysis. Applied and Environmental Microbiology 79 (24): 7696-7701 (2013)
AbstractGET_HOMOLOGUES is an open-source software package that builds on popular orthology-calling approaches making highly customizable and detailed pangenome analyses of microorganisms accessible to nonbioinformaticians. It can cluster homologous gene families using the bidirectional best-hit, COGtriangles, or OrthoMCL clustering algorithms. Clustering stringency can be adjusted by scanning the domain composition of proteins using the HMMER3 package, by imposing desired pairwise alignment coverage cutoffs, or by selecting only syntenic genes. The resulting homologous gene families can be made even more robust by computing consensus clusters from those generated by any combination of the clustering algorithms and filtering criteria. Auxiliary scripts make the construction, interrogation, and graphical display of core genome and pangenome sets easy to perform. Exponential and binomial mixture models can be fitted to the data to estimate theoretical core genome and pangenome sizes, and high-quality graphics can be generated. Furthermore, pangenome trees can be easily computed and basic comparative genomics performed to identify lineage-specific genes or gene family expansions. The software is designed to take advantage of modern multiprocessor personal computers as well as computer clusters to parallelize time-consuming tasks. To demonstrate some of these capabilities, we survey a set of 50 Streptococcus genomes annotated in the Orthologous Matrix (OMA) browser as a benchmark case. The package can be downloaded at http://www.eead.csic.es/compbio/soft/gethoms.php and http://maya.ccg.unam.mx/soft/gethoms.php.
Description25 Págs., 3 Figs. The definitive version is available at: http://aem.asm.org/content/79/24/7696
Publisher version (URL)http://dx.doi.org/10.1128/AEM.02411-13
URIhttp://hdl.handle.net/10261/88836
DOIhttp://dx.doi.org/10.1128/AEM.02411-13
ISSN0099-2240
E-ISSN1098-5336
Appears in Collections:(EEAD) Artículos
Files in This Item:
File Description SizeFormat 
ContrerasB_Appl Environ Microbiol_2013_79.pdf275,46 kBAdobe PDFThumbnail
View/Open
Show full item record
Review this work
 

Related articles:


WARNING: Items in Digital.CSIC are protected by copyright, with all rights reserved, unless otherwise indicated.