Please use this identifier to cite or link to this item: http://hdl.handle.net/10261/345338

DC Field | Value | Language
dc.contributor.author | Sánchez Fernández, Pablo | es_ES
dc.contributor.author | Coutinho, Felipe Hernandes | es_ES
dc.contributor.author | Sebastián, Marta | es_ES
dc.contributor.author | Pernice, Massimo | es_ES
dc.contributor.author | Rodríguez-Martínez, Raquel | es_ES
dc.contributor.author | Salazar, Guillem | es_ES
dc.contributor.author | Cornejo-Castillo, Francisco M. | es_ES
dc.contributor.author | Pesant, Stéphane | es_ES
dc.contributor.author | López Alforja, Xabier | es_ES
dc.contributor.author | López-García, Ester-María | es_ES
dc.contributor.author | Agustí, Susana | es_ES
dc.contributor.author | Gojobori, Takashi | es_ES
dc.contributor.author | Logares, Ramiro | es_ES
dc.contributor.author | Sala, M. Montserrat | es_ES
dc.contributor.author | Vaqué, Dolors | es_ES
dc.contributor.author | Massana, Ramon | es_ES
dc.contributor.author | Duarte, Carlos M. | es_ES
dc.contributor.author | Acinas, Silvia G. | es_ES
dc.contributor.author | Gasol, Josep M. | es_ES
dc.date.accessioned | 2024-02-05T11:19:13Z | -
dc.date.available | 2024-02-05T11:19:13Z | -
dc.date.issued | 2024-02 | -
dc.identifier.citation | Scientific Data 11: 154 (2024) | es_ES
dc.identifier.other | CEX2019-000940-M | -
dc.identifier.uri | http://hdl.handle.net/10261/345338 | -
dc.description | 12 pages, 6 figures, supplementary information https://doi.org/10.1038/s41597-024-02974-1
Data Records: All sequencing products described here, as well as the primary metagenome assemblies, can be found under BioProject accession number PRJEB52452 hosted by the European Nucleotide Archive [28]. ENA accession numbers for each metagenome sequencing run and for each MEGAHIT assembly are provided in Supplementary Tables 1 and 4, respectively.
File 1: 17,425,759 non-redundant coding DNA sequences (gene catalog) can be found in file MP-GeneDB-VP.fasta.gz [23].
File 2: Prokka annotation for each CDS from the gene catalog, plus annotations for PFAM, KEGG-KO, CAZy and lowest common ancestor taxonomy, can be found in file MP-GeneDB-VP-annotation-enhanced.tsv.gz [23].
File 3: 16S rRNA mTAG-based OTU table of the 76 metagenomes can be found in file mp-mtags.otu.tsv [23].
File 4: Counts of reads from each metagenome mapping to the gene catalog can be found in file MP-GeneDB-VP-raw-counts.tbl.gz [23].
File 5: Counts of reads from each metagenome mapping to the gene catalog, normalized by gene length, can be found in file MP-GeneDB-VP-length-norm-counts.tbl.gz [23].
File 6: Counts of reads from each metagenome mapping to the gene catalog annotated to COGs, normalized by gene length and 10 universal single-copy COGs, can be found in file MP-GeneDB-VP-length-norm-scgNorm-counts-cog.tbl.gz [23].
File 7: Counts of reads from each metagenome mapping to the gene catalog annotated to KEGG KOs, normalized by gene length and 10 universal single-copy KOs, can be found in file MP-GeneDB-VP-length-norm-scgNorm-counts-ko.tbl.gz [23].
File 8: Counts of reads from each metagenome mapping to the gene catalog, normalized by gene length and aggregated per COG, can be found in file MP-GeneDB-VP-length-norm-cog.tbl.gz [23].
File 9: Counts of reads from each metagenome mapping to the gene catalog, normalized by gene length and aggregated per KO, can be found in file MP-GeneDB-VP-length-norm-ko.tbl.gz [23].
File 10: Counts of reads from each metagenome mapping to the gene catalog, normalized by gene length and aggregated per PFAM, can be found in file MP-GeneDB-VP-length-norm-pfam.tbl.gz [23].
File 11: Counts of reads from each metagenome mapping to the gene catalog, normalized by gene length and aggregated per CAZy, can be found in file MP-GeneDB-VP-length-norm-cazy.tbl.gz [23].
File 12: FASTA sequences for the 2,672 MAGs with estimated genome completeness above 50% and contamination below 5% can be found in file Malaspina-VP-MAGs.tar.gz [23].
File 13: Functional annotation of each MAG can be found in file Malaspina-VP-MAGs_CDS-annotation.tsv.gz [23].
File 14: Amino acid sequences of predicted genes in the MAG sequences can be found in file Malaspina-VP-MAGs_CDS.faa.gz [23].
File 15: Nucleotide sequences of predicted genes in the MAG sequences can be found in file Malaspina-VP-MAGs_CDS.fna.gz [23].
File 16: Viral genomic sequences can be found in file Malaspina_Profiles_Viruses_Genomic_Sequences.fasta.gz [23].
File 17: Descriptive information on the viral genomic sequences can be found in file Malaspina_Profiles_Viruses_Genomic_Info.tsv [23].
File 18: Virus-derived coding DNA sequences can be found in file Malaspina_Profiles_Viruses_CDS_Sequences.fna.gz [23].
File 19: Annotation information for the protein-encoding genes predicted in the viral genomic sequences can be found in file Malaspina_Profiles_Viruses_PEG_Annotation_Info.tsv [23].
Underway and meteorological data measured on board R/V Hespérides for all 7 legs of the Malaspina Expedition 2010 are available from the Marine Technology Unit (UTM, CSIC).
Code availability: All the software used to process the data set presented here is publicly available and distributed by its developers. All versions have been specified in the main text, along with the options used when departing from defaults. Custom scripts used in intermediate or summarizing steps are available at https://gitlab.com/malaspina-public/picoplankton-vertical-profiles. Code for the bin decontamination step can be found at https://github.com/felipehcoutinho/QueroBins | es_ES
dc.description.abstract | The Ocean microbiome has a crucial role in Earth’s biogeochemical cycles. During the last decade, global cruises such as Tara Oceans and the Malaspina Expedition have expanded our understanding of the diversity and genetic repertoire of marine microbes. Nevertheless, there are still knowledge gaps regarding their diversity patterns throughout depth gradients ranging from the surface to the deep ocean. Here we present a dataset of 76 microbial metagenomes (MProfile) of the picoplankton size fraction (0.2–3.0 µm) collected in 11 vertical profiles covering contrasting ocean regions sampled during the Malaspina Expedition circumnavigation (7 depths, from surface to 4,000 m deep). The MProfile dataset produced 1.66 Tbp of raw DNA sequences from which we derived: 17.4 million genes clustered at 95% sequence similarity (M-GeneDB-VP), 2,672 metagenome-assembled genomes (MAGs) of Archaea and Bacteria (Malaspina-VP-MAGs), and over 100,000 viral genomic sequences. This dataset will be a valuable resource for exploring the functional and taxonomic connectivity between the photic and bathypelagic tropical and sub-tropical ocean, while increasing our general knowledge of the Ocean microbiome. | es_ES
dc.description.sponsorship | This work was funded by the Spanish Ministry of Economy and Competitiveness (MINECO) through the Consolider-Ingenio program (Malaspina 2010 Expedition, ref. CSD2008-00077). The sequencing of 76 metagenomes from 11 vertical profiles was funded by project Malaspinomics and Malaspina-analytics (CTM2011-15461-E) awarded to C.M.D. by the Spanish Ministry of Economy and Competitiveness. Additional funding was provided by the project MAGGY (CTM2017-87736-R) to S.G.A. from the Spanish Ministry of Economy and Competitiveness, Grup de Recerca 2017SGR/1568 from Generalitat de Catalunya, and King Abdullah University of Science and Technology (KAUST) under contract OSR #3362. The ICM researchers have had the institutional support of the “Severo Ochoa Centre of Excellence” accreditation (CEX2019-000928-S) funded by AEI 10.13039/501100011033. R.R-M. thanks CeBiB FB0001 for support. | es_ES
dc.language.iso | eng | es_ES
dc.publisher | Nature Publishing Group | es_ES
dc.relation | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2013-2016/CTM2017-87736-R/ES/RECONSTRUCCION DE GENOMAS MICROBIANOS MARINOS MEDIANTE METAGENOMICA, GENOMICA DE CELULAS INDIVIDUALES Y CULTIVOS/ | es_ES
dc.relation.isversionof | Publisher's version | es_ES
dc.rights | openAccess | es_ES
dc.title | Marine picoplankton metagenomes and MAGs from eleven vertical profiles obtained by the Malaspina Expedition | es_ES
dc.type | article | es_ES
dc.identifier.doi | 10.1038/s41597-024-02974-1 | -
dc.description.peerreviewed | Peer reviewed | es_ES
dc.relation.publisherversion | https://doi.org/10.1038/s41597-024-02974-1 | es_ES
dc.identifier.e-issn | 2052-4463 | -
dc.rights.license | https://creativecommons.org/licenses/by/4.0/ | es_ES
dc.contributor.funder | Ministerio de Economía y Competitividad (España) | es_ES
dc.contributor.funder | Generalitat de Catalunya | es_ES
dc.contributor.funder | King Abdullah University of Science and Technology | es_ES
dc.contributor.funder | Agencia Estatal de Investigación (España) | es_ES
dc.relation.csic |  | es_ES
oprm.item.hasRevision | no ko 0 false | *
dc.identifier.funder | http://dx.doi.org/10.13039/501100004052 | es_ES
dc.identifier.funder | http://dx.doi.org/10.13039/501100011033 | es_ES
dc.identifier.funder | http://dx.doi.org/10.13039/501100002809 | es_ES
dc.identifier.funder | http://dx.doi.org/10.13039/501100003329 | es_ES
dc.type.coar | http://purl.org/coar/resource_type/c_6501 | es_ES
dc.subject.sdg | Conserve and sustainably use the oceans, seas and marine resources for sustainable development | es_ES
item.cerifentitytype | Publications | -
item.languageiso639-1 | en | -
item.openairecristype | http://purl.org/coar/resource_type/c_18cf | -
item.grantfulltext | open | -
item.fulltext | With Fulltext | -
item.openairetype | article | -
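
As an illustration of the normalizations named in dc.description (Files 5 to 7), here is a minimal Python sketch. It assumes that length normalization means reads per kilobase of gene, and that the single-copy-gene step divides each aggregated abundance by the median abundance of the marker groups. The record does not state either formula, so both are assumptions. The gene names, KO labels and counts are made-up toy values, not data from the gene catalog.

```python
from statistics import median


def length_normalize(raw_counts, gene_lengths_bp):
    """Divide each gene's read count by its length in kilobases (assumed formula)."""
    return {
        gene: count / (gene_lengths_bp[gene] / 1000.0)
        for gene, count in raw_counts.items()
    }


def aggregate(per_gene_values, gene_to_group):
    """Sum per-gene values within each functional group; unannotated genes are skipped."""
    totals = {}
    for gene, value in per_gene_values.items():
        group = gene_to_group.get(gene)
        if group is not None:
            totals[group] = totals.get(group, 0.0) + value
    return totals


def scg_normalize(group_abundance, marker_groups):
    """Divide every group by the median abundance of the marker groups (assumed formula)."""
    marker_values = [group_abundance[g] for g in marker_groups if g in group_abundance]
    if not marker_values:
        raise ValueError("none of the marker groups are present in this sample")
    divisor = median(marker_values)
    if divisor == 0:
        raise ValueError("median marker abundance is zero")
    return {group: value / divisor for group, value in group_abundance.items()}


# Hypothetical toy sample: read counts, gene lengths (bp) and KO annotations.
raw_counts = {"gene_a": 200, "gene_b": 50, "gene_c": 150, "gene_d": 30}
gene_lengths_bp = {"gene_a": 2000, "gene_b": 500, "gene_c": 3000, "gene_d": 1500}
gene_to_ko = {"gene_a": "KO_MARKER_1", "gene_b": "KO_MARKER_2", "gene_c": "KO_OTHER"}

per_kb = length_normalize(raw_counts, gene_lengths_bp)
ko_abundance = aggregate(per_kb, gene_to_ko)
normalized = scg_normalize(ko_abundance, ["KO_MARKER_1", "KO_MARKER_2"])
print(normalized)
```

Dividing by a single-copy marker abundance expresses each function roughly as copies per genome equivalent, which makes samples with different sequencing depths comparable. The median is used here only because it is robust to one outlying marker; the published tables may use a different summary.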
Appears in collections: (ICM) Artículos
Files in this item:
File | Description | Size | Format
Sanchez_et_al_2024.pdf |  | 3.1 MB | Adobe PDF
Sanchez_et_al_2024_suppl.xlsx |  | 337.65 kB | Microsoft Excel XML

Page views: 97 (checked on 16 May 2024)
Downloads: 104 (checked on 16 May 2024)

This item is licensed under a Creative Commons License.