Here we present the samples and data used in Metz & Huber et al. (2022) to construct <strong>µSudAqua[db].</strong> <strong>µSudAqua[db]</strong> contains over 866 georeferenced samples with contextual environmental information manually revised. For its integration and validation, we constructed a curated database (<strong>µSudAqua[db.sp]</strong>) using the DADA2 pipeline (https://github.com/microsudaqua/usudaquadb). It comprised ~60% of the total georeferenced samples of the <strong>µSudAqua[db]</strong>. Here we provide five files compressed in a zip file (microsudaqua_data_V1.0_July2022.zip). The <em>microsudaqua_metadata_V1.0_July2022.tsv </em>presents the metadata associated with the samples used to build the <strong>µSudAqua[db] </strong>database. For samples included in the <strong>µSudAqua[db.sp]</strong> database, the number of high-quality reads and Amplicon Sequences Variants (ASVs) defined is also indicated. The archive <em>microsudaqua_rawtable_V1.0_July2022.tsv </em>harvests the<em> </em>number of reads in each sample (<strong>µSudAqua[db.sp]</strong>). The archive <em>microsudaqua_rawseqs_V1.0_July2022.fasta</em> harvests the nucleotide sequences of each ASV (<strong>µSudAqua[db.sp]</strong>). The archive <em>microsudaqua_rawtaxonomy_blast_silva132_nr99_V1.0_July2022.tsv </em>harvests taxonomic classification of each ASV (<strong>µSudAqua[db.sp]</strong>). The archive <em>microsudaqua_bacteria_filtered_50reads_with_taxonomy_V1.0_July2022.tsv</em> harvests the Bacterial filtered ASVs, with more than 50 reads in at least three samples (<strong>µSudAqua[db.sp]</strong>). Further information regarding data usage and processing is available in the @microsudaqua GitHub (https://github.com/microsudaqua/usudaquadb)
Tópico:
Microbial Community Ecology and Physiology
Citaciones:
1
Citaciones por año:
Altmétricas:
0
Información de la Fuente:
FuenteZenodo (CERN European Organization for Nuclear Research)