3682 E. coli assemblies from NCBI

dc.contributor.affiliationUniversity of Helsinki-Jarno N. Alanko
dc.contributor.authorJarno N. Alanko
dc.date.accessioned2025-04-29T14:01:13Z
dc.date.issued2022-05-25
dc.date.issued2022-05-25
dc.descriptionThis is a dataset of 3682 E. coli assemblies downloaded from NCBI circa 2020 aiming to replicate the E. coli dataset in the paper "Succinct colored de Bruijn graphs" by Muggli et al. https://doi.org/10.1093/bioinformatics/btx067. The data is in 3682 FASTA files, one for each assembly. The uncompressed size is 18GB.
dc.identifierhttps://doi.org/10.5281/zenodo.6577997
dc.identifier.urihttps://datakatalogi.helsinki.fi/handle/123456789/4826
dc.rights.licensecc-by-4.0
dc.subjectmetagenomics
dc.subjectbioinformatics
dc.subjectecoli
dc.title3682 E. coli assemblies from NCBI
dc.typedataset