Mouse SIRV and simulated data used in the IsoQuant publication

dc.contributor.affiliationBrain and Mind Research Institute, Weill Cornell Medicine, New York, NY, USA-Hagen U. Tilgner
dc.contributor.authorHagen U. Tilgner
dc.date.accessioned2025-04-29T14:05:53Z
dc.date.issued2023-02-06
dc.descriptionThis dataset contains the main simulated and SIRV sequencing data used in the IsoQuant publication.
- Reduced mouse GENCODE vM26 annotation: 15% of expressed isoforms removed; expression profile based on real mouse brain sequencing data. Also contains a GTF with the expressed transcripts.
- Mouse PacBio simulated data: 6 million reads generated with IsoSeqSim, 1.4% error rate. Provided as a sorted BAM file; reads were mapped to the GRCm39 genome using minimap v2.18.
- Mouse ONT R10.4 simulated data: 30 million reads generated with NanoSim, 2.8% error rate. Provided as a sorted BAM file; reads were mapped to the GRCm39 genome using minimap v2.18.
- Mouse ONT R9.4 simulated data: 30 million reads generated with NanoSim, 15.9% error rate. Provided as a sorted BAM file; reads were mapped to the GRCm39 genome using minimap v2.18.
- Lexogen SIRV Set4 sequenced on ONT MinION with R10.4 chemistry. Provided as a sorted BAM file; reads were mapped to the SIRV reference using minimap v2.18. This data was used in the IsoQuant publication.
- Lexogen SIRV Set4 sequenced on ONT MinION with R9.4 chemistry, filtered >=Q12. Provided as a sorted BAM file; reads were mapped to the SIRV reference using minimap v2.18. This data was NOT used in the IsoQuant publication.
- SIRV reference genome and reduced annotation, also available at the Lexogen download page.
dc.identifierhttps://doi.org/10.5281/zenodo.7611877
dc.identifier.urihttps://datakatalogi.helsinki.fi/handle/123456789/6278
dc.rights.licensecc-by-4.0
dc.subjectRNA-Seq
dc.subjectlong-read sequencing
dc.subjectsimulated data
dc.subjectSIRV
dc.subjectONT
dc.subjectNanopore
dc.subjectPacBio
dc.titleMouse SIRV and simulated data used in the IsoQuant publication
dc.typedataset
