cc-by-4.0Taipale, Jussi2025-04-292021-07-162021-07-16https://datakatalogi.helsinki.fi/handle/123456789/5843This record contains the training, test and validation datasets used to train and evaluate the machine learning models in manuscript: Sahu, Biswajyoti, et al. "Sequence determinants of human gene regulatory elements." (2021). This record contains also the final hyperparameter-optimized models for each training dataset/task combination described in the manuscript. The README-files provided with the record describe the datasets and models in more detail. The datasets deposited here are derived from the original raw data (GEO accession: GSE180158) as described in the Methods of the manuscript.Gene regulationSTARR-seqDeep learningConvolutional Neural NetworksMachine learningMachine learning models, and training, validation and test datasets for: "Sequence determinants of human gene regulatory elements"dataset