- Corpus of Early English Correspondence Extension Sampler part 1 (CEECES 1) (2022-04-11) Saario, Lassi; University of Helsinki-Saario, Lassi
- Corpus of Early English Correspondence Extension Sampler part 2 (CEECES 2) (2022-04-11) Saario, Lassi; University of Helsinki-Saario, Lassi
- LatinISE test data for SemEval 2020 task 1 with additional token versions of the corpora (2020-08-20) Hengchen, Simon; University of Helsinki-Hengchen, Simon
- OcWikiDisc: a Corpus of Wikipedia Talk Pages in Occitan (2022-09-14) Scherrer, Yves; University of Helsinki-Scherrer, Yves
- Tagged Corpus of Early English Correspondence Extension Sampler (TCEECES) (2022-04-11) Saario, Lassi; University of Helsinki-Saario, Lassi