Finnish OpenSubtitles 2017, Kielipankki Korp Version

dc.contributor.affiliationUniversity of Helsinki - Tatu Huovilainen
dc.contributor.authorTatu Huovilainen
dc.date.accessioned2025-04-29T12:54:59Z
dc.descriptionThe corpus is available in Kielipankki through the Interface Korp. The corpus contains Finnish subtitles for movies and TV-series from http://www.opensubtitles.org/ The corpus is a derivative of the [OPUS OpenSubtitles2018](http://opus.nlpl.eu/OpenSubtitles2018.php) multilingual corpus. Information on the material processing up to sentence splitting can be found in the original publication Lison & Tiedemann (2016). The corpus has been tokenized and annotated with morpho-syntactic analysis produced with the [Turku Dependency Parser](http://turkunlp.github.io/Finnish-dep-parser/). P. Lison and J. Tiedemann, 2016, OpenSubtitles2016: Extracting Large Parallel Corpora from Movie and TV Subtitles. In Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016)
dc.disciplineLanguages
dc.identifierhttp://urn.fi/urn:nbn:fi:lb-2018060403
dc.identifier.urihttps://datakatalogi.helsinki.fi/handle/123456789/517
dc.languageFinnish
dc.rightsOpen
dc.rights.licenseCreative Commons Attribution 4.0 International (CC BY 4.0)
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.titleFinnish OpenSubtitles 2017, Kielipankki Korp Version