Finnish Wikipedia 2017, source

dc.contributor.affiliation: University of Helsinki - Tatu Huovilainen
dc.contributor.author: Tatu Huovilainen
dc.date.accessioned: 2025-04-29T12:54:59Z
dc.description: The Finnish Wikipedia 2017 source material corpus is available for download. The corpus contains all Finnish articles from the online encyclopedia Wikipedia that were available on 1 January 2018. The text of the articles was extracted from [Wikipedia Dumps](https://dumps.wikimedia.org/) with [WikiExtractor](https://github.com/attardi/wikiextractor). The corpus has been tokenized and annotated with morpho-syntactic analyses produced with the [Turku Dependency Parser](http://turkunlp.github.io/Finnish-dep-parser/). License: CC BY https://creativecommons.org/licenses/by/4.0/
dc.discipline: Languages
dc.identifier: http://urn.fi/urn:nbn:fi:lb-2019110803
dc.identifier.uri: https://datakatalogi.helsinki.fi/handle/123456789/518
dc.language: Finnish
dc.rights: Open
dc.rights.license: Creative Commons Attribution 4.0 International (CC BY 4.0)
dc.rights.uri: https://creativecommons.org/licenses/by/4.0/
dc.title: Finnish Wikipedia 2017, source

Files