Finnish Wikipedia 2017, Korp

Restricted Availability

Creator/contributor

Tatu Huovilainen

Access rights

Open

Description

The Finnish Wikipedia 2017 corpus will be available in the concordance tool Korp. The corpus contains all Finnish-language articles of the online encyclopedia Wikipedia as available on 1 January 2018. The article texts were extracted from [Wikipedia Dumps](https://dumps.wikimedia.org/) with [WikiExtractor](https://github.com/attardi/wikiextractor). The corpus has been tokenized and annotated with morpho-syntactic analyses produced by the [Turku Dependency Parser](http://turkunlp.github.io/Finnish-dep-parser/).
