Udmurt spatial cases dataset

2024, 2024
dataset
This is a dataset and accompanying R code for studying the variation between the two source cases (elative and egressive) and goal cases (illative and terminative). The data folder include four files including data for analysing the effect of properties of the Landmark and the associated verb form on the choice between the source and goal cases. The script folder contains four R markdown files, which contain the code to analyse the four data files. The data set is manually annotated for various variables that are hypotethised to affect the choice between the pairs of cases. Not all the variables are used in the analysis presented.  In V2.0 the R code is heavily changed. Only the code in files R_goal3 and R_source3 should be used in analysis. The data files are the same in both versions, but in V2.0 they have been checked and some minor mistakes have been fixed. Therefore, only the data files in the new version should be used if replicationg the analysis represented by the code in files R_goal3 and R_source3. The author of this dataset is Riku Erkkilä and it is published under CC-BY-NC-ND licence. If used in a publication, please refer to this publication as well as mention the original source: Arkhangelskiy, Timofey 2018: Udmurt corpus. http://udmurt.web-corpora.net/index.html. The R packages used in the analysis are:Hothorn, Torsten; Hornik, Kurt; Strobl, Carolin & Zeileis, Achim 2023: Party 1.3-13. https://CRAN.R-project.org/package=party.Wickham, Hadley 2023: Tidyverse 2.0.0. https://CRAN.R-project.org/package=tidyverse.