Language Technology Approach to "Seeing" in Akkadian
No Thumbnail Available
Restricted Availability
Date
2021-09-30, 2021-09-30
Persistent identifier of the Data Catalogue metadata
Creator/contributor
Editor
Journal title
Journal volume
Publisher
Publication Type
dataset
Peer Review Status
Repositories
Access rights
ISBN
ISSN
Description
Verbs of seeing in Akkadian
This repository contains scripts and data for the paper "Language Technology Approach to Seeing in Akkadian".
/data
aug18-nolex.txt
Lemmatized dataset from Oracc
results-pmi2-top50.log
Script parameters for pmizer (see https://github.com/asahala/Pmizer)
results-pmi2-top50.tsv
Results in .tsv format. Fields in the file:
keyword
translation from Oracc
collocate
translation from Oracc
period distribution
genre distribution
period and genre distribution
keyword freq
collocate freq
co-occurrence freq
PMI2 score
average distance between keyword and collocate (in words)
url to Korp (all links may not return results, as Korp Oracc had a major update in 2019: see https://www.kielipankki.fi/corpora/oracc/ for more info and user guide). Note that the co-occurrence of words (a, b) is symmetric, meaning that (a, b) == (b, a). Thus, if you search results in Korp using the links and do not get any results, you may have to switch the search boxes in reverse order.
period/genre-distribution-matrix.tsv
Distribution of seeing verbs in different genres and periods as a matrix representation