Data for "A corpus study of grammatical case forms in written and spoken Estonian: Frequency, distribution and grammatical role"

Vihman, Virve-Anneli; Miljan, Merilin

Vihman, Virve-Anneli; Miljan, Merilin

Nimi	Suurus	Kirjeldus
README.txt	4.655Kb	README-file
Nouns_23-10-19.csv	299.0Kb	Coded corpus data: Nominative, Genitive, Partitive Nouns - version 2
Nouns_23-08-20.csv	296.5Kb	Coded corpus data: Nominative, Genitive, Partitive Nouns - version 1
Raw_data.pdf	147.2Kb	Additional material with tables and raw figures

Kuupäev

2023

URI

https://datadoi.ee/handle/33/567
http://dx.doi.org/10.23673/re-429

Metaandmed

Näita täielikku nimetuse kirjet

Kokkuvõte

This dataset makes available the sample of clauses used in the study "A corpus study of grammatical case forms in written and spoken Estonian: Frequency, distribution and grammatical role". It includes 751 clauses from the fiction subcorpus of the University of Tartu’s Balanced Corpus of Written Estonian (cl.ut.ee/korpused) and 758 clauses from the Corpus of Spoken Estonian, maintained by the University of Tartu’s research group of Spoken Estonian (not publicly available at the time of publicatiion). The spoken language selection derives from a subset of everyday (face-to-face and telephone) conversations. The dataset includes both the randomly selected clauses and manual coding, described in the paper.... Rohkem Vähem

Märksõna

Case-marking; Nouns; Written and spoken corpus

Kirje tüüp

info:eu-repo/semantics/dataset

Kollektsioonid

Eesti ja üldkeeleteaduse andmed

Selle nimetusega on seotud järgmised litsentsifailid:

Creative Commons

Kui pole teisiti märgitud, kirjeldatakse litsentsi järgnevalt info:eu-repo/semantics/openAccess