Featured Dataverses

In order to use this feature you must have at least one published or linked dataverse.

Publish Dataverse

Are you sure you want to publish your dataverse? Once you do so it must remain published.

Publish Dataverse

This dataverse cannot be published because the dataverse it is in has not been published.

Delete Dataverse

Are you sure you want to delete your dataverse? You cannot undelete this dataverse.

Advanced Search

21 to 30 of 34 Results
Sep 13, 2021
Lippus, Pärtel; Aare, Kätlin; Malmi, Anton; Tuisk, Tuuli; Teras, Pire, 2021, "Phonetic Corpus of Estonian Spontaneous Speech v1.2", https://doi.org/10.23673/RE-293, DATADOI, V1
The Phonetic Corpus of Estonian Spontaneous Speech consists of recordings that have been annotated on different linguistic tiers including words and segments and their boundaries in the speech signal. The corpus mainly contains dialogues. The corpus can be used for studying different phonetic and linguistic research questions and for training vario...
Aug 25, 2021
Taremaa, Piia, 2021, "Data and R code for "Verbs of horizontal and vertical motion: a corpus study in Estonian"", https://doi.org/10.23673/RE-292, DATADOI, V1
Data and statistical code used in the paper "Verbs of horizontal and vertical motion: a corpus study in Estonian" (accepted by the Finnish Journal of Linguistics 2021)
Feb 10, 2021
Taremaa, Piia, 2021, "Andmekogum ja lisamaterjalid artiklile „Liikumisverbid horisontaalsel ja vertikaalsel teljel. Ühe sorteerimiskatse tulemused“ (Keel ja Kirjandus 3/2021; Piia Taremaa)", https://doi.org/10.23673/RE-272, DATADOI, V1
Admekogum artiklile „Liikumisverbid horisontaalsel ja vertikaalsel teljel. Ühe sorteerimiskatse tulemused“ (Keel ja Kirjandus 3/2021; Piia Taremaa). Andmekogumisse kuuluvad: 1) statistiline kood; 2) statistilise analüüsi aluseks olev andmetabel; 3) artiklis esitatud MDS-joonis PDFina; 4) sorteerimiskatse koondtulemuste tabel; 5) sorteerimiskatse hi...
Feb 1, 2021
Taremaa, Piia; Hint, Helen; Reile, Maria; Pajusalu, Renate, 2021, "Data and R code for "Constructional variation in Estonian: demonstrative pronouns and adverbs as determiners in noun phrases"", https://doi.org/10.23673/RE-269, DATADOI, V1
Data and R code used in the paper "Constructional variation in Estonian: demonstrative pronouns and adverbs as determiners in noun phrases" (accepted by Lingua 2021)
Sep 24, 2020
Yallop, Taremaa, & Leijen, 2020, "Distribution of categorised feedback comments (e.g. class, sub-class, and features) by feedback exchange group and by group member", https://doi.org/10.23673/REE3, DATADOI, V1
This entry contains data on the categorisation and classification of asynchronous written peer feedback comments within one doctorate writing group over a three-month period. The research data should be used in tandem with the following publication: Yallop, R. M. A., Taremaa, P. & Leijen, D. A. J (forthcoming/2020). The affect and effect of asynchr...
Sep 7, 2019
Edela, Anna, 2019, "Kodavere kihelkonnas 19. sajandil sündinud lapsed", https://doi.org/10.15155/RE-71, DATADOI, V1
Anna Edela bakalaureusetöös kasutatud andmed, mis pärinevad 19. sajandi EELK Kodavere koguduse sünnimeetrikatest, mis on üleval Eesti ajalooarhiivi Saaga andmebaasis. Need sisaldavad Kodavere kihelkonnas 1835., 1840., 1865., 1870., 1885., 1890. aastal sündinute eesnime(sid), sünnikuud, ristimiskuud, sugu, isa eesnime, ema eesnime, vaderite eesnimes...
Jun 28, 2019
Lippus, Pärtel, 2019, "Foneetikakorpuse sagedussõnastik", https://doi.org/10.15155/RE-62, DATADOI, V1
Eesti keele spontaanse kõne foneetilise korpuse sagedussõnastik on koostatud korpuse v.1.0.5 (20.06.2019, doi:10.15155/1-00-0000-0000-0000-001A3L) versiooni põhjal, kui korpuses oli märgendatud 685 750 sõna (89 tundi ja 18 minutit kõnet). Vt korpuse kohta lähemalt https://www.keel.ut.ee/et/foneetikakorpus Korpus lemmatiseeriti ESTMORF morfoloogilis...
Apr 11, 2019
Aedmaa, Eleri, 2019, "Pretrained word and multi-sense embeddings for Estonian", https://doi.org/10.15155/RE-60, DATADOI, V1
Word and multi-sense embedding for Estonian trained on lemmatized etTenTen: Corpus of the Estonian Web. Word embeddings are trained with word2vec. Sense embeddings are trained with SenseGram. Sense inventory is induced from word embeddings. Models were trained using various parameter settings. The values of architecture, number of dimensions, windo...
Nov 9, 2018
Türk, Helen; Lippus, Pärtel; Pajusalu, Karl; Teras, Pire, 2018, "Inari Saami geminates", https://doi.org/10.15155/RE-57, DATADOI, V1
Data extracted from the Inari Saami prosody corpus (http://dx.doi.org/10.15155/1-00-0000-0000-0000-00150L), used in Türk et al (2018). The Acoustic Correlates of Quantity in Inari Saami. Journal of Phonetics. Target words with six different foot structures were used: CVCV, CVCCV, CVC:CV(C), CVVCV(C), CVVCCV(C), CVVC:CVC. In total there were 1463 wo...
Oct 23, 2018
Aedmaa, Eleri, 2018, "(Non-)Literalness ratings for Estonian particle verbs", https://doi.org/10.15155/RE-56, DATADOI, V1
(Non-)literalness dataset of 1481 sentences formed with 184 Estonian particle verbs. Sentences are evaluated by 3 native speakers of Estonian on a 6-point scale [0,5] indicating the degree of compositionality of a particle verb. The first version of the dataset was introduced by Eleri Aedmaa, Maximilian Köper, Sabine Schulte im Walde (2018). Combin...
Add Data

Sign up or log in to create a dataverse or add a dataset.

Share Dataverse

Share this dataverse on your favorite social media networks.

Link Dataverse
Reset Modifications

Are you sure you want to reset the selected metadata fields? If you do this, any customizations (hidden, required, optional) you have done will no longer appear.