Featured Dataverses

In order to use this feature you must have at least one published or linked dataverse.

Publish Dataverse

Are you sure you want to publish your dataverse? Once you do so it must remain published.

Publish Dataverse

This dataverse cannot be published because the dataverse it is in has not been published.

Delete Dataverse

Are you sure you want to delete your dataverse? You cannot undelete this dataverse.

Advanced Search

11 to 20 of 34 Results
Jun 11, 2024
Malmi, Anton; Leppik, Katrin, 2024, "Vene emakeelega õppijate häälduskorpus", https://doi.org/10.23673/RE-467, DATADOI, V1
Projekti raames koguti erineva keeletaseme ja -taustaga eesti keele õppijatelt häälduskorpus. Salvestused korpuse jaoks tehti enne ja pärast hääldustreeningut ning hääldustreeningu ajal. Hääldustreeninguks kasutati mobiilirakendust SayEst. Pärast andmete kogumist transkribeeriti salvestused häälikutasandini ning andmestikku kasutatati eesti keele t...
Feb 22, 2024
Vihman, Virve-Anneli; Pilvik, Maarja-Liisa; Mandel, Aive; Kängsepp, Annika; Aigro, Mari; Koreinik, Kadri; Praakli, Kristiina; Lindström, Liina, 2024, "Estonian Teen Language Corpus", https://doi.org/10.23673/RE-455, DATADOI, V1
Estonian Teen Language Corpus (Eesti teismeliste keele korpus) is a corpus representing spoken and written language data, collected from Estonian teenagers (ages 9-18) between 2019-2023. The corpus consists of four types of files. Spoken language data is represented by .eaf and .tsv files (spoken_eaf.zip, spoken_tsv.zip), and contain transcriptions...
Oct 23, 2023
Lippus, Pärtel; Aare, Kätlin; Malmi, Anton; Tuisk, Tuuli; Teras, Pire, 2023, "Phonetic Corpus of Estonian Spontaneous Speech v1.3", https://doi.org/10.23673/RE-438, DATADOI, V1
The Phonetic Corpus of Estonian Spontaneous Speech consists of recordings that have been annotated on different linguistic tiers including words and segments and their boundaries in the speech signal. The corpus mainly contains dialogues. The corpus can be used for studying different phonetic and linguistic research questions and for training vario...
Aug 21, 2023
Vihman, Virve-Anneli; Miljan, Merilin, 2023, "Data for "A corpus study of grammatical case forms in written and spoken Estonian: Frequency, distribution and grammatical role"", https://doi.org/10.23673/RE-429, DATADOI, V1
This dataset makes available the sample of clauses used in the study "A corpus study of grammatical case forms in written and spoken Estonian: Frequency, distribution and grammatical role". It includes 751 clauses from the fiction subcorpus of the University of Tartu’s Balanced Corpus of Written Estonian (cl.ut.ee/korpused) and 758 clauses from the...
Apr 20, 2023
Veismann, Ann; Proos, Mariann; Taremaa, Piia, 2023, "Andmed ja R-i kood artiklile "Kas moos ja buss seisavad endiselt? "seisma"-verbi polüseemia ja seismise kehaline kogemus"", https://doi.org/10.23673/RE-403, DATADOI, V1
See andmekogu sisaldab kahe keeleteadusliku katse toorandmeid ja puhastatud andmeid, katsete tulemustel põhineb artikkel "Kas moos ja buss seisavad endiselt? "seisma"-verbi polüseemia ja seismise kehaline kogemus". Samuti on andmekogusse lisatud statistiliseks analüüsiks kasutatud R-i kood.
Nov 23, 2022
Lindström, Liina; Todesk, Triin; Pilvik, Maarja-Liisa, 2022, "Eesti murrete korpus", https://doi.org/10.23673/RE-365, DATADOI, V1
Eesti murrete korpus on kõiki eesti murdeid hõlmav elektrooniline andmekogu. Korpus koosneb helisalvestistest, foneetilises transkriptsioonis murdetekstidest, lihtsustatud transkriptsioonis murdetekstidest, morfoloogiliselt märgendatud tekstidest, süntaktiliselt märgendatud tekstidest ja metaandmetest. Selles repositooriumis on kättesaadavaks tehtu...
Nov 17, 2022
Taremaa, Piia; Kopecka, Anetta, 2022, "Data and R code for 'Speed and space' (Taremaa & Kopecka)", https://doi.org/10.23673/RE-364, DATADOI, V1
Data and statistical code used in the paper "Speed and space: semantic asymmetries in motion descriptions in Estonian" (published in Cognitive Linguistics; Ahead of Print, published online 8 December 2022)
Oct 20, 2022
Taremaa, Piia; Kiik, Johanna; Toots, Leena Karin; Veismann, Ann, 2022, "Data and R code for 'Speed as a dimension of manner in Estonian frog stories' (Taremaa et al.)", https://doi.org/10.23673/RE-360, DATADOI, V1
Data and statistical code used in the paper "Speed as a dimension of manner in Estonian frog stories" (accepted by the Journal of Nordic Linguistics in 2022)
Nov 3, 2021
Taremaa, Piia; Kopecka, Anetta, 2021, "Data and R code for "Manner of motion in Estonian: A descriptive account of speed"", https://doi.org/10.23673/RE-296, DATADOI, V1
Data and statistical code used in the paper "Manner of motion in Estonian: A descriptive account of speed" (accepted by the Studies in Language in 2021). Authors of the paper: Piia Taremaa and Anetta Kopecka.
Oct 14, 2021
Branets, Anna; Bahtina, Daria, 2021, "Annex 1 to the article "The role of language exposure in mediated receptive multilingualism"", https://doi.org/10.23673/RE-295, DATADOI, V1
The annex 1 to the article "The role of language exposure in mediated receptive multilingualism" presents the socio-linguistic questionnaire that was used in the current study.
Add Data

Sign up or log in to create a dataverse or add a dataset.

Share Dataverse

Share this dataverse on your favorite social media networks.

Link Dataverse
Reset Modifications

Are you sure you want to reset the selected metadata fields? If you do this, any customizations (hidden, required, optional) you have done will no longer appear.