Metrics
896 Downloads
Featured Dataverses

In order to use this feature you must have at least one published or linked dataverse.

Publish Dataverse

Are you sure you want to publish your dataverse? Once you do so it must remain published.

Publish Dataverse

This dataverse cannot be published because the dataverse it is in has not been published.

Delete Dataverse

Are you sure you want to delete your dataverse? You cannot undelete this dataverse.

Advanced Search

271 to 280 of 873 Results
Nov 14, 2023 - Suuline eesti keel arvudes
Lippus, Pärtel; Alumäe, Tanel; Orasmaa, Siim; Tsepelina, Katrin; Lindström, Liina, 2023, "Eesti Rahvusringhäälingu raadiosaadete korpus", https://doi.org/10.23673/RE-441, DATADOI, V1
Korpus koosneb ERR-i raadiosaadetest ja nende transkriptsioonidest. Korpuses on 53 000 raadiosaadet kogukestusega 16 tuhat tundi, mis on salvestatud vahemikus 1930–2022. Salvestused on transkribeeritud Tallinna Tehnikaülikooli automaatse kõnetuvastusega ning tekstid on automaatselt morfanalüüsitud EstNLTK-ga. Kokku on korpuses 109 miljonit sõna. Ko...
Comma Separated Values - 11.2 MB - MD5: a099b09a27df87f0b980bdb658bdd7a0
List of shows and their metadata
ZIP Archive - 4.7 GB - MD5: 423a19c9ec33af33990a4b872a988a4d
Transcription files
HTML - 10.3 KB - MD5: d882f116578afa63b47805133c276d3c
2026. aasta migratsiooni käigus varasemast DataDOI süsteemist üle kantud kasutusstatistika kajastab tegevust eelmises DSpace-põhises süsteemis ega näita Dataverse’i uusi kasutusandmeid. Usage statistics carried over from the previous DataDOI system as part of the 2026 migration reflect activity in the former DSpace-based system and do not represent...
Plain Text - 8.6 KB - MD5: 216bd295d33050f1b8e9e71be24f45af
Description of the dataset and using conditions
Oct 23, 2023 - Eesti ja üldkeeleteaduse andmed
Lippus, Pärtel; Aare, Kätlin; Malmi, Anton; Tuisk, Tuuli; Teras, Pire, 2023, "Phonetic Corpus of Estonian Spontaneous Speech v1.3", https://doi.org/10.23673/RE-438, DATADOI, V1
The Phonetic Corpus of Estonian Spontaneous Speech consists of recordings that have been annotated on different linguistic tiers including words and segments and their boundaries in the speech signal. The corpus mainly contains dialogues. The corpus can be used for studying different phonetic and linguistic research questions and for training vario...
ZIP Archive - 22.2 KB - MD5: a98f378658a2714d74075f3a7dabb869
metadata
HTML - 4.9 MB - MD5: a43aacb5fc3c9f0a54f59e5ac4d685f6
paper describing background, materials and methods
HTML - 4.9 MB - MD5: 75da01eb3efe15480c6fda112b5c5410
Korpuse tutvustus eesti keeles
HTML - 1.2 MB - MD5: eb517dd0ebd6da9d19004b01e360a708
Annotation principles (In Estonian)
Add Data

Sign up or log in to create a dataverse or add a dataset.

Share Dataverse

Share this dataverse on your favorite social media networks.

Link Dataverse
Reset Modifications

Are you sure you want to reset the selected metadata fields? If you do this, any customizations (hidden, required, optional) you have done will no longer appear.