The institute's main task is to carry out teaching, research and development work, and to provide services needed by society in the fields of Estonian, Finno-Ugric languages and general linguistics.

The Institute of Estonian and General Linguistics conducts in-depth teaching and world-class research on Estonian and related languages in comparison with other world languages.

31 to 38 of 38 Results
Jun 28, 2019 - Eesti ja üldkeeleteaduse andmed
Lippus, Pärtel, 2019, "Foneetikakorpuse sagedussõnastik", https://doi.org/10.15155/RE-62, DATADOI, V1
The frequency dictionary of the Phonetic Corpus of Estonian Spontaneous Speech was compiled from corpus version v.1.0.5 (20.06.2019, doi:10.15155/1-00-0000-0000-0000-001A3L), at which point 685,750 words (89 hours and 18 minutes of speech) had been annotated in the corpus. For more about the corpus, see https://www.keel.ut.ee/et/foneetikakorpus The corpus was lemmatized with the ESTMORF morphological...
Apr 11, 2019 - Eesti ja üldkeeleteaduse andmed
Aedmaa, Eleri, 2019, "Pretrained word and multi-sense embeddings for Estonian", https://doi.org/10.15155/RE-60, DATADOI, V1
Word and multi-sense embedding for Estonian trained on lemmatized etTenTen: Corpus of the Estonian Web. Word embeddings are trained with word2vec. Sense embeddings are trained with SenseGram. Sense inventory is induced from word embeddings. Models were trained using various parameter settings. The values of architecture, number of dimensions, windo...
Nov 9, 2018 - Eesti ja üldkeeleteaduse andmed
Türk, Helen; Lippus, Pärtel; Pajusalu, Karl; Teras, Pire, 2018, "Inari Saami geminates", https://doi.org/10.15155/RE-57, DATADOI, V1
Data extracted from the Inari Saami prosody corpus (http://dx.doi.org/10.15155/1-00-0000-0000-0000-00150L), used in Türk et al (2018). The Acoustic Correlates of Quantity in Inari Saami. Journal of Phonetics. Target words with six different foot structures were used: CVCV, CVCCV, CVC:CV(C), CVVCV(C), CVVCCV(C), CVVC:CVC. In total there were 1463 wo...
Oct 23, 2018 - Eesti ja üldkeeleteaduse andmed
Aedmaa, Eleri, 2018, "(Non-)Literalness ratings for Estonian particle verbs", https://doi.org/10.15155/RE-56, DATADOI, V1
(Non-)literalness dataset of 1481 sentences formed with 184 Estonian particle verbs. Sentences are evaluated by 3 native speakers of Estonian on a 6-point scale [0,5] indicating the degree of compositionality of a particle verb. The first version of the dataset was introduced by Eleri Aedmaa, Maximilian Köper, Sabine Schulte im Walde (2018). Combin...
Feb 6, 2018 - Eesti ja üldkeeleteaduse andmed
Türk, Helen; Lippus, Pärtel; Šimko, Juraj, 2018, "Context-dependent articulation of consonant gemination in Estonian (data)", https://doi.org/10.15155/RE-34, DATADOI, V1
This dataset was collected from 4 native Estonian speakers with a Carstens AG-500 electromagnetic articulograph, articulating the 27 combinations of disyllabic words for the purpose of studying gemination in the Estonian three-way quantity system. The data was used for Türk, H., Lippus, P., & Šimko, J. (2017). Context-dependent articulation of consonan...
Feb 6, 2018 - Eesti ja üldkeeleteaduse andmed
Lehiste, Ilse; Teras, Pire; Help, Toomas; Lippus, Pärtel; Meister, Einar; Pajusalu, Karl; Viitso, Tiit-Rein, 2018, "Meadow Mari Prosody data", https://doi.org/10.15155/RE-33, DATADOI, V1
This dataset contains the segmental durations, F0 measurements and formant values F1-F3 from the vowels in 1-4 syllable words in Meadow Mari, a Finno-Ugric language. 8 native speakers read a list of 100 sentences, each containing two test words. The results are published in Lehiste, I., Teras, P., Help, T., Lippus, P., Meister, E., Pajusalu, K., &...
Dec 7, 2017 - Eesti ja üldkeeleteaduse andmed
Malmi, Anton, 2017, "The quality and quantity of Estonian intervocalic /l/ (data)", https://doi.org/10.15155/REPOS-7, DATADOI, V1
No description provided.
Jun 5, 2017 - Eesti ja üldkeeleteaduse andmed
Lippus, Pärtel; Asu-Garcia, Eva Liina; Teras, Pire; Tuisk, Tuuli, 2017, "Quantity-related variation of duration, pitch and vowel quality in spontaneous Estonian (data)", https://doi.org/10.15155/REPO-16, DATADOI, V1
This dataset is collected from the University of Tartu Phonetic Corpus of Estonian Spontaneous Speech. The dataset consists of words with CVCV (consonant-vowel-consonant-vowel) and CVCCV structure and it has been collected for studying Estonian Quantity.