Institute of Estonian and General Linguistics (Eesti ja üldkeeleteaduse instituut)

The institute's main task is to carry out teaching, research, and development work and to provide services needed by society in the fields of Estonian, the Finno-Ugric languages, and general linguistics.

The Institute of Estonian and General Linguistics conducts in-depth teaching and world-class research on Estonian and related languages in comparison with other world languages.

Eesti ja üldkeeleteaduse andmed (Estonian and General Linguistics Data)

Suuline eesti keel arvudes (Spoken Estonian in Numbers)

Distribution of categorised feedback comments (e.g. class, sub-class, and features) by feedback exchange group and by group member

Sep 24, 2020 - Eesti ja üldkeeleteaduse andmed

Yallop, Taremaa, & Leijen, 2020, "Distribution of categorised feedback comments (e.g. class, sub-class, and features) by feedback exchange group and by group member", https://doi.org/10.23673/REE3, DATADOI, V1

This entry contains data on the categorisation and classification of asynchronous written peer feedback comments within one doctorate writing group over a three-month period. The research data should be used in tandem with the following publication: Yallop, R. M. A., Taremaa, P. & Leijen, D. A. J. (forthcoming/2020). The affect and effect of asynchr...

Kodavere kihelkonnas 19. sajandil sündinud lapsed (Children born in Kodavere parish in the 19th century)

Sep 7, 2019 - Eesti ja üldkeeleteaduse andmed

Edela, Anna, 2019, "Kodavere kihelkonnas 19. sajandil sündinud lapsed", https://doi.org/10.15155/RE-71, DATADOI, V1

Data used in Anna Edela's bachelor's thesis, drawn from the 19th-century birth registers of the EELK Kodavere congregation, which are available in the Saaga database of the Estonian Historical Archives. For those born in Kodavere parish in 1835, 1840, 1865, 1870, 1885, and 1890, the data contain the given name(s), month of birth, month of baptism, sex, father's given name, mother's given name, godparents' given names...

Foneetikakorpuse sagedussõnastik (Frequency dictionary of the Phonetic Corpus)

Jun 28, 2019 - Eesti ja üldkeeleteaduse andmed

Lippus, Pärtel, 2019, "Foneetikakorpuse sagedussõnastik", https://doi.org/10.15155/RE-62, DATADOI, V1

The frequency dictionary of the Phonetic Corpus of Estonian Spontaneous Speech was compiled from corpus version v.1.0.5 (20.06.2019, doi:10.15155/1-00-0000-0000-0000-001A3L), at which point 685,750 words (89 hours and 18 minutes of speech) had been annotated in the corpus. For more about the corpus, see https://www.keel.ut.ee/et/foneetikakorpus. The corpus was lemmatized with the ESTMORF morphological...

Pretrained word and multi-sense embeddings for Estonian

Apr 11, 2019 - Eesti ja üldkeeleteaduse andmed

Aedmaa, Eleri, 2019, "Pretrained word and multi-sense embeddings for Estonian", https://doi.org/10.15155/RE-60, DATADOI, V1

Word and multi-sense embeddings for Estonian, trained on the lemmatized etTenTen: Corpus of the Estonian Web. Word embeddings are trained with word2vec. Sense embeddings are trained with SenseGram. The sense inventory is induced from the word embeddings. Models were trained using various parameter settings. The values of architecture, number of dimensions, windo...

Inari Saami geminates

Nov 9, 2018 - Eesti ja üldkeeleteaduse andmed

Türk, Helen; Lippus, Pärtel; Pajusalu, Karl; Teras, Pire, 2018, "Inari Saami geminates", https://doi.org/10.15155/RE-57, DATADOI, V1

Data extracted from the Inari Saami prosody corpus (http://dx.doi.org/10.15155/1-00-0000-0000-0000-00150L), used in Türk et al. (2018), The Acoustic Correlates of Quantity in Inari Saami, Journal of Phonetics. Target words with six different foot structures were used: CVCV, CVCCV, CVC:CV(C), CVVCV(C), CVVCCV(C), CVVC:CVC. In total there were 1463 wo...

(Non-)Literalness ratings for Estonian particle verbs

Oct 23, 2018 - Eesti ja üldkeeleteaduse andmed

Aedmaa, Eleri, 2018, "(Non-)Literalness ratings for Estonian particle verbs", https://doi.org/10.15155/RE-56, DATADOI, V1

(Non-)literalness dataset of 1481 sentences formed with 184 Estonian particle verbs. Sentences are evaluated by 3 native speakers of Estonian on a 6-point scale [0,5] indicating the degree of compositionality of a particle verb. The first version of the dataset was introduced by Eleri Aedmaa, Maximilian Köper, Sabine Schulte im Walde (2018). Combin...

Context-dependent articulation of consonant gemination in Estonian (data)

Feb 6, 2018 - Eesti ja üldkeeleteaduse andmed

Türk, Helen; Lippus, Pärtel; Šimko, Juraj, 2018, "Context-dependent articulation of consonant gemination in Estonian (data)", https://doi.org/10.15155/RE-34, DATADOI, V1

This dataset was collected from 4 native Estonian speakers with a Carstens AG-500 electromagnetic articulograph, articulating the 27 combinations of disyllabic words for the purpose of studying gemination in the Estonian three-way quantity system. The data was used for Türk, H., Lippus, P., & Šimko, J. (2017). Context-dependent articulation of consonan...

Meadow Mari Prosody data

Feb 6, 2018 - Eesti ja üldkeeleteaduse andmed

Lehiste, Ilse; Teras, Pire; Help, Toomas; Lippus, Pärtel; Meister, Einar; Pajusalu, Karl; Viitso, Tiit-Rein, 2018, "Meadow Mari Prosody data", https://doi.org/10.15155/RE-33, DATADOI, V1

This dataset contains the segmental durations, F0 measurements and formant values F1-F3 from the vowels in 1-4 syllable words in Meadow Mari, a Finno-Ugric language. 8 native speakers read a list of 100 sentences, each containing two test words. The results are published in Lehiste, I., Teras, P., Help, T., Lippus, P., Meister, E., Pajusalu, K., &...

The quality and quantity of Estonian intervocalic /l/ (data)

Dec 7, 2017 - Eesti ja üldkeeleteaduse andmed

Malmi, Anton, 2017, "The quality and quantity of Estonian intervocalic /l/ (data)", https://doi.org/10.15155/REPOS-7, DATADOI, V1

No description provided.

Quantity-related variation of duration, pitch and vowel quality in spontaneous Estonian (data)

Jun 5, 2017 - Eesti ja üldkeeleteaduse andmed

Lippus, Pärtel; Asu-Garcia, Eva Liina; Teras, Pire; Tuisk, Tuuli, 2017, "Quantity-related variation of duration, pitch and vowel quality in spontaneous Estonian (data)", https://doi.org/10.15155/REPO-16, DATADOI, V1

This dataset was collected from the University of Tartu Phonetic Corpus of Estonian Spontaneous Speech. It consists of words with CVCV (consonant-vowel-consonant-vowel) and CVCCV structure and was collected for studying Estonian quantity.
