    • Eesti murrete korpus 

      Lindström, Liina; Todesk, Triin; Pilvik, Maarja-Liisa (Institute of Estonian and General Linguistics, University of Tartu, 2022-11-23)
      Eesti murrete korpus (the Corpus of Estonian Dialects) is an electronic database covering all Estonian dialects. The corpus consists of audio recordings, dialect texts in phonetic transcription, dialect texts in simplified transcription, morphologically ...
    • Estonian Teen Language Corpus 

      Vihman, Virve-Anneli; Pilvik, Maarja-Liisa; Mandel, Aive; Kängsepp, Annika; Aigro, Mari; Koreinik, Kadri; Praakli, Kristiina; Lindström, Liina (Institute of Estonian and General Linguistics, University of Tartu, 2023)
      Estonian Teen Language Corpus (Eesti teismeliste keele korpus) is a corpus of spoken and written language data collected from Estonian teenagers (ages 9-18) between 2019 and 2023. The corpus consists of four types ...
    • Foneetikakorpuse sagedussõnastik 

      Lippus, Pärtel (2019-06-20)
      The frequency dictionary of the Phonetic Corpus of Estonian Spontaneous Speech was compiled from corpus version v.1.0.5 (20.06.2019, doi:10.15155/1-00-0000-0000-0000-001A3L), at which point 685,750 words had been annotated in the corpus (89 hours and ...
    • Inari Saami geminates 

      Türk, Helen; Lippus, Pärtel; Pajusalu, Karl; Teras, Pire (2018-11-08)
      Data extracted from the Inari Saami prosody corpus (http://dx.doi.org/10.15155/1-00-0000-0000-0000-00150L), used in Türk et al. (2018), "The Acoustic Correlates of Quantity in Inari Saami", Journal of Phonetics. Target words ...
    • Kodavere kihelkonnas 19. sajandil sündinud lapsed 

      Edela, Anna (2019)
      Data used in Anna Edela's bachelor's thesis, drawn from the 19th-century birth registers of the EELK Kodavere congregation, which are available in the Saaga database of the Estonian Historical Archives. These contain, for Kodavere parish in 1835, 1840, ...
    • Meadow Mari Prosody data 

      Lehiste, Ilse; Teras, Pire; Help, Toomas; Lippus, Pärtel; Meister, Einar; Pajusalu, Karl; Viitso, Tiit-Rein (2005)
      This dataset contains the segmental durations, F0 measurements and formant values (F1-F3) of the vowels in 1-4 syllable words in Meadow Mari, a Finno-Ugric language. Eight native speakers read a list of 100 sentences, each ...
    • (Non-)Literalness ratings for Estonian particle verbs 

      Aedmaa, Eleri (2018-06)
      A (non-)literalness dataset of 1481 sentences formed with 184 Estonian particle verbs. The sentences were rated by 3 native speakers of Estonian on a 6-point scale [0,5] indicating the degree of compositionality of a particle ...
    • Phonetic Corpus of Estonian Spontaneous Speech v1.2 

      Lippus, Pärtel; Aare, Kätlin; Malmi, Anton; Tuisk, Tuuli; Teras, Pire (Institute of Estonian and General Linguistics, University of Tartu, 2021-09-08)
      The Phonetic Corpus of Estonian Spontaneous Speech consists of recordings that have been annotated on different linguistic tiers including words and segments and their boundaries in the speech signal. The corpus mainly ...
    • Phonetic Corpus of Estonian Spontaneous Speech v1.3 

      Lippus, Pärtel; Aare, Kätlin; Malmi, Anton; Tuisk, Tuuli; Teras, Pire (Institute of Estonian and General Linguistics, University of Tartu, 2023-10-20)
      The Phonetic Corpus of Estonian Spontaneous Speech consists of recordings that have been annotated on different linguistic tiers including words and segments and their boundaries in the speech signal. The corpus mainly ...
    • Pretrained word and multi-sense embeddings for Estonian 

      Aedmaa, Eleri (2019)
      Word and multi-sense embeddings for Estonian trained on lemmatized etTenTen: Corpus of the Estonian Web. The word embeddings are trained with word2vec, and the sense embeddings with SenseGram. Sense inventory is induced ...
    • Quantity-related variation of duration, pitch and vowel quality in spontaneous Estonian (data) 

      Lippus, Pärtel; Asu-Garcia, Eva Liina; Teras, Pire; Tuisk, Tuuli (2013)
      This dataset was collected from the University of Tartu Phonetic Corpus of Estonian Spontaneous Speech. It consists of words with CVCV (consonant-vowel-consonant-vowel) and CVCCV structure and it has been collected ...