Recent Submissions

  • Estonian Teen Language Corpus 

    Vihman, Virve-Anneli; Pilvik, Maarja-Liisa; Mandel, Aive; Kängsepp, Annika; Aigro, Mari; Koreinik, Kadri; Praakli, Kristiina; Lindström, Liina (Institute of Estonian and General Linguistics, University of Tartu, 2023)
    Estonian Teen Language Corpus (Eesti teismeliste keele korpus) is a corpus representing spoken and written language data, collected from Estonian teenagers (ages 9-18) between 2019-2023. The corpus consists of four types ...
  • Phonetic Corpus of Estonian Spontaneous Speech v1.3 

    Lippus, Pärtel; Aare, Kätlin; Malmi, Anton; Tuisk, Tuuli; Teras, Pire (Institute of Estonian and General Linguistics, University of Tartu, 2023-10-20)
    The Phonetic Corpus of Estonian Spontaneous Speech consists of recordings that have been annotated on different linguistic tiers including words and segments and their boundaries in the speech signal. The corpus mainly ...
  • Data for "A corpus study of grammatical case forms in written and spoken Estonian: Frequency, distribution and grammatical role" 

    Vihman, Virve-Anneli; Miljan, Merilin (University of Tartu, Institute of Estonian and General Linguistics, 2023)
    This dataset makes available the sample of clauses used in the study "A corpus study of grammatical case forms in written and spoken Estonian: Frequency, distribution and grammatical role". It includes 751 clauses from the ...
  • Eesti murrete korpus 

    Lindström, Liina; Todesk, Triin; Pilvik, Maarja-Liisa (Tartu Ülikool, eesti ja üldkeeleteaduse instituut, 2022-11-23)
    Eesti murrete korpus on kõiki eesti murdeid hõlmav elektrooniline andmekogu. Korpus koosneb helisalvestistest, foneetilises transkriptsioonis murdetekstidest, lihtsustatud transkriptsioonis murdetekstidest, morfoloogiliselt ...
  • Data and R code for 'Speed and space' (Taremaa & Kopecka) 

    Taremaa, Piia; Kopecka, Anetta (University of Tartu, 2022)
    Data and statistical code used in the paper "Speed and space: semantic asymmetries in motion descriptions in Estonian" (published in Cognitive Linguistics; Ahead of Print, published online 8 December 2022)

View more