DataDOI

Pretrained word and multi-sense embeddings for Estonian

Aedmaa, Eleri
Files
Name          Size        Description
README.pdf    46.31 Kb    README
Date
2019
URI
http://datadoi.ee/handle/33/91
https://doi.org/10.15155/re-60
Abstract
Word and multi-sense embeddings for Estonian, trained on the lemmatized etTenTen: Corpus of the Estonian Web. Word embeddings are trained with word2vec; sense embeddings are trained with SenseGram, with the sense inventory induced from the word embeddings. Models were trained under various parameter settings: the architecture, number of dimensions, window size, minimum frequency threshold, and number of iterations vary.
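As a rough illustration of what "various parameter settings" can mean in practice, the sketch below enumerates a grid over the five parameters named in the abstract. The concrete values, the `GRID` dictionary, and the `model_name` naming scheme are all hypothetical; the record does not list the actual settings or file names, so consult README.pdf for those.

```python
from itertools import product

# Hypothetical values: the abstract names these five parameters
# but does not state which settings were actually used.
GRID = {
    "architecture": ["cbow", "skipgram"],  # the two word2vec architectures
    "dimensions": [100, 300],
    "window": [5, 10],
    "min_count": [5, 10],                  # minimum frequency threshold
    "iterations": [5, 10],
}


def enumerate_settings(grid):
    """Return one dict per combination of parameter values."""
    names = list(grid)
    value_lists = (grid[name] for name in names)
    return [dict(zip(names, combo)) for combo in product(*value_lists)]


def model_name(setting):
    """Build a hypothetical file-name stem that encodes one setting."""
    return (
        "{architecture}_d{dimensions}_w{window}"
        "_m{min_count}_i{iterations}".format(**setting)
    )


settings = enumerate_settings(GRID)
names = [model_name(s) for s in settings]
```

With two candidate values per parameter, the grid yields 2^5 = 32 distinct settings, each of which would correspond to one trained model.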
Keywords
word embeddings; sense embeddings; Estonian
Item type
info:eu-repo/semantics/dataset
Collections
  • Eesti ja üldkeeleteaduse andmed

University of Tartu Library