Näita lihtsat nimetuse kirjet

dc.contributor.authorLippus, Pärtel
dc.contributor.authorAare, Kätlin
dc.contributor.authorMalmi, Anton
dc.contributor.authorTuisk, Tuuli
dc.contributor.authorTeras, Pire
dc.coverage.spatialEstoniaen
dc.date.accessioned2023-10-23T09:27:32Z
dc.date.available2023-10-23T09:27:32Z
dc.date.issued2023-10-20
dc.identifier.urihttps://datadoi.ee/handle/33/577
dc.identifier.urihttp://dx.doi.org/10.23673/re-438
dc.description.abstractThe Phonetic Corpus of Estonian Spontaneous Speech consists of recordings that have been annotated on different linguistic tiers including words and segments and their boundaries in the speech signal. The corpus mainly contains dialogues. The corpus can be used for studying different phonetic and linguistic research questions and for training various language technological applications (e.g. speech recognition, dialogue systems). In addition to the detailed phonetic segmentation the corpus has word-level annotation uses standard orthography so the corpus can be used with most NLP tools built for written language. The corpus includes: - Studio quality sound recordings, separate channels for each speaker; Spontaneous conversation between 2-3 speakers, approximately 30 minutes for each recording; - Manual transcription of words and phonemes; - 207 individual speakers in the age range of 20–85 years; - A total of 135 hours of speech recordings; - Word & phoneme level annotation of 106 hours / one milion word level intervals.en
dc.formatWAVen
dc.formatTextGriden
dc.formatJSONen
dc.formatTXTen
dc.language.isoeten
dc.publisherInstitute of Estonian and General Linguistics, University of Tartuen
dc.relationEKTB3en
dc.rightsinfo:eu-repo/semantics/restrictedAccessen
dc.subjectspeech corpusen
dc.subjectphonetic annotationen
dc.subjecttime aligned annotationen
dc.subjectmultimodal speechen
dc.subjectdialoguesen
dc.subjectvoice qualityen
dc.subjectmorphological analysisen
dc.subjectEstonian languageen
dc.titlePhonetic Corpus of Estonian Spontaneous Speech v1.3en
dc.typeinfo:eu-repo/semantics/dataseten
dc.typeData Paperen
dc.typeAudiovisualen
dc.typeSounden


Failid selles nimetuses

Thumbnail
Thumbnail
Thumbnail
Thumbnail
Thumbnail
Thumbnail
Thumbnail
Thumbnail
Thumbnail
Thumbnail
Thumbnail
Thumbnail
Thumbnail
Thumbnail
Thumbnail
Thumbnail
Thumbnail
Thumbnail
Thumbnail
Thumbnail

Nimetus asub järgmis(t)es andmekogumi(te)s:

Näita lihtsat nimetuse kirjet