Phonetic Corpus of Estonian Spontaneous Speech v1.2

Lippus, Pärtel; Aare, Kätlin; Malmi, Anton; Tuisk, Tuuli; Teras, Pire

Lippus, Pärtel; Aare, Kätlin; Malmi, Anton; Tuisk, Tuuli; Teras, Pire

Name	Size	Description
README.txt	2.478Kb	Short summary
ekskfk_info_eng.html	6.162Mb	paper describing background, materials and methods
ekskfk_info.html	6.176Mb	Korpuse tutvustus eesti keeles
ekskfk_margendus_2020.html	1.184Mb	Annotation principles (In Estonian)
SKK0_WAV.zip	13.63Gb	studio dialogue wav files
SKK1_WAV.zip	2.757Gb	monologue wav files
SKK2_WAV.zip	3.222Gb	fieldwork dialogue wav files
SKK3_WAV.zip	3.286Gb	trialogue wav files
SKK0_TG.zip	84.24Mb	TextGrid files
SKK1_TG.zip	21.41Mb	TextGrid files
SKK2_TG.zip	19.95Mb	TextGrid files
SKK3_TG.zip	4.065Mb	TextGrid files
SKK3_resp_WAV.zip	735.5Mb	respiratory data wav files
SKK3_resp_TG.zip	1.401Mb	respiratory data TextGrid files
SKK0_keypoints.zip	12.00Gb	OpenPose json files
SKK3_keypoints.zip	1.841Gb	OpenPose json files
EKSKFK_words_by_IPU_full_corpus.txt	12.37Mb	text version of the corpus
EKSKFK_doc.zip	21.64Kb	metadata

Date

2021-09-08

URI

https://datadoi.ee/handle/33/351
https://doi.org/10.23673/re-293

Metadata

Show full item record

Abstract

The Phonetic Corpus of Estonian Spontaneous Speech consists of recordings that have been annotated on different linguistic tiers including words and segments and their boundaries in the speech signal. The corpus mainly contains dialogues. The corpus can be used for studying different phonetic and linguistic research questions and for training various language technological applications (e.g. speech recognition, dialogue systems). In addition to the detailed phonetic segmentation the corpus has wword-level annotation uses standard orthography so the corpus can be used with most NLP tools built for written language. The corpus includes: - Studio quality sound recordings, separate channels for each speaker Spontaneous conversation between 2-3 speakers, approximately 30 minutes for each recording - Manual transcription of words and phonemes - 205 individual speakers in the age range of 20–85 years - A total of 134 hours of speech recordings - Word & phoneme level annotation of 106 hours / 914 thousand word level intervals... Show more Show less

Keyword

speech corpus; phonetic annotation; phoneme segments; multimodal speech; dialogues; voice quality; morphological analysis

Item type

info:eu-repo/semantics/dataset; Data Paper; Audiovisual; Sound

Collections

Eesti ja üldkeeleteaduse andmed