Phonetic Corpus of Estonian Spontaneous Speech v1.2
Lippus, Pärtel; Aare, Kätlin; Malmi, Anton; Tuisk, Tuuli; Teras, Pire
Loading
Nimi | Suurus | Kirjeldus |
---|---|---|
README.txt | 2.478Kb | Short summary |
ekskfk_info_eng.html | 6.162Mb | paper describing background, materials and methods |
ekskfk_info.html | 6.176Mb | Korpuse tutvustus eesti keeles |
ekskfk_margendus_2020.html | 1.184Mb | Annotation principles (In Estonian) |
SKK0_WAV.zip | 13.63Gb | studio dialogue wav files |
SKK1_WAV.zip | 2.757Gb | monologue wav files |
SKK2_WAV.zip | 3.222Gb | fieldwork dialogue wav files |
SKK3_WAV.zip | 3.286Gb | trialogue wav files |
SKK0_TG.zip | 84.24Mb | TextGrid files |
SKK1_TG.zip | 21.41Mb | TextGrid files |
SKK2_TG.zip | 19.95Mb | TextGrid files |
SKK3_TG.zip | 4.065Mb | TextGrid files |
SKK3_resp_WAV.zip | 735.5Mb | respiratory data wav files |
SKK3_resp_TG.zip | 1.401Mb | respiratory data TextGrid files |
SKK0_keypoints.zip | 12.00Gb | OpenPose json files |
SKK3_keypoints.zip | 1.841Gb | OpenPose json files |
EKSKFK_words_by_IPU_full_corpus.txt | 12.37Mb | text version of the corpus |
EKSKFK_doc.zip | 21.64Kb | metadata |
Kokkuvõte
The Phonetic Corpus of Estonian Spontaneous Speech consists of recordings that have been annotated on different linguistic tiers including words and segments and their boundaries in the speech signal. The corpus mainly contains dialogues. The corpus can be used for studying different phonetic and linguistic research questions and for training various language technological applications (e.g. speech recognition, dialogue systems). In addition to the detailed phonetic segmentation the corpus has wword-level annotation uses standard orthography so the corpus can be used with most NLP tools built for written language.
The corpus includes:
- Studio quality sound recordings, separate channels for each speaker
Spontaneous conversation between 2-3 speakers, approximately 30 minutes for each recording
- Manual transcription of words and phonemes
- 205 individual speakers in the age range of 20–85 years
- A total of 134 hours of speech recordings
- Word & phoneme level annotation of 106 hours / 914 thousand word level intervals... Rohkem Vähem