Phonetic Corpus of Estonian Spontaneous Speech v1.3

Lippus, Pärtel; Aare, Kätlin; Malmi, Anton; Tuisk, Tuuli; Teras, Pire

Lippus, Pärtel; Aare, Kätlin; Malmi, Anton; Tuisk, Tuuli; Teras, Pire

Name	Size	Description
README.txt	3.027Kb	Short summary
ekskfk_info_eng.html	4.858Mb	paper describing background, materials and methods
ekskfk_info.html	4.856Mb	Korpuse tutvustus eesti keeles
ekskfk_margendus.html	1.169Mb	Annotation principles (In Estonian)
SKK0_TG.zip	94.71Mb	TextGrid files
SKK1_TG.zip	21.41Mb	TextGrid files
SKK2_TG.zip	19.95Mb	TextGrid files
SKK3_TG.zip	17.12Mb	TextGrid files
SKK0_WAV_part_1-2.zip	7.310Gb	studio dialogue wav files
SKK0_WAV_part_2-2.zip	6.460Gb	studio dialogue wav files
SKK1_WAV.zip	2.757Gb	monologue wav files
SKK2_WAV.zip	3.222Gb	fieldwork dialogue wav files
SKK3_WAV.zip	3.286Gb	trialogue wav files
SKK0_keypoints_part_1-2.zip	5.780Gb	OpenPose json files
SKK0_keypoints_part_2-2.zip	7.745Gb	OpenPose json files
SKK3_keypoints.zip	1.841Gb	OpenPose json files
SKK3_resp_TG.zip	1.404Mb	respiratory data TextGrid files
SKK3_resp_WAV.zip	735.5Mb	respiratory data wav files
EKSKFK_v1-3_words-by-IPU.txt	12.48Mb	text version of the corpus
EKSKFK_doc.zip	22.21Kb	metadata

Date

2023-10-20

URI

https://datadoi.ee/handle/33/577
http://dx.doi.org/10.23673/re-438

Metadata

Show full item record

Abstract

The Phonetic Corpus of Estonian Spontaneous Speech consists of recordings that have been annotated on different linguistic tiers including words and segments and their boundaries in the speech signal. The corpus mainly contains dialogues. The corpus can be used for studying different phonetic and linguistic research questions and for training various language technological applications (e.g. speech recognition, dialogue systems). In addition to the detailed phonetic segmentation the corpus has wword-level annotation uses standard orthography so the corpus can be used with most NLP tools built for written language. The corpus includes: - Studio quality sound recordings, separate channels for each speaker; Spontaneous conversation between 2-3 speakers, approximately 30 minutes for each recording; - Manual transcription of words and phonemes; - 207 individual speakers in the age range of 20–85 years; - A total of 135 hours of speech recordings; - Word & phoneme level annotation of 106 hours / one milion word level intervals.... Show more Show less

Keyword

speech corpus; phonetic annotation; time aligned annotation; multimodal speech; dialogues; voice quality; morphological analysis; Estonian language

Item type

info:eu-repo/semantics/dataset; Data Paper; Audiovisual; Sound

Collections

Eesti ja üldkeeleteaduse andmed