Author of the README file: Piia Taremaa
Last updated: 17.02.2021

-------------------
GENERAL INFORMATION
-------------------

Title of the dataset: Data and R code for "Constructional variation in Estonian: demonstrative pronouns and adverbs as determiners in noun phrases"
DOI: http://dx.doi.org/10.23673/re-269
URI: https://datadoi.ee/handle/33/327
Description: data and statistical code used in the paper "Constructional variation in Estonian: demonstrative pronouns and adverbs as determiners in noun phrases" (accepted by Lingua 2021); Piia Taremaa, Helen Hint, Maria Reile, Renate Pajusalu

--------------------
DATA & FILE OVERVIEW
--------------------

The dataset consists of the following files:
- Documentation: '00_README_constrVarInEst.txt'. This file (= the current document) contains the documentation of the dataset.
- Data: 'CVE.txt'.
- Statistical code: 'R code for 'Constructional variation in Estonian' (Taremaa et al. 2021).txt'.

---------------------------------------
DATA-SPECIFIC INFORMATION FOR 'CVE.txt'
---------------------------------------

File: 'CVE.txt'.
This file contains corpus data. Corpus clauses are taken from the Estonian National Corpus 2017. Clauses are manually coded by the authors of the paper.

List of the variables (levels):
* SubConstruction (seal_N_ine, siin_N_ine, selles_N_ine, tolles_N_ine, seal_N_ade, siin_N_ade, sellel_N_ade, tollel_N_ade, sealt_N_ela, siit_N_ela, sellest_N_ela, tollest_N_ela, sealt_N_abl, siit_N_abl, sellelt_N_abl, tollelt_N_abl, sinna_N_ill, siia_N_ill, sellesse_N_ill, tollesse_N_ill, sinna_N_all, siia_N_all, sellele_N_all, tollele_N_all)
* Construction (demPronNP, demAdvNP)
* DemDistance (proximal, distal)
* DemSpatRel (Source, Location, Goal)
* DemCase (innerCase, outerCase)
* NounAnimacy (animate, inanimate)
* NounSize (large, small, unclear)
* NounConcreteness (abstract, ambiguous, concrete, unknown)
* NounMobility (mobile, stative, unclear)
* NounTemporality (temporal, non-temporal)
* NounLemmaLength (2...5, 6...8, 9...19)
* NounFrequency (highly frequent, frequent, infrequent)
* MotionVerb (yes, no)
* VerbPlacement (left, right, no (finite) verb)
* TextType (informal, institutional, journalistic, Wikipedia, other)
* SubCorpus (1990-2008, 2013, 2017)
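
Example (illustrative only, not part of the authors' R code): the variable list above can be used to check that a table contains only the documented levels. The following Python sketch rests on several assumptions that this README does not confirm: that 'CVE.txt' is tab-delimited, that it has a header row, and that its column names and level labels are spelled exactly as listed above (for example '2...5' for NounLemmaLength). The names EXPECTED_LEVELS, find_unexpected_values and SAMPLE are hypothetical, and SAMPLE is made-up demonstration input, not data from the corpus.

```python
import csv
import io

# Level labels copied verbatim from the variable list in this README.
# ASSUMPTION: the actual spellings in 'CVE.txt' may differ.
EXPECTED_LEVELS = {
    "SubConstruction": {
        "seal_N_ine", "siin_N_ine", "selles_N_ine", "tolles_N_ine",
        "seal_N_ade", "siin_N_ade", "sellel_N_ade", "tollel_N_ade",
        "sealt_N_ela", "siit_N_ela", "sellest_N_ela", "tollest_N_ela",
        "sealt_N_abl", "siit_N_abl", "sellelt_N_abl", "tollelt_N_abl",
        "sinna_N_ill", "siia_N_ill", "sellesse_N_ill", "tollesse_N_ill",
        "sinna_N_all", "siia_N_all", "sellele_N_all", "tollele_N_all",
    },
    "Construction": {"demPronNP", "demAdvNP"},
    "DemDistance": {"proximal", "distal"},
    "DemSpatRel": {"Source", "Location", "Goal"},
    "DemCase": {"innerCase", "outerCase"},
    "NounAnimacy": {"animate", "inanimate"},
    "NounSize": {"large", "small", "unclear"},
    "NounConcreteness": {"abstract", "ambiguous", "concrete", "unknown"},
    "NounMobility": {"mobile", "stative", "unclear"},
    "NounTemporality": {"temporal", "non-temporal"},
    "NounLemmaLength": {"2...5", "6...8", "9...19"},
    "NounFrequency": {"highly frequent", "frequent", "infrequent"},
    "MotionVerb": {"yes", "no"},
    "VerbPlacement": {"left", "right", "no (finite) verb"},
    "TextType": {"informal", "institutional", "journalistic", "Wikipedia", "other"},
    "SubCorpus": {"1990-2008", "2013", "2017"},
}


def find_unexpected_values(rows):
    """Return (row_number, variable, value) for each undocumented value.

    Only columns that are present in a row are checked, so a table with
    a subset of the variables passes as long as its values are documented.
    """
    problems = []
    for row_number, row in enumerate(rows, start=1):
        for variable, levels in EXPECTED_LEVELS.items():
            if variable in row and row[variable] not in levels:
                problems.append((row_number, variable, row[variable]))
    return problems


# Hypothetical tab-delimited sample; the second row contains the
# undocumented DemDistance value "near".
SAMPLE = (
    "Construction\tDemDistance\tDemSpatRel\n"
    "demPronNP\tproximal\tGoal\n"
    "demAdvNP\tnear\tLocation\n"
)

rows = list(csv.DictReader(io.StringIO(SAMPLE), delimiter="\t"))
problems = find_unexpected_values(rows)
print(problems)
```

To run the same check on the real data, replace io.StringIO(SAMPLE) with an open file handle for 'CVE.txt' and adjust the delimiter if the file turns out not to be tab-delimited.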