This readme file was generated on 2026-04-27 by Kirke Raidmets GENERAL INFORMATION Title of Dataset: Data: Wolf–dog Introgression in a High-Harvest Landscape Author/Principal Investigator Information Name: Kirke Raidmets ORCID:0009-0000-2037-6572 Institution:University of Tartu Address:J. Liivi tn 2, 50409 Tartu Email:kirke.raidmets@ut.ee Author/Associate or Co-investigator Information Name:Maris Hindrikson ORCID:0000-0001-8094-1860 Institution:University of Tartu Address:J. Liivi tn 2, 50409 Tartu Email:maris.hindrikson@ut.ee - Date of data collection: 2014-01-01 - 2022-12-31 - Geographic location of data collection: Estonia - Information about funding sources that supported the collection of the data:Grants PSG715 and PRG1209 SHARING/ACCESS INFORMATION - Licenses/restrictions placed on the data:Open for academic use with citation required - Links to publications that cite or use the data:None yet - Links to other publicly accessible locations of the data:Not publicly available - Links/relationships to ancillary data sets:None - Was data derived from another source? - If yes, list source(s):https://doi.org/10.5061/dryad.76hdr7stk - Recommended citation for this dataset:Raidmets, K. (2026). Data: Wolf–dog Introgression in a High-Harvest Landscape. University of Tartu. DATA & FILE OVERVIEW File List: - README.txt - Estonia_plus_referencedata_ped.txt - Estonia_plus_referencedata_map.txt - Relationship between files, if important: No - Additional related data collected that was not included in the current data package:No - Are there multiple versions of the dataset? - If yes, name of file(s) that was updated:No - Why was the file updated? - When was the file updated? METHODOLOGICAL INFORMATION Description of methods used for collection/generation of data: Estonian data were collected by hunters who gathered tissue samples from harvested wolves. The reference dataset is described in the article by Harmoinen (2021). Methods for processing the data: DNA was extracted from tissue samples collected from harvested wolves in Estonia, and genotyped using 93 SNP markers. In the Estonian dataset, quality control was applied to the resulting genotype data, during which low-quality individuals were removed. The presented dataset contains filtered SNP analysis results. The reference dataset is described in Harmoinen (2021), which provides detailed information on sample collection, genotyping, and quality control procedures. Instrument- or software-specific information needed to interpret the data: PLINK 2.0 - Standards and calibration information, if appropriate: Standard laboratory procedures were used for DNA extraction and SNP genotyping. No additional instrument-specific calibration was recorded beyond routine laboratory practice. - Environmental/experimental conditions:DNA extraction and genotyping were performed in a molecular genetics laboratory under controlled conditions. - Describe any quality-assurance procedures performed on the data:Low-quality SNP results were excluded from further analyses, and they are not included in the dataset presented here. - People involved with sample collection, processing, analysis and/or submission:Kirke Raidmets, Jenni Harmoinen, Mia Valtonen, Egle Tammeleht, Kristiina Prants, Inga Jõgisalu, Maris Hindrikson DATA-SPECIFIC INFORMATION FOR: Estonia_plus_referencedata_ped.txt - Number of variables:6 metadata variables + 93 SNP marker pairs (186 allele columns) - Number of cases/rows: 1172 individuals (548 Estonia + 624 reference dataset) - Variable List: - Population/Origin ID (e.g. CL_Estonia) – country or population of origin - Individual ID – unique sample identifier - Paternal ID – father ID (0 = unknown) - Maternal ID – mother ID (0 = unknown) - Sex – sex of individual (0 = unknown/not specified) - Phenotype – trait value (−9 = missing value) - 93 SNP markers – genotype data encoded as allele pairs (e.g., A A, G T, C C) - Missing data codes: - 0 – missing parental or sex information - −9 – missing phenotype value - 0 0 – missing genotype data (PLINK format) - Specialized formats or other abbreviations used:Data are in PLINK PED format DATA-SPECIFIC INFORMATION FOR: Estonia_plus_referencedata_map.txt - Number of variables:4 - Number of cases/rows:93 - Variable List: - Chromosome – chromosome number - SNP ID – marker name - Genetic distance – distance in centimorgans (cM) - Base-pair position – position in the genome (bp) - Missing data codes: 0 – genetic distance not available - Specialized formats or other abbreviations used:PLINK MAP format