Genome-wide association studies (GWAS) aim at identifying genomic regions involved in phenotype expression, but identifying causative variants is difficult. Pig Combined Annotation Dependent Depletion Show more
Genome-wide association studies (GWAS) aim at identifying genomic regions involved in phenotype expression, but identifying causative variants is difficult. Pig Combined Annotation Dependent Depletion (pCADD) scores provide a measure of the predicted consequences of genetic variants. Incorporating pCADD into the GWAS pipeline may help their identification. Our objective was to identify genomic regions associated with loin depth and muscle pH, and identify regions of interest for fine-mapping and further experimental work. Genotypes forβ~β40,000 single nucleotide morphisms (SNPs) were used to perform GWAS for these two traits, using de-regressed breeding values (dEBV) for 329,964 pigs from four commercial lines. Imputed sequence data was used to identify SNPs in strong ([Formula: see text] 0.80) linkage disequilibrium with lead GWAS SNPs with the highest pCADD scores. Fifteen distinct regions were associated with loin depth and one with loin pH at genome-wide significance. Regions on chromosomes 1, 2, 5, 7, and 16, explained between 0.06 and 3.55% of the additive genetic variance and were strongly associated with loin depth. Only a small part of the additive genetic variance in muscle pH was attributed to SNPs. The results of our pCADD analysis suggests that high-scoring pCADD variants are enriched for missense mutations. Two close but distinct regions on SSC1 were associated with loin depth, and pCADD identified the previously identified missense variant within the MC4R gene for one of the lines. For loin pH, pCADD identified a synonymous variant in the RNF25 gene (SSC15) as the most likely candidate for the muscle pH association. The missense mutation in the PRKAG3 gene known to affect glycogen content was not prioritised by pCADD for loin pH. For loin depth, we identified several strong candidate regions for further statistical fine-mapping that are supported in the literature, and two novel regions. For loin muscle pH, we identified one previously identified associated region. We found mixed evidence for the utility of pCADD as an extension of heuristic fine-mapping. The next step is to perform more sophisticated fine-mapping and expression quantitative trait loci (eQTL) analysis, and then interrogate candidate variants in vitro by perturbation-CRISPR assays. Show less
Backfat thickness is an important carcass composition trait for pork production and is commonly included in swine breeding programmes. In this paper, we report the results of a large genome-wide assoc Show more
Backfat thickness is an important carcass composition trait for pork production and is commonly included in swine breeding programmes. In this paper, we report the results of a large genome-wide association study for backfat thickness using data from eight lines of diverse genetic backgrounds. Data comprised 275,590 pigs from eight lines with diverse genetic backgrounds (breeds included Large White, Landrace, Pietrain, Hampshire, Duroc, and synthetic lines) genotyped and imputed for 71,324 single-nucleotide polymorphisms (SNPs). For each line, we estimated SNP associations using a univariate linear mixed model that accounted for genomic relationships. SNPs with significant associations were identified using a threshold of pβ<β10 We found significant associations with backfat thickness for 264 SNPs across 27 genomic regions. Six genomic regions were detected in three or more lines. The average estimate of the SNP-based heritability was 0.48, with estimates by line ranging from 0.30 to 0.58. The genomic regions jointly explained from 3.2 to 19.5% of the additive genetic variance of backfat thickness within a line. Individual genomic regions explained up to 8.0% of the additive genetic variance of backfat thickness within a line. Some of these 27 genomic regions also explained up to 1.6% of the additive genetic variance in lines for which the genomic region was not statistically significant. We identified 64 candidate genes with annotated functions that can be related to fat metabolism, including well-studied genes such as MC4R, IGF2, and LEPR, and more novel candidate genes such as DHCR7, FGF23, MEDAG, DGKI, and PTN. Our results confirm the polygenic architecture of backfat thickness and the role of genes involved in energy homeostasis, adipogenesis, fatty acid metabolism, and insulin signalling pathways for fat deposition in pigs. The results also suggest that several less well-understood metabolic pathways contribute to backfat development, such as those of phosphate, calcium, and vitamin D homeostasis. Show less
This paper reviews the localization of published potential causative variants in contemporary pig and cattle reference genomes, and the evidence for their causality. In spite of the difficulties inher Show more
This paper reviews the localization of published potential causative variants in contemporary pig and cattle reference genomes, and the evidence for their causality. In spite of the difficulties inherent to the identification of causative variants from genetic mapping and genome-wide association studies, researchers in animal genetics have proposed putative causative variants for several traits relevant to livestock breeding. For this review, we read the literature that supports potential causative variants in 13 genes (ABCG2, DGAT1, GHR, IGF2, MC4R, MSTN, NR6A1, PHGK1, PRKAG3, PLRL, RYR1, SYNGR2 and VRTN) in cattle and pigs, and localized them in contemporary reference genomes. We review the evidence for their causality, by aiming to separate the evidence for the locus, the proposed causative gene and the proposed causative variant, and report the bioinformatic searches and tactics needed to localize the sequence variants in the cattle or pig genome. Taken together, there is usually good evidence for the association at the locus level, some evidence for a specific causative gene at eight of the loci, and some experimental evidence for a specific causative variant at six of the loci. We recommend that researchers who report new potential causative variants use referenced coordinate systems, show local sequence context, and submit variants to repositories. Show less