Development of therapies for CLN3 disease, a rare pediatric lysosomal storage disorder, has been hindered by the lack of etiological insights and translatable biomarkers to clinics. We used a deep mul Show more
Development of therapies for CLN3 disease, a rare pediatric lysosomal storage disorder, has been hindered by the lack of etiological insights and translatable biomarkers to clinics. We used a deep multi-omics approach to discover blood-based biomarkers using longitudinal serum samples from a porcine model of CLN3 disease. Comprehensive metabolomics was combined with a nanoparticle-based LC-MS-based proteomic profiling coupled with TMTpro 18-plex to generate quantitative data on 769 metabolites and 2634 proteins, collectively the most exhaustive multi-omics profile conducted on serum from a porcine model. This was previously impossible due to lack of efficient deep serum proteome profiling technologies compatible with model organisms. Here we show that the presymptomatic disease state is characterized by elevations in glycerophosphodiester species and lysosomal proteases, while later timepoints are enriched with species involved in immune cell activation and sphingolipid metabolism. Cathepsin S (CTSS), Cathepsin B (CTSB), glycerophosphoinositol, and glycerophosphoethanolamine captured a large portion of the genotype-correlated variation between healthy and diseased animals, suggesting that an index score based on these analytes could have great utility in the clinic. This study's findings demonstrate the potential of deep multi-omics profiling for uncovering disease-specific biomarkers, providing valuable insights for understanding disease and facilitating the identification of potential drug targets, thus offering valuable insights for therapeutic interventions. Show less
Chromatin regulation involves the selective recruitment of chromatin factors to facilitate DNA repair, replication and transcription. Here we demonstrate the utility of coupling unbiased functional ge Show more
Chromatin regulation involves the selective recruitment of chromatin factors to facilitate DNA repair, replication and transcription. Here we demonstrate the utility of coupling unbiased functional genomics with chromatin immunoprecipitation (CRISPR-ChIP) to identify the factors associated with active chromatin modifications in mammalian cells. Specifically, an integrated reporter containing a cis-regulatory element of interest and a single guide RNA provide a chromatinized template for a direct readout for regulators of histone modifications associated with actively transcribed genes such as H3K4me3 and H3K79me2. With CRISPR-ChIP, we identify all the nonredundant COMPASS complex members required for H3K4me3 and demonstrate that RNA polymerase II is dispensable for the maintenance of H3K4me3. As H3K79me2 has a putative oncogenic function in leukemia cells driven by MLL translocations, using CRISPR-ChIP we reveal a functional partitioning of H3K79 methylation into two distinct regulatory units: an oncogenic DOT1L complex directed by the MLL fusion protein in a Menin-dependent manner and a separate endogenous DOT1L complex, where catalytic activity is directed by MLLT10. Overall, CRISPR-ChIP provides a powerful tool for the unbiased interrogation of the mechanisms underpinning chromatin regulation. Show less
Triglyceride (TG)-lowering LPL variants in combination with genetic LDL-C-lowering variants are associated with reduced risk of coronary artery disease (CAD). Genetic variation in the APOA5 gene encod Show more
Triglyceride (TG)-lowering LPL variants in combination with genetic LDL-C-lowering variants are associated with reduced risk of coronary artery disease (CAD). Genetic variation in the APOA5 gene encoding apolipoprotein A-V also strongly affects TG levels, but the potential clinical impact and underlying mechanisms are yet to be resolved. Here, we aimed to study the effects of APOA5 genetic variation on CAD risk and plasma lipoproteins through factorial genetic association analyses. Using data from 309,780 European-ancestry participants from the UK Biobank, we evaluated the effects of lower TG levels as a result of genetic variation in APOA5 and/or LPL on CAD risk with or without a background of reduced LDL-C. Next, we compared lower TG levels via APOA5 and LPL variation with over 100 lipoprotein measurements in a combined sample from the Netherlands Epidemiology of Obesity study (N = 4,838) and the Oxford Biobank (N = 6,999). We found that lower TG levels due to combined APOA5 and LPL variation and genetically-influenced lower LDL-C levels afforded the largest reduction in CAD risk (odds ratio: 0.78 (0.73-0.82)). Compared to patients with genetically-influenced lower TG via LPL, genetically-influenced lower TG via APOA5 had similar and independent, but notably larger, effects on the lipoprotein profile. Our results suggest that lower TG levels as a result of APOA5 variation have strong beneficial effects on CAD risk and the lipoprotein profile, which suggest apo A-V may be a potential novel therapeutic target for CAD prevention. Show less
Genome-wide association studies (GWAS) have identified >250 loci for body mass index (BMI), implicating pathways related to neuronal biology. Most GWAS loci represent clusters of common, noncoding var Show more
Genome-wide association studies (GWAS) have identified >250 loci for body mass index (BMI), implicating pathways related to neuronal biology. Most GWAS loci represent clusters of common, noncoding variants from which pinpointing causal genes remains challenging. Here we combined data from 718,734 individuals to discover rare and low-frequency (minor allele frequency (MAF) < 5%) coding variants associated with BMI. We identified 14 coding variants in 13 genes, of which 8 variants were in genes (ZBTB7B, ACHE, RAPGEF3, RAB21, ZFHX3, ENTPD6, ZFR2 and ZNF169) newly implicated in human obesity, 2 variants were in genes (MC4R and KSR2) previously observed to be mutated in extreme obesity and 2 variants were in GIPR. The effect sizes of rare variants are ~10 times larger than those of common variants, with the largest effect observed in carriers of an MC4R mutation introducing a stop codon (p.Tyr35Ter, MAF = 0.01%), who weighed ~7 kg more than non-carriers. Pathway analyses based on the variants associated with BMI confirm enrichment of neuronal genes and provide new evidence for adipocyte and energy expenditure biology, widening the potential of genetically supported therapeutic targets in obesity. Show less
We screened variants on an exome-focused genotyping array in >300,000 participants (replication in >280,000 participants) and identified 444 independent variants in 250 loci significantly associated w Show more
We screened variants on an exome-focused genotyping array in >300,000 participants (replication in >280,000 participants) and identified 444 independent variants in 250 loci significantly associated with total cholesterol (TC), high-density-lipoprotein cholesterol (HDL-C), low-density-lipoprotein cholesterol (LDL-C), and/or triglycerides (TG). At two loci (JAK2 and A1CF), experimental analysis in mice showed lipid changes consistent with the human data. We also found that: (i) beta-thalassemia trait carriers displayed lower TC and were protected from coronary artery disease (CAD); (ii) excluding the CETP locus, there was not a predictable relationship between plasma HDL-C and risk for age-related macular degeneration; (iii) only some mechanisms of lowering LDL-C appeared to increase risk for type 2 diabetes (T2D); and (iv) TG-lowering alleles involved in hepatic production of TG-rich lipoproteins (TM6SF2 and PNPLA3) tracked with higher liver fat, higher risk for T2D, and lower risk for CAD, whereas TG-lowering alleles involved in peripheral lipolysis (LPL and ANGPTL4) had no effect on liver fat but decreased risks for both T2D and CAD. Show less
In sequencing studies of common diseases and quantitative traits, power to test rare and low frequency variants individually is weak. To improve power, a common approach is to combine statistical evid Show more
In sequencing studies of common diseases and quantitative traits, power to test rare and low frequency variants individually is weak. To improve power, a common approach is to combine statistical evidence from several genetic variants in a region. Major challenges are how to do the combining and which statistical framework to use. General approaches for testing association between rare variants and quantitative traits include aggregating genotypes and trait values, referred to as 'collapsing', or using a score-based variance component test. However, little attention has been paid to alternative models tailored for protein truncating variants. Recent studies have highlighted the important role that protein truncating variants, commonly referred to as 'loss of function' variants, may have on disease susceptibility and quantitative levels of biomarkers. We propose a Bayesian modelling framework for the analysis of protein truncating variants and quantitative traits. Our simulation results show that our models have an advantage over the commonly used methods. We apply our models to sequence and exome-array data and discover strong evidence of association between low plasma triglyceride levels and protein truncating variants at APOC3 (Apolipoprotein C3). Software is available from http://www.well.ox.ac.uk/~rivas/mamba Show less
Obesity is globally prevalent and highly heritable, but its underlying genetic factors remain largely elusive. To identify genetic loci for obesity susceptibility, we examined associations between bod Show more
Obesity is globally prevalent and highly heritable, but its underlying genetic factors remain largely elusive. To identify genetic loci for obesity susceptibility, we examined associations between body mass index and ∼ 2.8 million SNPs in up to 123,865 individuals with targeted follow up of 42 SNPs in up to 125,931 additional individuals. We confirmed 14 known obesity susceptibility loci and identified 18 new loci associated with body mass index (P < 5 × 10⁻⁸), one of which includes a copy number variant near GPRC5B. Some loci (at MC4R, POMC, SH2B1 and BDNF) map near key hypothalamic regulators of energy balance, and one of these loci is near GIPR, an incretin receptor. Furthermore, genes in other newly associated loci may provide new insights into human body weight regulation. Show less
Levels of circulating glucose are tightly regulated. To identify new loci influencing glycemic traits, we performed meta-analyses of 21 genome-wide association studies informative for fasting glucose, Show more
Levels of circulating glucose are tightly regulated. To identify new loci influencing glycemic traits, we performed meta-analyses of 21 genome-wide association studies informative for fasting glucose, fasting insulin and indices of beta-cell function (HOMA-B) and insulin resistance (HOMA-IR) in up to 46,186 nondiabetic participants. Follow-up of 25 loci in up to 76,558 additional subjects identified 16 loci associated with fasting glucose and HOMA-B and two loci associated with fasting insulin and HOMA-IR. These include nine loci newly associated with fasting glucose (in or near ADCY5, MADD, ADRA2A, CRY2, FADS1, GLIS3, SLC2A2, PROX1 and C2CD4B) and one influencing fasting insulin and HOMA-IR (near IGF1). We also demonstrated association of ADCY5, PROX1, GCK, GCKR and DGKB-TMEM195 with type 2 diabetes. Within these loci, likely biological candidate genes influence signal transduction, cell proliferation, development, glucose-sensing and circadian regulation. Our results demonstrate that genetic studies of glycemic traits can identify type 2 diabetes risk loci, as well as loci containing gene variants that are associated with a modest elevation in glucose levels but are not associated with overt diabetes. Show less