Circulating lipoprotein(a) [Lp(a)] levels are highly heritable and linked to atherosclerotic cardiovascular disease, yet clinical measurement rates remain low (<1%) in the United States. The high heri Show more
Circulating lipoprotein(a) [Lp(a)] levels are highly heritable and linked to atherosclerotic cardiovascular disease, yet clinical measurement rates remain low (<1%) in the United States. The high heritability of Lp(a) across populations makes genetic prediction an attractive approach for closing this testing gap, but existing polygenic scores transfer poorly across populations. Haplotype-based prediction models, which use standard genome-wide genotype data to capture common-, rare-, and structural-variation at the LPA locus, could bridge this gap, enabling opportunistic identification of individuals with elevated Lp(a) levels across diverse populations within existing large, genotyped cohorts. This study sought to develop and validate a haplotype-based prediction model using genome-wide genotype data to identify individuals with elevated Lp(a) levels across diverse populations. We developed an Among PMBB (n = 1856), MGBB (n = 1401), and BioMe (n = 1686) participants with available genotype and Lp(a) measurements, average age was 60 years, and 51% were female. Overall r A haplotype-based genetic model effectively identified individuals with elevated Lp(a) levels across diverse populations, with potential utility for opportunistic screening among cohorts where genotype data is available, but Lp(a) testing rates are low. Show less
In a recent study by Zhao et al., rare protein-truncating variants (PTVs) in the BSN and APBA1 genes showed effects on obesity that exceeded those of well-known genes such as MC4R in a UK cohort. In t Show more
In a recent study by Zhao et al., rare protein-truncating variants (PTVs) in the BSN and APBA1 genes showed effects on obesity that exceeded those of well-known genes such as MC4R in a UK cohort. In this study, we leveraged the All of Us Research Program, to investigate the association of predicted LoF (pLoF) PTVs in BSN and APBA1 with body mass index (BMI) across a population of diverse ancestry. Our analysis revealed that the impact of pLoF variants in BSN and APBA1 on BMI was notably greater in this cohort, especially among individuals of European ancestry. Additionally, a phenome-wide association study (PheWAS) using the extensive phenotypic data available in the All of Us Research Program uncovered novel associations of Show less
Melanocortin-4 receptor (MC4R) plays an essential role in food intake and energy homeostasis. More than 170 MC4R variants have been described over the past two decades, with conflicting reports regard Show more
Melanocortin-4 receptor (MC4R) plays an essential role in food intake and energy homeostasis. More than 170 MC4R variants have been described over the past two decades, with conflicting reports regarding the prevalence and phenotypic effects of these variants in diverse cohorts. To determine the frequency of MC4R variants in large cohort of different ancestries, we evaluated the MC4R coding region for 20,537 eMERGE participants with sequencing data plus additional 77,454 independent individuals with genome-wide genotyping data at this locus. The sequencing data were obtained from the eMERGE phase III study, in which multisample variant call format calls have been generated, curated, and annotated. In addition to penetrance estimation using body mass index (BMI) as a binary outcome, GWAS and PheWAS were performed using median BMI in linear regression analyses. All results were adjusted for principal components, age, sex, and sites of genotyping. Targeted sequencing data of MC4R revealed 125 coding variants in 1839 eMERGE participants including 30 unreported coding variants that were predicted to be functionally damaging. Highly penetrant unreported variants included (L325I, E308K, D298N, S270F, F261L, T248A, D111V, and Y80F) in which seven participants had obesity class III defined as BMI ≥ 40 kg/m MC4R screening in a large eMERGE cohort confirmed many previous findings, extend the MC4R pleotropic effects, and discovered additional MC4R rare alleles that probably contribute to obesity. Show less
Elevated triglycerides (TG) are associated with, and may be causal for, cardiovascular disease (CVD), and co-morbidities such as type II diabetes and metabolic syndrome. Pathogenic variants in APOA5 a Show more
Elevated triglycerides (TG) are associated with, and may be causal for, cardiovascular disease (CVD), and co-morbidities such as type II diabetes and metabolic syndrome. Pathogenic variants in APOA5 and APOC3 as well as risk SNVs in other genes [APOE (rs429358, rs7412), APOA1/C3/A4/A5 gene cluster (rs964184), INSR (rs7248104), CETP (rs7205804), GCKR (rs1260326)] have been shown to affect TG levels. Knowledge of genetic causes for elevated TG may lead to early intervention and targeted treatment for CVD. We previously identified linkage and association of a rare, highly conserved missense variant in SLC25A40, rs762174003, with hypertriglyceridemia (HTG) in a single large family, and replicated this association with rare, highly conserved missense variants in a European American and African American sample. Here, we analyzed a longitudinal mixed-ancestry cohort (European, African and Asian ancestry, N = 8966) from the Electronic Medical Record and Genomics (eMERGE) Network. We tested associations between median TG and the genes of interest, using linear regression, adjusting for sex, median age, median BMI, and the first two principal components of ancestry. We replicated the association between TG and APOC3, APOA5, and risk variation at APOE, APOA1/C3/A4/A5 gene cluster, and GCKR. We failed to replicate the association between rare, highly conserved variation at SLC25A40 and TG, as well as for risk variation at INSR and CETP. Analysis using data from electronic health records presents challenges that need to be overcome. Although large amounts of genotype data is becoming increasingly accessible, usable phenotype data can be challenging to obtain. We were able to replicate known, strong associations, but were unable to replicate moderate associations due to the limited sample size and missing drug information. Show less
Genome-wide association studies (GWAS) have identified >250 loci for body mass index (BMI), implicating pathways related to neuronal biology. Most GWAS loci represent clusters of common, noncoding var Show more
Genome-wide association studies (GWAS) have identified >250 loci for body mass index (BMI), implicating pathways related to neuronal biology. Most GWAS loci represent clusters of common, noncoding variants from which pinpointing causal genes remains challenging. Here we combined data from 718,734 individuals to discover rare and low-frequency (minor allele frequency (MAF) < 5%) coding variants associated with BMI. We identified 14 coding variants in 13 genes, of which 8 variants were in genes (ZBTB7B, ACHE, RAPGEF3, RAB21, ZFHX3, ENTPD6, ZFR2 and ZNF169) newly implicated in human obesity, 2 variants were in genes (MC4R and KSR2) previously observed to be mutated in extreme obesity and 2 variants were in GIPR. The effect sizes of rare variants are ~10 times larger than those of common variants, with the largest effect observed in carriers of an MC4R mutation introducing a stop codon (p.Tyr35Ter, MAF = 0.01%), who weighed ~7 kg more than non-carriers. Pathway analyses based on the variants associated with BMI confirm enrichment of neuronal genes and provide new evidence for adipocyte and energy expenditure biology, widening the potential of genetically supported therapeutic targets in obesity. Show less
We screened variants on an exome-focused genotyping array in >300,000 participants (replication in >280,000 participants) and identified 444 independent variants in 250 loci significantly associated w Show more
We screened variants on an exome-focused genotyping array in >300,000 participants (replication in >280,000 participants) and identified 444 independent variants in 250 loci significantly associated with total cholesterol (TC), high-density-lipoprotein cholesterol (HDL-C), low-density-lipoprotein cholesterol (LDL-C), and/or triglycerides (TG). At two loci (JAK2 and A1CF), experimental analysis in mice showed lipid changes consistent with the human data. We also found that: (i) beta-thalassemia trait carriers displayed lower TC and were protected from coronary artery disease (CAD); (ii) excluding the CETP locus, there was not a predictable relationship between plasma HDL-C and risk for age-related macular degeneration; (iii) only some mechanisms of lowering LDL-C appeared to increase risk for type 2 diabetes (T2D); and (iv) TG-lowering alleles involved in hepatic production of TG-rich lipoproteins (TM6SF2 and PNPLA3) tracked with higher liver fat, higher risk for T2D, and lower risk for CAD, whereas TG-lowering alleles involved in peripheral lipolysis (LPL and ANGPTL4) had no effect on liver fat but decreased risks for both T2D and CAD. Show less
Genome-wide association studies have so far identified 56 loci associated with risk of coronary artery disease (CAD). Many CAD loci show pleiotropy; that is, they are also associated with other diseas Show more
Genome-wide association studies have so far identified 56 loci associated with risk of coronary artery disease (CAD). Many CAD loci show pleiotropy; that is, they are also associated with other diseases or traits. This study sought to systematically test if genetic variants identified for non-CAD diseases/traits also associate with CAD and to undertake a comprehensive analysis of the extent of pleiotropy of all CAD loci. In discovery analyses involving 42,335 CAD cases and 78,240 control subjects we tested the association of 29,383 common (minor allele frequency >5%) single nucleotide polymorphisms available on the exome array, which included a substantial proportion of known or suspected single nucleotide polymorphisms associated with common diseases or traits as of 2011. Suggestive association signals were replicated in an additional 30,533 cases and 42,530 control subjects. To evaluate pleiotropy, we tested CAD loci for association with cardiovascular risk factors (lipid traits, blood pressure phenotypes, body mass index, diabetes, and smoking behavior), as well as with other diseases/traits through interrogation of currently available genome-wide association study catalogs. We identified 6 new loci associated with CAD at genome-wide significance: on 2q37 (KCNJ13-GIGYF2), 6p21 (C2), 11p15 (MRVI1-CTR9), 12q13 (LRP1), 12q24 (SCARB1), and 16q13 (CETP). Risk allele frequencies ranged from 0.15 to 0.86, and odds ratio per copy of the risk allele ranged from 1.04 to 1.09. Of 62 new and known CAD loci, 24 (38.7%) showed statistical association with a traditional cardiovascular risk factor, with some showing multiple associations, and 29 (47%) showed associations at p < 1 × 10 We identified 6 loci associated with CAD at genome-wide significance. Several CAD loci show substantial pleiotropy, which may help us understand the mechanisms by which these loci affect CAD risk. Show less
White blood cells play diverse roles in innate and adaptive immunity. Genetic association analyses of phenotypic variation in circulating white blood cell (WBC) counts from large samples of otherwise Show more
White blood cells play diverse roles in innate and adaptive immunity. Genetic association analyses of phenotypic variation in circulating white blood cell (WBC) counts from large samples of otherwise healthy individuals can provide insights into genes and biologic pathways involved in production, differentiation, or clearance of particular WBC lineages (myeloid, lymphoid) and also potentially inform the genetic basis of autoimmune, allergic, and blood diseases. We performed an exome array-based meta-analysis of total WBC and subtype counts (neutrophils, monocytes, lymphocytes, basophils, and eosinophils) in a multi-ancestry discovery and replication sample of ∼157,622 individuals from 25 studies. We identified 16 common variants (8 of which were coding variants) associated with one or more WBC traits, the majority of which are pleiotropically associated with autoimmune diseases. Based on functional annotation, these loci included genes encoding surface markers of myeloid, lymphoid, or hematopoietic stem cell differentiation (CD69, CD33, CD87), transcription factors regulating lineage specification during hematopoiesis (ASXL1, IRF8, IKZF1, JMJD1C, ETS2-PSMG1), and molecules involved in neutrophil clearance/apoptosis (C10orf54, LTA), adhesion (TNXB), or centrosome and microtubule structure/function (KIF9, TUBD1). Together with recent reports of somatic ASXL1 mutations among individuals with idiopathic cytopenias or clonal hematopoiesis of undetermined significance, the identification of a common regulatory 3' UTR variant of ASXL1 suggests that both germline and somatic ASXL1 mutations contribute to lower blood counts in otherwise asymptomatic individuals. These association results shed light on genetic mechanisms that regulate circulating WBC counts and suggest a prominent shared genetic architecture with inflammatory and autoimmune diseases. Show less