BackgroundIdentifying genetic variants conferring resilience to Alzheimer's disease and related dementia (ADRD) may hold promise for developing therapeutics.ObjectiveTo determine genetic associations Show more
BackgroundIdentifying genetic variants conferring resilience to Alzheimer's disease and related dementia (ADRD) may hold promise for developing therapeutics.ObjectiveTo determine genetic associations with being dementia-free at age 85 (DF85).MethodsWe examined genetic associations, using whole genome sequencing data, with DF85 in three Trans-Omics for Precision Medicine cohorts and the Alzheimer's Disease Sequencing Project Phenotype Harmonization Consortium. We tested common variants individually and aggregation of rare (MAF ≤ 1%) coding and non-coding variants in DF85 participants (n = 3657) against individuals who were not DF85 (n = 20,010). We verified associations using a stricter control set who developed dementia before age 85 (n = 5552).ResultsWe observed an association at Show less
Atrial fibrillation (AF) is a prevalent and morbid abnormality of the heart rhythm with a strong genetic component. Here, we meta-analyzed genome and exome sequencing data from 36 studies that include Show more
Atrial fibrillation (AF) is a prevalent and morbid abnormality of the heart rhythm with a strong genetic component. Here, we meta-analyzed genome and exome sequencing data from 36 studies that included 52,416 AF cases and 277,762 controls. In burden tests of rare coding variation, we identified novel associations between AF and the genes MYBPC3, LMNA, PKP2, FAM189A2 and KDM5B. We further identified associations between AF and rare structural variants owing to deletions in CTNNA3 and duplications of GATA4. We broadly replicated our findings in independent samples from MyCode, deCODE and UK Biobank. Finally, we found that CRISPR knockout of KDM5B in stem-cell-derived atrial cardiomyocytes led to a shortening of the action potential duration and widespread transcriptomic dysregulation of genes relevant to atrial homeostasis and conduction. Our results highlight the contribution of rare coding and structural variants to AF, including genetic links between AF and cardiomyopathies, and expand our understanding of the rare variant architecture for this common arrhythmia. Show less
Obesity is a major public health crisis associated with high mortality rates. Previous genome-wide association studies (GWAS) investigating body mass index (BMI) have largely relied on imputed data fr Show more
Obesity is a major public health crisis associated with high mortality rates. Previous genome-wide association studies (GWAS) investigating body mass index (BMI) have largely relied on imputed data from European individuals. This study leveraged whole-genome sequencing (WGS) data from 88,873 participants from the Trans-Omics for Precision Medicine (TOPMed) Program, of which 51% were of non-European population groups. We discovered 18 BMI-associated signals (P < 5 × 10 Show less
Annually 300,000 Americans experience sudden cardiac arrest (SCA). Studies in referral SCA cohorts have observed rare variants in genes associated with arrhythmia and cardiomyopathy. We sought to: (1) Show more
Annually 300,000 Americans experience sudden cardiac arrest (SCA). Studies in referral SCA cohorts have observed rare variants in genes associated with arrhythmia and cardiomyopathy. We sought to: (1) establish the population prevalence of rare disease-causing variants in a set of candidate genes and (2) confirm the association of disease-causing variants in these genes with SCA in two prospective population-based studies. SCA patients (n=3264) were accrued from the Oregon Sudden Unexpected Death Study and the PREdiction of Sudden death in mulTi-ethnic cOmmunities (PRESTO) study and compared to control patients (n=13713) from the Atherosclerosis Risk in Communities (ARIC) study. Whole genome sequencing was performed. Disease-causing (likely pathogenic or pathogenic) variants in candidate genes associated with arrhythmia/cardiomyopathy were identified using updated American College of Medical Genetics and Genomics criteria. Gene- collapsing case-control analysis was performed using the conditional logistic regression-sequence kernel association test. We identified 300 disease-causing variants, the majority of which were in cardiomyopathy genes (71%). There were 136 patients (4.2%) in the SCA group and 351 patients (2.6%) in the control group with one or more disease-causing variants (OR 1.66, 95% confidence interval 1.33-2.07, p<0.001). We identified 13 genes associated with an increased risk of SCA, nine associated with cardiomyopathy ( Disease-causing variants in cardiomyopathy genes were the predominant genetic cause of SCA. These findings inform which genes to include in genetic screening for SCA. Show less
Metabolic pathways are related to physiological functions and disease states and are influenced by genetic variation and environmental factors. Hispanics/Latino individuals have ancestry-derived genom Show more
Metabolic pathways are related to physiological functions and disease states and are influenced by genetic variation and environmental factors. Hispanics/Latino individuals have ancestry-derived genomic regions (local ancestry) from their recent admixture that have been less characterized for associations with metabolite abundance and disease risk. We performed admixture mapping of 640 circulating metabolites in 3887 Hispanic/Latino individuals from the Hispanic Community Health Study/Study of Latinos (HCHS/SOL). Metabolites were quantified in fasting serum through non-targeted mass spectrometry (MS) analysis using ultra-performance liquid chromatography-MS/MS. Replication was performed in 1856 nonoverlapping HCHS/SOL participants with metabolomic data. By leveraging local ancestry, this study identified significant ancestry-enriched associations for 78 circulating metabolites at 484 independent regions, including 116 novel metabolite-genomic region associations that replicated in an independent sample. Among the main findings, we identified Native American enriched genomic regions at chromosomes 11 and 15, mapping to FADS1/FADS2 and LIPC, respectively, associated with reduced long-chain polyunsaturated fatty acid metabolites implicated in metabolic and inflammatory pathways. An African-derived genomic region at chromosome 2 was associated with N-acetylated amino acid metabolites. This region, mapped to ALMS1, is associated with chronic kidney disease, a disease that disproportionately burdens individuals of African descent. Our findings provide important insights into differences in metabolite quantities related to ancestry in admixed populations including metabolites related to regulation of lipid polyunsaturated fatty acids and N-acetylated amino acids, which may have implications for common diseases in populations. Show less
Obesity is a major public health crisis associated with high mortality rates. Previous genome-wide association studies (GWAS) investigating body mass index (BMI) have largely relied on imputed data fr Show more
Obesity is a major public health crisis associated with high mortality rates. Previous genome-wide association studies (GWAS) investigating body mass index (BMI) have largely relied on imputed data from European individuals. This study leveraged whole-genome sequencing (WGS) data from 88,873 participants from the Trans-Omics for Precision Medicine (TOPMed) Program, of which 51% were of non-European population groups. We discovered 18 BMI-associated signals ( Show less
We conducted cohort- and race-specific epigenome-wide association analyses of mitochondrial deoxyribonucleic acid (mtDNA) copy number (mtDNA CN) measured in whole blood from participants of African an Show more
We conducted cohort- and race-specific epigenome-wide association analyses of mitochondrial deoxyribonucleic acid (mtDNA) copy number (mtDNA CN) measured in whole blood from participants of African and European origins in five cohorts (n = 6182, mean age = 57-67 years, 65% women). In the meta-analysis of all the participants, we discovered 21 mtDNA CN-associated DNA methylation sites (CpG) (P < 1 × 10-7), with a 0.7-3.0 standard deviation increase (3 CpGs) or decrease (18 CpGs) in mtDNA CN corresponding to a 1% increase in DNA methylation. Several significant CpGs have been reported to be associated with at least two risk factors (e.g. chronological age or smoking) for cardiovascular disease (CVD). Five genes [PR/SET domain 16, nuclear receptor subfamily 1 group H member 3 (NR1H3), DNA repair protein, DNA polymerase kappa and decaprenyl-diphosphate synthase subunit 2], which harbor nine significant CpGs, are known to be involved in mitochondrial biosynthesis and functions. For example, NR1H3 encodes a transcription factor that is differentially expressed during an adipose tissue transition. The methylation level of cg09548275 in NR1H3 was negatively associated with mtDNA CN (effect size = -1.71, P = 4 × 10-8) and was positively associated with the NR1H3 expression level (effect size = 0.43, P = 0.0003), which indicates that the methylation level in NR1H3 may underlie the relationship between mtDNA CN, the NR1H3 transcription factor and energy expenditure. In summary, the study results suggest that mtDNA CN variation in whole blood is associated with DNA methylation levels in genes that are involved in a wide range of mitochondrial activities. These findings will help reveal molecular mechanisms between mtDNA CN and CVD. Show less
Genome-wide association studies have identified multiple genomic loci associated with coronary artery disease, but most are common variants in non-coding regions that provide limited information on ca Show more
Genome-wide association studies have identified multiple genomic loci associated with coronary artery disease, but most are common variants in non-coding regions that provide limited information on causal genes and etiology of the disease. To overcome the limited scope that common variants provide, we focused our investigation on low-frequency and rare sequence variations primarily residing in coding regions of the genome. Using samples of individuals of European ancestry from ten cohorts within the Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) consortium, both cross-sectional and prospective analyses were conducted to examine associations between genetic variants and myocardial infarction (MI), coronary heart disease (CHD), and all-cause mortality following these events. For prevalent events, a total of 27,349 participants of European ancestry, including 1831 prevalent MI cases and 2518 prevalent CHD cases were used. For incident cases, a total of 55,736 participants of European ancestry were included (3,031 incident MI cases and 5,425 incident CHD cases). There were 1,860 all-cause deaths among the 3,751 MI and CHD cases from six cohorts that contributed to the analysis of all-cause mortality. Single variant and gene-based analyses were performed separately in each cohort and then meta-analyzed for each outcome. A low-frequency intronic variant (rs988583) in PLCL1 was significantly associated with prevalent MI (OR = 1.80, 95% confidence interval: 1.43, 2.27; P = 7.12 × 10-7). We conducted gene-based burden tests for genes with a cumulative minor allele count (cMAC) ≥ 5 and variants with minor allele frequency (MAF) < 5%. TMPRSS5 and LDLRAD1 were significantly associated with prevalent MI and CHD, respectively, and RC3H2 and ANGPTL4 were significantly associated with incident MI and CHD, respectively. No loci were significantly associated with all-cause mortality following a MI or CHD event. This study identified one known locus (ANGPTL4) and four new loci (PLCL1, RC3H2, TMPRSS5, and LDLRAD1) associated with cardiovascular disease risk that warrant further investigation. Show less
Alcohol intake influences plasma lipid levels, and such effects may be moderated by genetic variants. We aimed to characterize the role of aggregated rare and low-frequency protein-coding variants in Show more
Alcohol intake influences plasma lipid levels, and such effects may be moderated by genetic variants. We aimed to characterize the role of aggregated rare and low-frequency protein-coding variants in gene by alcohol consumption interactions associated with fasting plasma lipid levels. In the Cohorts for Heart and Aging Research in Genomic Epidemiology consortium, fasting plasma triglycerides and high- and low-density lipoprotein cholesterol were measured in 34 153 individuals with European ancestry from 5 discovery studies and 32 277 individuals from 6 replication studies. Rare and low-frequency functional protein-coding variants (minor allele frequency, ≤5%) measured by an exome array were aggregated by genes and evaluated by a gene-environment interaction test and a joint test of genetic main and gene-environment interaction effects. Two dichotomous self-reported alcohol consumption variables, current drinker, defined as any recurrent drinking behavior, and regular drinker, defined as the subset of current drinkers who consume at least 2 drinks per week, were considered. We discovered and replicated 21 gene-lipid associations at 13 known lipid loci through the joint test. Eight loci ( In conclusion, this study applied new gene-based statistical approaches and suggested that rare and low-frequency genetic variants interacted with alcohol consumption on lipid levels. Show less
Genome-wide association studies (GWAS) have identified >250 loci for body mass index (BMI), implicating pathways related to neuronal biology. Most GWAS loci represent clusters of common, noncoding var Show more
Genome-wide association studies (GWAS) have identified >250 loci for body mass index (BMI), implicating pathways related to neuronal biology. Most GWAS loci represent clusters of common, noncoding variants from which pinpointing causal genes remains challenging. Here we combined data from 718,734 individuals to discover rare and low-frequency (minor allele frequency (MAF) < 5%) coding variants associated with BMI. We identified 14 coding variants in 13 genes, of which 8 variants were in genes (ZBTB7B, ACHE, RAPGEF3, RAB21, ZFHX3, ENTPD6, ZFR2 and ZNF169) newly implicated in human obesity, 2 variants were in genes (MC4R and KSR2) previously observed to be mutated in extreme obesity and 2 variants were in GIPR. The effect sizes of rare variants are ~10 times larger than those of common variants, with the largest effect observed in carriers of an MC4R mutation introducing a stop codon (p.Tyr35Ter, MAF = 0.01%), who weighed ~7 kg more than non-carriers. Pathway analyses based on the variants associated with BMI confirm enrichment of neuronal genes and provide new evidence for adipocyte and energy expenditure biology, widening the potential of genetically supported therapeutic targets in obesity. Show less
We screened variants on an exome-focused genotyping array in >300,000 participants (replication in >280,000 participants) and identified 444 independent variants in 250 loci significantly associated w Show more
We screened variants on an exome-focused genotyping array in >300,000 participants (replication in >280,000 participants) and identified 444 independent variants in 250 loci significantly associated with total cholesterol (TC), high-density-lipoprotein cholesterol (HDL-C), low-density-lipoprotein cholesterol (LDL-C), and/or triglycerides (TG). At two loci (JAK2 and A1CF), experimental analysis in mice showed lipid changes consistent with the human data. We also found that: (i) beta-thalassemia trait carriers displayed lower TC and were protected from coronary artery disease (CAD); (ii) excluding the CETP locus, there was not a predictable relationship between plasma HDL-C and risk for age-related macular degeneration; (iii) only some mechanisms of lowering LDL-C appeared to increase risk for type 2 diabetes (T2D); and (iv) TG-lowering alleles involved in hepatic production of TG-rich lipoproteins (TM6SF2 and PNPLA3) tracked with higher liver fat, higher risk for T2D, and lower risk for CAD, whereas TG-lowering alleles involved in peripheral lipolysis (LPL and ANGPTL4) had no effect on liver fat but decreased risks for both T2D and CAD. Show less
Coronary artery disease (CAD) is a leading cause of morbidity and mortality worldwide. Although 58 genomic regions have been associated with CAD thus far, most of the heritability is unexplained, indi Show more
Coronary artery disease (CAD) is a leading cause of morbidity and mortality worldwide. Although 58 genomic regions have been associated with CAD thus far, most of the heritability is unexplained, indicating that additional susceptibility loci await identification. An efficient discovery strategy may be larger-scale evaluation of promising associations suggested by genome-wide association studies (GWAS). Hence, we genotyped 56,309 participants using a targeted gene array derived from earlier GWAS results and performed meta-analysis of results with 194,427 participants previously genotyped, totaling 88,192 CAD cases and 162,544 controls. We identified 25 new SNP-CAD associations (P < 5 × 10 Show less
Circulating blood cell counts and indices are important indicators of hematopoietic function and a number of clinical parameters, such as blood oxygen-carrying capacity, inflammation, and hemostasis. Show more
Circulating blood cell counts and indices are important indicators of hematopoietic function and a number of clinical parameters, such as blood oxygen-carrying capacity, inflammation, and hemostasis. By performing whole-exome sequence association analyses of hematologic quantitative traits in 15,459 community-dwelling individuals, followed by in silico replication in up to 52,024 independent samples, we identified two previously undescribed coding variants associated with lower platelet count: a common missense variant in CPS1 (rs1047891, MAF = 0.33, discovery + replication p = 6.38 × 10(-10)) and a rare synonymous variant in GFI1B (rs150813342, MAF = 0.009, discovery + replication p = 1.79 × 10(-27)). By performing CRISPR/Cas9 genome editing in hematopoietic cell lines and follow-up targeted knockdown experiments in primary human hematopoietic stem and progenitor cells, we demonstrate an alternative splicing mechanism by which the GFI1B rs150813342 variant suppresses formation of a GFI1B isoform that preferentially promotes megakaryocyte differentiation and platelet production. These results demonstrate how unbiased studies of natural variation in blood cell traits can provide insight into the regulation of human hematopoiesis. Show less
White blood cells play diverse roles in innate and adaptive immunity. Genetic association analyses of phenotypic variation in circulating white blood cell (WBC) counts from large samples of otherwise Show more
White blood cells play diverse roles in innate and adaptive immunity. Genetic association analyses of phenotypic variation in circulating white blood cell (WBC) counts from large samples of otherwise healthy individuals can provide insights into genes and biologic pathways involved in production, differentiation, or clearance of particular WBC lineages (myeloid, lymphoid) and also potentially inform the genetic basis of autoimmune, allergic, and blood diseases. We performed an exome array-based meta-analysis of total WBC and subtype counts (neutrophils, monocytes, lymphocytes, basophils, and eosinophils) in a multi-ancestry discovery and replication sample of ∼157,622 individuals from 25 studies. We identified 16 common variants (8 of which were coding variants) associated with one or more WBC traits, the majority of which are pleiotropically associated with autoimmune diseases. Based on functional annotation, these loci included genes encoding surface markers of myeloid, lymphoid, or hematopoietic stem cell differentiation (CD69, CD33, CD87), transcription factors regulating lineage specification during hematopoiesis (ASXL1, IRF8, IKZF1, JMJD1C, ETS2-PSMG1), and molecules involved in neutrophil clearance/apoptosis (C10orf54, LTA), adhesion (TNXB), or centrosome and microtubule structure/function (KIF9, TUBD1). Together with recent reports of somatic ASXL1 mutations among individuals with idiopathic cytopenias or clonal hematopoiesis of undetermined significance, the identification of a common regulatory 3' UTR variant of ASXL1 suggests that both germline and somatic ASXL1 mutations contribute to lower blood counts in otherwise asymptomatic individuals. These association results shed light on genetic mechanisms that regulate circulating WBC counts and suggest a prominent shared genetic architecture with inflammatory and autoimmune diseases. Show less
General cognitive function is substantially heritable across the human life course from adolescence to old age. We investigated the genetic contribution to variation in this important, health- and wel Show more
General cognitive function is substantially heritable across the human life course from adolescence to old age. We investigated the genetic contribution to variation in this important, health- and well-being-related trait in middle-aged and older adults. We conducted a meta-analysis of genome-wide association studies of 31 cohorts (N=53,949) in which the participants had undertaken multiple, diverse cognitive tests. A general cognitive function phenotype was tested for, and created in each cohort by principal component analysis. We report 13 genome-wide significant single-nucleotide polymorphism (SNP) associations in three genomic regions, 6q16.1, 14q12 and 19q13.32 (best SNP and closest gene, respectively: rs10457441, P=3.93 × 10(-9), MIR2113; rs17522122, P=2.55 × 10(-8), AKAP6; rs10119, P=5.67 × 10(-9), APOE/TOMM40). We report one gene-based significant association with the HMGN1 gene located on chromosome 21 (P=1 × 10(-6)). These genes have previously been associated with neuropsychiatric phenotypes. Meta-analysis results are consistent with a polygenic model of inheritance. To estimate SNP-based heritability, the genome-wide complex trait analysis procedure was applied to two large cohorts, the Atherosclerosis Risk in Communities Study (N=6617) and the Health and Retirement Study (N=5976). The proportion of phenotypic variation accounted for by all genotyped common SNPs was 29% (s.e.=5%) and 28% (s.e.=7%), respectively. Using polygenic prediction analysis, ~1.2% of the variance in general cognitive function was predicted in the Generation Scotland cohort (N=5487; P=1.5 × 10(-17)). In hypothesis-driven tests, there was significant association between general cognitive function and four genes previously associated with Alzheimer's disease: TOMM40, APOE, ABCG1 and MEF2C. Show less
Myocardial infarction (MI), a leading cause of death around the world, displays a complex pattern of inheritance. When MI occurs early in life, genetic inheritance is a major component to risk. Previo Show more
Myocardial infarction (MI), a leading cause of death around the world, displays a complex pattern of inheritance. When MI occurs early in life, genetic inheritance is a major component to risk. Previously, rare mutations in low-density lipoprotein (LDL) genes have been shown to contribute to MI risk in individual families, whereas common variants at more than 45 loci have been associated with MI risk in the population. Here we evaluate how rare mutations contribute to early-onset MI risk in the population. We sequenced the protein-coding regions of 9,793 genomes from patients with MI at an early age (≤50 years in males and ≤60 years in females) along with MI-free controls. We identified two genes in which rare coding-sequence mutations were more frequent in MI cases versus controls at exome-wide significance. At low-density lipoprotein receptor (LDLR), carriers of rare non-synonymous mutations were at 4.2-fold increased risk for MI; carriers of null alleles at LDLR were at even higher risk (13-fold difference). Approximately 2% of early MI cases harbour a rare, damaging mutation in LDLR; this estimate is similar to one made more than 40 years ago using an analysis of total cholesterol. Among controls, about 1 in 217 carried an LDLR coding-sequence mutation and had plasma LDL cholesterol > 190 mg dl(-1). At apolipoprotein A-V (APOA5), carriers of rare non-synonymous mutations were at 2.2-fold increased risk for MI. When compared with non-carriers, LDLR mutation carriers had higher plasma LDL cholesterol, whereas APOA5 mutation carriers had higher plasma triglycerides. Recent evidence has connected MI risk with coding-sequence mutations at two genes functionally related to APOA5, namely lipoprotein lipase and apolipoprotein C-III (refs 18, 19). Combined, these observations suggest that, as well as LDL cholesterol, disordered metabolism of triglyceride-rich lipoproteins contributes to MI risk. Show less
Lipoprotein-associated phospholipase A2 (LpPLA2) activity was associated with higher CHD risk in a meta-analysis, which was partly dependent on circulating lipid levels. Apolipoprotein C3 loss-of-func Show more
Lipoprotein-associated phospholipase A2 (LpPLA2) activity was associated with higher CHD risk in a meta-analysis, which was partly dependent on circulating lipid levels. Apolipoprotein C3 loss-of-function (ApoC3 LOF) mutations were related with reduced postprandial lipemia and CHD risk. However, the association of LpPLA2 activity with ApoC3 LOF is not known. We examined the association of LpPLA2 activity and ApoC3 LOF mutations and incident cardiovascular disease (CVD) (defined as coronary heart disease [CHD] plus ischemic stroke) and all-cause mortality in the biracial longitudinal Atherosclerosis Risk In Communities (ARIC) study. The mean LpPLA2 activity was 229.3 nmol/min/mL and was higher in men and whites. LpPLA2 activity correlated positively with atherogenic dyslipidemia. ApoC3 LOF carriers had lower LpPLA2 activity levels compared to non-carriers, and there was inverse association between LpPLA2 activity and ApoC3 LOF mutations in whites. In a fully adjusted model, greater LpPLA2 activity was independently associated with incident CVD (HR 1.35, 1.09-1.68 for highest vs. lowest quintile), which was mainly explained by its association with CHD, and was also associated with all-cause mortality (HR 1.65, 1.38-1.98). Greater LpPLA2 activity was associated with increased CHD and all-cause mortality in both whites and African-Americans in the ARIC study. The inverse relation between LpPLA2 activity and ApoC3 LOF mutations suggests that delayed lipoprotein clearance may at least in part explain the observed association of LpPLA2 activity with increased CVD risk. Show less
A typical human exome harbors dozens of loss-of-function (LOF) variants, which can lower disease risk factor levels and affect drug efficacy. We hypothesized that LOF variants are enriched in genes in Show more
A typical human exome harbors dozens of loss-of-function (LOF) variants, which can lower disease risk factor levels and affect drug efficacy. We hypothesized that LOF variants are enriched in genes influencing risk factor levels and the onset of common chronic diseases, such as cardiovascular disease and diabetes. To test this hypothesis, we sequenced the exomes of 8,554 individuals and analyzed the effects of predicted LOF variants on 20 chronic disease risk factor phenotypes. Analysis of this sample as discovery and replication strata of equal size verified two relationships in well-studied genes (PCSK9 and APOC3) and identified eight new loci. Previously unknown relationships included elevated fasting glucose in carriers of heterozygous LOF variation in TXNDC5, which encodes a biomarker for type 1 diabetes progression, and apparent recessive effects of C1QTNF8 on serum magnesium levels. These data demonstrate the utility of functional-variant annotation within a large sample of deeply phenotyped individuals for gene discovery. Show less
Common variation at the 11p11.2 locus, encompassing MADD, ACP2, NR1H3, MYBPC3, and SPI1, has been associated in genome-wide association studies with fasting glucose and insulin (FI). In the Cohorts fo Show more
Common variation at the 11p11.2 locus, encompassing MADD, ACP2, NR1H3, MYBPC3, and SPI1, has been associated in genome-wide association studies with fasting glucose and insulin (FI). In the Cohorts for Heart and Aging Research in Genomic Epidemiology Targeted Sequencing Study, we sequenced 5 gene regions at 11p11.2 to identify rare, potentially functional variants influencing fasting glucose or FI levels. Sequencing (mean depth, 38×) across 16.1 kb in 3566 individuals without diabetes mellitus identified 653 variants, 79.9% of which were rare (minor allele frequency <1%) and novel. We analyzed rare variants in 5 gene regions with FI or fasting glucose using the sequence kernel association test. At NR1H3, 53 rare variants were jointly associated with FI (P=2.73×10(-3)); of these, 7 were predicted to have regulatory function and showed association with FI (P=1.28×10(-3)). Conditioning on 2 previously associated variants at MADD (rs7944584, rs10838687) did not attenuate this association, suggesting that there are >2 independent signals at 11p11.2. One predicted regulatory variant, chr11:47227430 (hg18; minor allele frequency=0.00068), contributed 20.6% to the overall sequence kernel association test score at NR1H3, lies in intron 2 of NR1H3, and is a predicted binding site for forkhead box A1 (FOXA1), a transcription factor associated with insulin regulation. In human HepG2 hepatoma cells, the rare chr11:47227430 A allele disrupted FOXA1 binding and reduced FOXA1-dependent transcriptional activity. Sequencing at 11p11.2-NR1H3 identified rare variation associated with FI. One variant, chr11:47227430, seems to be functional, with the rare A allele reducing transcription factor FOXA1 binding and FOXA1-dependent transcriptional activity. Show less
Plasma triglyceride levels are heritable and are correlated with the risk of coronary heart disease. Sequencing of the protein-coding regions of the human genome (the exome) has the potential to ident Show more
Plasma triglyceride levels are heritable and are correlated with the risk of coronary heart disease. Sequencing of the protein-coding regions of the human genome (the exome) has the potential to identify rare mutations that have a large effect on phenotype. We sequenced the protein-coding regions of 18,666 genes in each of 3734 participants of European or African ancestry in the Exome Sequencing Project. We conducted tests to determine whether rare mutations in coding sequence, individually or in aggregate within a gene, were associated with plasma triglyceride levels. For mutations associated with triglyceride levels, we subsequently evaluated their association with the risk of coronary heart disease in 110,970 persons. An aggregate of rare mutations in the gene encoding apolipoprotein C3 (APOC3) was associated with lower plasma triglyceride levels. Among the four mutations that drove this result, three were loss-of-function mutations: a nonsense mutation (R19X) and two splice-site mutations (IVS2+1G→A and IVS3+1G→T). The fourth was a missense mutation (A43T). Approximately 1 in 150 persons in the study was a heterozygous carrier of at least one of these four mutations. Triglyceride levels in the carriers were 39% lower than levels in noncarriers (P<1×10(-20)), and circulating levels of APOC3 in carriers were 46% lower than levels in noncarriers (P=8×10(-10)). The risk of coronary heart disease among 498 carriers of any rare APOC3 mutation was 40% lower than the risk among 110,472 noncarriers (odds ratio, 0.60; 95% confidence interval, 0.47 to 0.75; P=4×10(-6)). Rare mutations that disrupt APOC3 function were associated with lower levels of plasma triglycerides and APOC3. Carriers of these mutations were found to have a reduced risk of coronary heart disease. (Funded by the National Heart, Lung, and Blood Institute and others.). Show less
Phenotypes proximal to gene action generally reflect larger genetic effect sizes than those that are distant. The human metabolome, a result of multiple cellular and biological processes, are function Show more
Phenotypes proximal to gene action generally reflect larger genetic effect sizes than those that are distant. The human metabolome, a result of multiple cellular and biological processes, are functional intermediate phenotypes proximal to gene action. Here, we present a genome-wide association study of 308 untargeted metabolite levels among African Americans from the Atherosclerosis Risk in Communities (ARIC) Study. Nineteen significant common variant-metabolite associations were identified, including 13 novel loci (p<1.6 × 10(-10)). These loci were associated with 7-50% of the difference in metabolite levels per allele, and the variance explained ranged from 4% to 20%. Fourteen genes were identified within the nineteen loci, and four of them contained non-synonymous substitutions in four enzyme-encoding genes (KLKB1, SIAE, CPS1, and NAT8); the other significant loci consist of eight other enzyme-encoding genes (ACE, GATM, ACY3, ACSM2B, THEM4, ADH4, UGT1A, TREH), a transporter gene (SLC6A13) and a polycystin protein gene (PKD2L1). In addition, four potential disease-associated paths were identified, including two direct longitudinal predictive relationships: NAT8 with N-acetylornithine, N-acetyl-1-methylhistidine and incident chronic kidney disease, and TREH with trehalose and incident diabetes. These results highlight the value of using endophenotypes proximal to gene function to discover new insights into biology and disease pathology. Show less
Metabolic syndrome (MetS) has become a health and financial burden worldwide. The MetS definition captures clustering of risk factors that predict higher risk for diabetes mellitus and cardiovascular Show more
Metabolic syndrome (MetS) has become a health and financial burden worldwide. The MetS definition captures clustering of risk factors that predict higher risk for diabetes mellitus and cardiovascular disease. Our study hypothesis is that additional to genes influencing individual MetS risk factors, genetic variants exist that influence MetS and inflammatory markers forming a predisposing MetS genetic network. To test this hypothesis a staged approach was undertaken. (a) We analyzed 17 metabolic and inflammatory traits in more than 85,500 participants from 14 large epidemiological studies within the Cross Consortia Pleiotropy Group. Individuals classified with MetS (NCEP definition), versus those without, showed on average significantly different levels for most inflammatory markers studied. (b) Paired average correlations between 8 metabolic traits and 9 inflammatory markers from the same studies as above, estimated with two methods, and factor analyses on large simulated data, helped in identifying 8 combinations of traits for follow-up in meta-analyses, out of 130,305 possible combinations between metabolic traits and inflammatory markers studied. (c) We performed correlated meta-analyses for 8 metabolic traits and 6 inflammatory markers by using existing GWAS published genetic summary results, with about 2.5 million SNPs from twelve predominantly largest GWAS consortia. These analyses yielded 130 unique SNPs/genes with pleiotropic associations (a SNP/gene associating at least one metabolic trait and one inflammatory marker). Of them twenty-five variants (seven loci newly reported) are proposed as MetS candidates. They map to genes MACF1, KIAA0754, GCKR, GRB14, COBLL1, LOC646736-IRS1, SLC39A8, NELFE, SKIV2L, STK19, TFAP2B, BAZ1B, BCL7B, TBL2, MLXIPL, LPL, TRIB1, ATXN2, HECTD4, PTPN11, ZNF664, PDXDC1, FTO, MC4R and TOMM40. Based on large data evidence, we conclude that inflammation is a feature of MetS and several gene variants show pleiotropic genetic associations across phenotypes and might explain a part of MetS correlated genetic architecture. These findings warrant further functional investigation. Show less
Genome-wide association studies (GWAS) have identified ~100 loci associated with blood lipid levels, but much of the trait heritability remains unexplained, and at most loci the identities of the trai Show more
Genome-wide association studies (GWAS) have identified ~100 loci associated with blood lipid levels, but much of the trait heritability remains unexplained, and at most loci the identities of the trait-influencing variants remain unknown. We conducted a trans-ethnic fine-mapping study at 18, 22, and 18 GWAS loci on the Metabochip for their association with triglycerides (TG), high-density lipoprotein cholesterol (HDL-C), and low-density lipoprotein cholesterol (LDL-C), respectively, in individuals of African American (n = 6,832), East Asian (n = 9,449), and European (n = 10,829) ancestry. We aimed to identify the variants with strongest association at each locus, identify additional and population-specific signals, refine association signals, and assess the relative significance of previously described functional variants. Among the 58 loci, 33 exhibited evidence of association at P<1 × 10(-4) in at least one ancestry group. Sequential conditional analyses revealed that ten, nine, and four loci in African Americans, Europeans, and East Asians, respectively, exhibited two or more signals. At these loci, accounting for all signals led to a 1.3- to 1.8-fold increase in the explained phenotypic variance compared to the strongest signals. Distinct signals across ancestry groups were identified at PCSK9 and APOA5. Trans-ethnic analyses narrowed the signals to smaller sets of variants at GCKR, PPP1R3B, ABO, LCAT, and ABCA1. Of 27 variants reported previously to have functional effects, 74% exhibited the strongest association at the respective signal. In conclusion, trans-ethnic high-density genotyping and analysis confirm the presence of allelic heterogeneity, allow the identification of population-specific variants, and limit the number of candidate SNPs for functional studies. Show less
Genome-wide association studies (GWASs) primarily performed in European-ancestry (EA) populations have identified numerous loci associated with body mass index (BMI). However, it is still unclear whet Show more
Genome-wide association studies (GWASs) primarily performed in European-ancestry (EA) populations have identified numerous loci associated with body mass index (BMI). However, it is still unclear whether these GWAS loci can be generalized to other ethnic groups, such as African Americans (AAs). Furthermore, the putative functional variant or variants in these loci mostly remain under investigation. The overall lower linkage disequilibrium in AA compared to EA populations provides the opportunity to narrow in or fine-map these BMI-related loci. Therefore, we used the Metabochip to densely genotype and evaluate 21 BMI GWAS loci identified in EA studies in 29,151 AAs from the Population Architecture using Genomics and Epidemiology (PAGE) study. Eight of the 21 loci (SEC16B, TMEM18, ETV5, GNPDA2, TFAP2B, BDNF, FTO, and MC4R) were found to be associated with BMI in AAs at 5.8 × 10(-5). Within seven out of these eight loci, we found that, on average, a substantially smaller number of variants was correlated (r(2) > 0.5) with the most significant SNP in AA than in EA populations (16 versus 55). Conditional analyses revealed GNPDA2 harboring a potential additional independent signal. Moreover, Metabochip-wide discovery analyses revealed two BMI-related loci, BRE (rs116612809, p = 3.6 × 10(-8)) and DHX34 (rs4802349, p = 1.2 × 10(-7)), which were significant when adjustment was made for the total number of SNPs tested across the chip. These results demonstrate that fine mapping in AAs is a powerful approach for both narrowing in on the underlying causal variants in known loci and discovering BMI-related loci. Show less
Fabiana Quagliarini, Yan Wang, Julia Kozlitina+7 more · 2012 · Proceedings of the National Academy of Sciences of the United States of America · National Academy of Sciences · added 2026-04-24
Angiopoietin-like proteins (ANGPTLs) play major roles in the trafficking and metabolism of lipids. Inactivation of ANGPTL3, a gene located in an intron of DOCK7, results in very low levels of LDL-chol Show more
Angiopoietin-like proteins (ANGPTLs) play major roles in the trafficking and metabolism of lipids. Inactivation of ANGPTL3, a gene located in an intron of DOCK7, results in very low levels of LDL-cholesterol (C), HDL-C and triglyceride (TAG). We identified another ANGPTL family member, ANGPTL8, which is located in the corresponding intron of DOCK6. A variant in this family member (rs2278426, R59W) was associated with lower plasma LDL-C and HDL-C levels in three populations. ANGPTL8 is expressed in liver and adipose tissue, and circulates in plasma of humans. Expression of ANGPTL8 was reduced by fasting and increased by refeeding in both mice and humans. To examine the functional relationship between the two ANGPTL family members, we expressed ANGPTL3 at physiological levels alone or together with ANGPTL8 in livers of mice. Plasma TAG level did not change in mice expressing ANGPTL3 alone, whereas coexpression with ANGPTL8 resulted in hypertriglyceridemia, despite a reduction in circulating ANGPTL3. ANGPTL8 coimmunoprecipitated with the N-terminal domain of ANGPTL3 in plasma of these mice. In cultured hepatocytes, ANGPTL8 expression increased the appearance of N-terminal ANGPTL3 in the medium, suggesting ANGPTL8 may activate ANGPTL3. Consistent with this scenario, expression of ANGPTL8 in Angptl3(-/-) mice failed to promote hypertriglyceridemia. Thus, ANGPTL8, a paralog of ANGPTL3 that arose through duplication of an ancestral DOCK gene, regulates postprandial TAG and fatty acid metabolism by controlling activation of its progenitor, and perhaps other ANGPTLs. Inhibition of ANGPTL8 provides a new therapeutic strategy for reducing plasma lipoprotein levels. Show less
Hyperglycaemia disproportionately affects African-Americans (AfAs). We tested the transferability of 18 single-nucleotide polymorphisms (SNPs) associated with glycaemic traits identified in European a Show more
Hyperglycaemia disproportionately affects African-Americans (AfAs). We tested the transferability of 18 single-nucleotide polymorphisms (SNPs) associated with glycaemic traits identified in European ancestry (EuA) populations in 5,984 non-diabetic AfAs. We meta-analysed SNP associations with fasting glucose (FG) or insulin (FI) in AfAs from five cohorts in the Candidate Gene Association Resource. We: (1) calculated allele frequency differences, variations in linkage disequilibrium (LD), fixation indices (F(st)s) and integrated haplotype scores (iHSs); (2) tested EuA SNPs in AfAs; and (3) interrogated within ± 250 kb around each EuA SNP in AfAs. Allele frequency differences ranged from 0.6% to 54%. F(st) exceeded 0.15 at 6/16 loci, indicating modest population differentiation. All iHSs were <2, suggesting no recent positive selection. For 18 SNPs, all directions of effect were the same and 95% CIs of association overlapped when comparing EuA with AfA. For 17 of 18 loci, at least one SNP was nominally associated with FG in AfAs. Four loci were significantly associated with FG (GCK, p = 5.8 × 10(-8); MTNR1B, p = 8.5 × 10(-9); and FADS1, p = 2.2 × 10(-4)) or FI (GCKR, p = 5.9 × 10(-4)). At GCK and MTNR1B the EuA and AfA SNPs represented the same signal, while at FADS1, and GCKR, the EuA and best AfA SNPs were weakly correlated (r(2) <0.2), suggesting allelic heterogeneity for association with FG at these loci. Few glycaemic SNPs showed strict evidence of transferability from EuA to AfAs. Four loci were significantly associated in both AfAs and those with EuA after accounting for varying LD across ancestral groups, with new signals emerging to aid fine-mapping. Show less
Chronic kidney disease (CKD) is an important public health problem with a genetic component. We performed genome-wide association studies in up to 130,600 European ancestry participants overall, and s Show more
Chronic kidney disease (CKD) is an important public health problem with a genetic component. We performed genome-wide association studies in up to 130,600 European ancestry participants overall, and stratified for key CKD risk factors. We uncovered 6 new loci in association with estimated glomerular filtration rate (eGFR), the primary clinical measure of CKD, in or near MPPED2, DDX1, SLC47A1, CDK12, CASP9, and INO80. Morpholino knockdown of mpped2 and casp9 in zebrafish embryos revealed podocyte and tubular abnormalities with altered dextran clearance, suggesting a role for these genes in renal function. By providing new insights into genes that regulate renal function, these results could further our understanding of the pathogenesis of CKD. Show less
OBJECTIVE The metabolic syndrome (MetS) is defined as concomitant disorders of lipid and glucose metabolism, central obesity, and high blood pressure, with an increased risk of type 2 diabetes and car Show more
OBJECTIVE The metabolic syndrome (MetS) is defined as concomitant disorders of lipid and glucose metabolism, central obesity, and high blood pressure, with an increased risk of type 2 diabetes and cardiovascular disease. This study tests whether common genetic variants with pleiotropic effects account for some of the correlated architecture among five metabolic phenotypes that define MetS. RESEARCH DESIGN AND METHODS Seven studies of the STAMPEED consortium, comprising 22,161 participants of European ancestry, underwent genome-wide association analyses of metabolic traits using a panel of ∼2.5 million imputed single nucleotide polymorphisms (SNPs). Phenotypes were defined by the National Cholesterol Education Program (NCEP) criteria for MetS in pairwise combinations. Individuals exceeding the NCEP thresholds for both traits of a pair were considered affected. RESULTS Twenty-nine common variants were associated with MetS or a pair of traits. Variants in the genes LPL, CETP, APOA5 (and its cluster), GCKR (and its cluster), LIPC, TRIB1, LOC100128354/MTNR1B, ABCB11, and LOC100129150 were further tested for their association with individual qualitative and quantitative traits. None of the 16 top SNPs (one per gene) associated simultaneously with more than two individual traits. Of them 11 variants showed nominal associations with MetS per se. The effects of 16 top SNPs on the quantitative traits were relatively small, together explaining from ∼9% of the variance in triglycerides, 5.8% of high-density lipoprotein cholesterol, 3.6% of fasting glucose, and 1.4% of systolic blood pressure. CONCLUSIONS Qualitative and quantitative pleiotropic tests on pairs of traits indicate that a small portion of the covariation in these traits can be explained by the reported common genetic variants. Show less
Nonalcoholic fatty liver disease (NAFLD) is an escalating health problem that is frequently associated with obesity and insulin resistance. The mechanistic relationship between NAFLD, obesity, and ins Show more
Nonalcoholic fatty liver disease (NAFLD) is an escalating health problem that is frequently associated with obesity and insulin resistance. The mechanistic relationship between NAFLD, obesity, and insulin resistance is not well understood. A nonsynonymous variant in patatin-like phospholipase domain containing 3 (rs738409, I148M) has been reproducibly associated with increased hepatic triglyceride content (HTGC) but has not been associated with either the body mass index (BMI) or indices of insulin resistance. Conversely, two sequence variants in apolipoprotein C3 (APOC3) that have been linked to hypertriglyceridemia (rs2854117 C > T and rs2854116 T > C) have recently been reported to be associated with both hepatic fat content and insulin resistance. Here we genotyped two APOC3 variants in 1228 African Americans, 843 European Americans and 426 Hispanics from a multiethnic population based study, the Dallas Heart Study and test for association with HTGC and homeostatic model of insulin resistance (HOMA-IR). We also examined the relationship between these two variants and HOMA-IR in the Atherosclerosis Risk in Communities (ARIC) study. No significant difference in hepatic fat content was found between carriers and noncarriers in the Dallas Heart Study. Neither APOC3 variant was associated with HOMA-IR in the Dallas Heart Study; this lack of association was confirmed in the ARIC study, even after the analysis was restricted to lean (BMI < 25 kg/m(2) ) individuals (n = 4399). Our data do not support a causal relationship between these two variants in APOC3 and either HTGC or insulin resistance in middle-aged men and women. Show less
Coronary heart disease (CHD) is the leading cause of mortality in African Americans. To identify common genetic polymorphisms associated with CHD and its risk factors (LDL- and HDL-cholesterol (LDL-C Show more
Coronary heart disease (CHD) is the leading cause of mortality in African Americans. To identify common genetic polymorphisms associated with CHD and its risk factors (LDL- and HDL-cholesterol (LDL-C and HDL-C), hypertension, smoking, and type-2 diabetes) in individuals of African ancestry, we performed a genome-wide association study (GWAS) in 8,090 African Americans from five population-based cohorts. We replicated 17 loci previously associated with CHD or its risk factors in Caucasians. For five of these regions (CHD: CDKN2A/CDKN2B; HDL-C: FADS1-3, PLTP, LPL, and ABCA1), we could leverage the distinct linkage disequilibrium (LD) patterns in African Americans to identify DNA polymorphisms more strongly associated with the phenotypes than the previously reported index SNPs found in Caucasian populations. We also developed a new approach for association testing in admixed populations that uses allelic and local ancestry variation. Using this method, we discovered several loci that would have been missed using the basic allelic and global ancestry information only. Our conclusions suggest that no major loci uniquely explain the high prevalence of CHD in African Americans. Our project has developed resources and methods that address both admixture- and SNP-association to maximize power for genetic discovery in even larger African-American consortia. Show less