We determined the relationships between DNA sequence variation and DNA methylation using blood samples from 3,799 Europeans and 3,195 South Asians. We identify 11,165,559 SNP-CpG associations (methylation quantitative trait loci (meQTL), P < 10
Migraine affects over a billion individuals worldwide but its genetic underpinning remains largely unknown. Here, we performed a genome-wide association study of 102,084 migraine cases and 771,257 controls and identified 123 loci, of which 86 are previously unknown. These loci provide an opportunity to evaluate shared and distinct genetic components in the two main migraine subtypes: migraine with aura and migraine without aura. Stratification of the risk loci using 29,679 cases with subtype information indicated three risk variants that seem specific for migraine with aura (in HMOX2, CACNA1A and MPPED2), two that seem specific for migraine without aura (near SPINK2 and near FECH) and nine that increase susceptibility for migraine regardless of subtype. The new risk loci include genes encoding recent migraine-specific drug targets, namely calcitonin gene-related peptide (CALCA/CALCB) and serotonin 1F receptor (HTR1F). Overall, genomic annotations among migraine-associated variants were enriched in both vascular and central nervous system tissue/cell types, supporting unequivocally that neurovascular mechanisms underlie migraine pathophysiology.
The prevalence of type 2 diabetes in youth has increased substantially, yet the genetic underpinnings remain largely unexplored. To identify genetic variants predisposing to youth-onset type 2 diabetes, we formed ProDiGY, a multiethnic collaboration of three studies (TODAY, SEARCH, and T2D-GENES) with 3,006 youth case subjects with type 2 diabetes (mean age 15.1 ± 2.9 years) and 6,061 diabetes-free adult control subjects (mean age 54.2 ± 12.4 years). After stratifying by principal component-clustered ethnicity, we performed association analyses on ∼10 million imputed variants using a generalized linear mixed model incorporating a genetic relationship matrix to account for population structure and adjusting for sex. We identified seven genome-wide significant loci, including the novel locus rs10992863 in
We conducted cohort- and race-specific epigenome-wide association analyses of mitochondrial deoxyribonucleic acid (mtDNA) copy number (mtDNA CN) measured in whole blood from participants of African and European origins in five cohorts (n = 6182, mean age = 57-67 years, 65% women). In the meta-analysis of all the participants, we discovered 21 mtDNA CN-associated DNA methylation sites (CpG) (P < 1 × 10⁻⁷), with a 0.7-3.0 standard deviation increase (3 CpGs) or decrease (18 CpGs) in mtDNA CN corresponding to a 1% increase in DNA methylation. Several significant CpGs have been reported to be associated with at least two risk factors (e.g. chronological age or smoking) for cardiovascular disease (CVD). Five genes [PR/SET domain 16, nuclear receptor subfamily 1 group H member 3 (NR1H3), DNA repair protein, DNA polymerase kappa and decaprenyl-diphosphate synthase subunit 2], which harbor nine significant CpGs, are known to be involved in mitochondrial biosynthesis and functions. For example, NR1H3 encodes a transcription factor that is differentially expressed during an adipose tissue transition. The methylation level of cg09548275 in NR1H3 was negatively associated with mtDNA CN (effect size = -1.71, P = 4 × 10⁻⁸) and was positively associated with the NR1H3 expression level (effect size = 0.43, P = 0.0003), which indicates that the methylation level in NR1H3 may underlie the relationship between mtDNA CN, the NR1H3 transcription factor and energy expenditure. In summary, the study results suggest that mtDNA CN variation in whole blood is associated with DNA methylation levels in genes that are involved in a wide range of mitochondrial activities. These findings will help reveal molecular mechanisms between mtDNA CN and CVD.
The genetic basis of lacunar stroke is poorly understood, with a single locus on 16q24 identified to date. We sought to identify novel associations and provide mechanistic insights into the disease. We did a pooled analysis of data from newly recruited patients with an MRI-confirmed diagnosis of lacunar stroke and existing genome-wide association studies (GWAS). Patients were recruited from hospitals in the UK as part of the UK DNA Lacunar Stroke studies 1 and 2 and from collaborators within the International Stroke Genetics Consortium. Cases and controls were stratified by ancestry and two meta-analyses were done: a European ancestry analysis, and a transethnic analysis that included all ancestry groups. We also did a multi-trait analysis of GWAS, in a joint analysis with a study of cerebral white matter hyperintensities (an aetiologically related radiological trait), to find additional genetic associations. We did a transcriptome-wide association study (TWAS) to detect genes for which expression is associated with lacunar stroke; identified significantly enriched pathways using multi-marker analysis of genomic annotation; and evaluated cardiovascular risk factors causally associated with the disease using mendelian randomisation. Our meta-analysis comprised studies from Europe, the USA, and Australia, including 7338 cases and 254 798 controls, of which 2987 cases (matched with 29 540 controls) were confirmed using MRI. Five loci (ICA1L-WDR12-CARF-NBEAL1, ULK4, SPI1-SLC39A13-PSMC3-RAPSN, ZCCHC14, ZBTB14-EPB41L3) were found to be associated with lacunar stroke in the European or transethnic meta-analyses. A further seven loci (SLC25A44-PMF1-BGLAP, LOX-ZNF474-LOC100505841, FOXF2-FOXQ1, VTA1-GPR126, SH3PXD2A, HTRA1-ARMS2, COL4A2) were found to be associated in the multi-trait analysis with cerebral white matter hyperintensities (n=42 310). Two of the identified loci contain genes (COL4A2 and HTRA1) that are involved in monogenic lacunar stroke. 
The TWAS identified associations between the expression of six genes (SLC25A44, ULK4, CARF, FAM117B, ICA1L, NBEAL1) and lacunar stroke. Pathway analyses implicated disruption of the extracellular matrix, phosphatidylinositol 5 phosphate binding, and roundabout binding (false discovery rate <0.05). Mendelian randomisation analyses identified positive associations of elevated blood pressure, history of smoking, and type 2 diabetes with lacunar stroke. Lacunar stroke has a substantial heritable component, with 12 loci now identified that could represent future treatment targets. These loci provide insights into lacunar stroke pathogenesis, highlighting disruption of the vascular extracellular matrix (COL4A2, LOX, SH3PXD2A, GPR126, HTRA1), pericyte differentiation (FOXF2, GPR126), TGF-β signalling (HTRA1), and myelination (ULK4, GPR126) in disease risk. Funding: British Heart Foundation.
Emily DiBlasi, Andrey A Shabalin, Eric T Monson+21 more · 2021 · American journal of medical genetics. Part B, Neuropsychiatric genetics : the official publication of the International Society of Psychiatric Genetics · Wiley · added 2026-04-24
Identification of genetic factors leading to increased risk of suicide death is critical to combat rising suicide rates; however, only a fraction of the genetic variation influencing risk has been accounted for. To address this limitation, we conducted the first comprehensive analysis of rare genetic variation in suicide death leveraging the largest suicide death biobank, the Utah Suicide Genetic Risk Study (USGRS). We conducted a single-variant association analysis of rare (minor allele frequency <1%) putatively functional single-nucleotide polymorphisms (SNPs) present on the Illumina PsychArray genotyping array in 2,672 USGRS suicide deaths of non-Finnish European (NFE) ancestry and 51,583 NFE controls from the Genome Aggregation Database. Secondary analyses used an independent control sample of 21,324 NFE controls from the Psychiatric Genomics Consortium. Five novel, high-impact, rare SNPs were identified with significant associations with suicide death (SNAPC1, rs75418419; TNKS1BP1, rs143883793; ADGRF5, rs149197213; PER1, rs145053802; and ESS2, rs62223875). These high-impact SNPs were carried by 119 suicide decedents. Both PER1 and SNAPC1 have other supporting gene-level evidence of suicide risk, and psychiatric associations exist for PER1 (bipolar disorder, schizophrenia), and for TNKS1BP1 and ESS2 (schizophrenia). Three of the genes (PER1, TNKS1BP1, and ADGRF5), together with additional genes implicated by genome-wide association studies on suicidal behavior, showed significant enrichment in immune system, homeostatic and signal transduction processes. No specific diagnostic phenotypes were associated with the subset of suicide deaths with the identified rare variants. These findings suggest an important role for rare variants in suicide risk and implicate genes and gene pathways for targeted replication.
Dementia with Lewy bodies (DLB) and Parkinson's disease (PD) are disorders that are clinically, pathologically and etiologically embedded in the Lewy body disease (LBD) continuum, characterized by neuronal α-synuclein pathology. Rare homozygous and compound heterozygous premature termination codon (PTC) mutations in the Vacuolar Protein Sorting 13 homolog C gene (VPS13C) are associated with early-onset recessive PD. In two siblings with early-onset (age < 45) and autopsy-confirmed DLB, we observed compound heterozygous missense mutations in VPS13C, p.Trp395Cys and p.Ala444Pro, inherited from their healthy parents in a recessive manner. In lymphoblast cells of the index patient, the missense mutations reduced VPS13C expression by 90% (p = 0.0002). Subsequently, we performed targeted resequencing of VPS13C in 844 LBD patients and 664 control persons. Using the optimized sequence kernel association test, we obtained a significant association (p = 0.0233) of rare VPS13C genetic variants (minor allele frequency ≤ 1%) with LBD. Among the LBD patients, we identified one patient with homozygous missense mutations and three with compound heterozygous missense mutations in trans position, indicative of recessive inheritance. In four patients with compound heterozygous mutations, we were unable to determine trans position. The frequency of LBD patient carriers of proven recessive compound heterozygous missense mutations is 0.59% (5/844). In autopsy brain tissue of two unrelated LBD patients, the recessive compound heterozygous missense mutations reduced VPS13C expression. Overexpression of wild-type or mutant VPS13C in HeLa or SH-SY5Y cells demonstrated that the mutations p.Trp395Cys or p.Ala444Pro abolish the endosomal/lysosomal localization of VPS13C. Overall, our data indicate that rare missense mutations in VPS13C are associated with LBD and recessive compound heterozygous missense mutations might have variable effects on the expression and functioning of VPS13C. 
We conclude that, comparable to the recessively inherited PTC mutations in VPS13C, combinations of rare recessive compound heterozygous missense mutations reduce VPS13C expression and contribute to increased risk of LBD.
Blood pressure (BP) was inconsistently associated with migraine and the mechanisms of BP-lowering medications in migraine prophylaxis are unknown. Leveraging large-scale summary statistics for migraine (N
The current study aimed to identify metabolites associated with age-related macular degeneration (AMD) by performing the largest metabolome association analysis in AMD to date, as well as aiming to determine the effect of AMD-associated genetic variants on metabolite levels and investigate associations between the identified metabolites and activity of the complement system, one of the main AMD-associated disease pathways. Case-control association analysis of metabolomics data. Five European cohorts consisting of 2267 AMD patients and 4266 control participants. Metabolomics was performed using a high-throughput proton nuclear magnetic resonance metabolomics platform, which allows quantification of 146 metabolite measurements and 79 derivative values. Metabolome-AMD associations were studied using univariate logistic regression analyses. The effect of 52 AMD-associated genetic variants on the identified metabolites was investigated using linear regression. In addition, associations between the identified metabolites and activity of the complement pathway (defined by the C3d-to-C3 ratio) were investigated using linear regression. Metabolites associated with AMD. We identified 60 metabolites that were associated significantly with AMD, including increased levels of large and extra-large high-density lipoprotein (HDL) subclasses and decreased levels of very low-density lipoprotein (VLDL), amino acids, and citrate. Of 52 AMD-associated genetic variants, 7 variants were associated significantly with 34 of the identified metabolites. The strongest associations were identified for genetic variants located in or near genes involved in lipid metabolism (ABCA1, CETP, APOE, and LIPC) with metabolites belonging to the large and extra-large HDL subclasses. Also, 57 of 60 metabolites were associated significantly with complement activation levels, independent of AMD status. 
Increased large and extra-large HDL levels and decreased VLDL and amino acid levels were associated with increased complement activation. Lipoprotein levels were associated with AMD-associated genetic variants, whereas decreased essential amino acids may point to nutritional deficiencies in AMD. We observed strong associations between the vast majority of the AMD-associated metabolites and systemic complement activation levels, independent of AMD status. This may indicate biological interactions between the main AMD disease pathways and suggests that multiple pathways may need to be targeted simultaneously for successful treatment of AMD.
Gene-gene interactions (G × G) potentially play a role in the etiology of complex human diseases, including inflammatory bowel disease (IBD), and may partially explain their 'missing heritability'. Using the largest genotype dataset available for IBD (16,636 Crohn's disease (CD) and 12,888 ulcerative colitis (UC) cases) we analyzed G × G with the powerful case-only (CO) design. We studied 169 single nucleotide polymorphisms (SNPs) for CD (156 for UC), previously shown to be associated with the respective diseases. To ensure the validity of the CO design, we confined our analysis to pairs of unlinked SNPs. We used principal component analysis at the center level to adjust for possible causes of genotypic association other than G × G, such as population stratification and genotyping batch effects. Results from center-wise logistic regression analyses were combined by a random effects meta-analysis. A number of nominally significant ( We were able to exemplify the utility of the CO design for analyzing G × G, but had to recognize that such interactions are probably scarce for IBD.
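The pooling step described in the abstract above (center-wise regression estimates combined by a random effects meta-analysis) can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not say which random-effects estimator was used, so the widely used DerSimonian-Laird estimator is assumed here, and the function name `random_effects_meta` and its inputs (per-center log odds ratios and standard errors) are hypothetical.

```python
import math


def random_effects_meta(betas, ses):
    """Pool per-center effect estimates (e.g. log odds ratios) with the
    DerSimonian-Laird random-effects estimator (an assumed choice).

    betas: per-center effect estimates
    ses:   their standard errors
    Returns (pooled_beta, pooled_se, tau2, two_sided_p).
    """
    k = len(betas)
    # Fixed-effect (inverse-variance) weights and pooled estimate.
    w = [1.0 / se ** 2 for se in ses]
    sw = sum(w)
    beta_fixed = sum(wi * b for wi, b in zip(w, betas)) / sw

    # Cochran's Q measures between-center heterogeneity.
    q = sum(wi * (b - beta_fixed) ** 2 for wi, b in zip(w, betas))

    # Method-of-moments estimate of the between-center variance tau^2,
    # truncated at zero.
    c = sw - sum(wi ** 2 for wi in w) / sw
    tau2 = max(0.0, (q - (k - 1)) / c) if c > 0 else 0.0

    # Random-effects weights add tau^2 to each center's variance.
    w_star = [1.0 / (se ** 2 + tau2) for se in ses]
    sws = sum(w_star)
    beta = sum(wi * b for wi, b in zip(w_star, betas)) / sws
    se = math.sqrt(1.0 / sws)

    # Two-sided p-value from the normal approximation.
    z = beta / se
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return beta, se, tau2, p
```

When the centers agree, the estimated between-center variance is zero and the result coincides with the fixed-effect estimate; when they disagree, the weights flatten and the pooled standard error grows.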
High serum urate is a prerequisite for gout and associated with metabolic disease. Genome-wide association studies (GWAS) have reported dozens of loci associated with serum urate control; however, there has been little progress in understanding the molecular basis of the associated loci. Here, we employed trans-ancestral meta-analysis using data from European and East Asian populations to identify 10 new loci for serum urate levels. Genome-wide colocalization with cis-expression quantitative trait loci (eQTL) identified a further five new candidate loci. By cis- and trans-eQTL colocalization analysis, we identified 34 and 20 genes, respectively, for which the causal eQTL variant is highly likely to be shared with the serum urate-associated locus. One new locus identified was SLC22A9, which encodes organic anion transporter 7 (OAT7). We demonstrate that OAT7 is a very weak urate-butyrate exchanger. Newly implicated genes identified in the eQTL analysis include those encoding proteins that make up the dystrophin complex, a scaffold for signaling proteins and transporters at the cell membrane; MLXIP, which, with the previously identified MLXIPL, is a transcription factor that may regulate serum urate via the pentose-phosphate pathway; and MRPS7 and IDH2, which encode proteins necessary for mitochondrial function. Functional fine mapping identified six loci (RREB1, INHBC, HLF, UBE2Q2, SFMBT1 and HNF4G) with colocalized eQTL containing putative causal SNPs. This systematic analysis of serum urate GWAS loci identified candidate causal genes at 24 loci and a network of previously unidentified genes likely involved in control of serum urate levels, further illuminating the molecular mechanisms of urate control.
Although hundreds of genome-wide association studies-implicated loci have been reported for adult obesity-related traits, less is known about the genetics specific for early-onset obesity, and only a few studies have been conducted in non-European populations to date. Searching for additional genetic variants associated with childhood obesity, we performed a trans-ancestral meta-analysis of 30 studies consisting of up to 13 005 cases (≥95th percentile of body mass index (BMI) achieved 2-18 years old) and 15 599 controls (consistently <50th percentile of BMI) of European, African, North/South American and East Asian ancestry. Suggestive loci were taken forward for replication in a sample of 1888 cases and 4689 controls from seven cohorts of European and North/South American ancestry. In addition to observing 18 previously implicated BMI or obesity loci, for both early and late onset, we uncovered one completely novel locus in this trans-ancestral analysis (nearest gene, METTL15). The variant was nominally associated with only the European subgroup analysis but had a consistent direction of effect in other ethnicities. We then utilized trans-ancestral Bayesian analysis to narrow down the location of the probable causal variant at each genome-wide significant signal. Of all the fine-mapped loci, we were able to narrow down the causative variant at four known loci to fewer than 10 single nucleotide polymorphisms (SNPs) (FAIM2, GNPDA2, MC4R and SEC16B loci). In conclusion, an ethnically diverse setting has enabled us to both identify an additional pediatric obesity locus and further fine-map existing loci.
Antipsychotic-induced metabolic disturbance (AIMD) is a common adverse effect of antipsychotics with genetics partly underpinning variation in susceptibility among schizophrenia patients. The melanocortin 4 receptor (MC4R) gene, one of the candidate genes for AIMD, has been under-studied in Chinese patients. We conducted a pharmacogenetic study in a large cohort of Chinese patients with schizophrenia. In this study, we investigated the genetic variation of MC4R in the Chinese population by genotyping two SNPs (rs489693 and rs17782313) in 1,991 Chinese patients and examined the association of these variants with the metabolic effects that were often observed to be related to AIMD. Metabolic measures, including body mass index (BMI), waist circumference (WC), glucose, triglyceride, high-density lipoprotein (HDL), and low-density lipoprotein (LDL) levels, were assessed at baseline and after 6-week antipsychotic treatment. We found that the interaction of SNP×medication status (drug-naïve/medicated) was significantly associated with BMI, WC, and HDL change %, respectively. Both SNPs were significantly associated with baseline BMI and WC in the medicated group. Moderate associations of rs489693 with WC, triglyceride, and HDL change % were observed in the whole sample. In the drug-naïve group, we found recessive effects of rs489693 on BMI gain of more than 7%, WC and triglyceride change %, with AA incurring more metabolic adverse effects. In conclusion, the association between rs489693 and the metabolic measures is ubiquitous but moderate. Rs17782313 is less involved in AIMD. The two SNPs confer risk of AIMD to patients treated with different antipsychotics in a similar way.
Genetic and epidemiologic studies have shown that lipid genes and high-density lipoproteins (HDLs) are implicated in age-related macular degeneration (AMD). We studied circulating lipid levels in relationship to AMD in a large European dataset. Pooled analysis of cross-sectional data. Individuals (N = 30 953) aged 50 years or older participating in the European Eye Epidemiology (E3) consortium and 1530 individuals from the Rotterdam Study with lipid subfraction data. AMD features were graded on fundus photographs using the Rotterdam classification. Routine blood lipid measurements, genetics, medication, and potential confounders were extracted from the E3 database. In a subgroup of the Rotterdam Study, lipid subfractions were identified by the Nightingale biomarker platform. Random-intercepts mixed-effects models incorporating confounders and study site as a random effect were used to estimate associations. AMD features and stage; lipid measurements. HDL was associated with an increased risk of AMD (odds ratio [OR], 1.21 per 1-mmol/l increase; 95% confidence interval [CI], 1.14-1.29), whereas triglycerides were associated with a decreased risk (OR, 0.94 per 1-mmol/l increase; 95% CI, 0.91-0.97). Both were associated with drusen size. Higher HDL raised the odds of larger drusen, whereas higher triglycerides decreased the odds. LDL cholesterol reached statistical significance only in the association with early AMD (P = 0.045). Regarding lipid subfractions, the concentration of extra-large HDL particles showed the most prominent association with AMD (OR, 1.24; 95% CI, 1.10-1.40). The cholesteryl ester transfer protein risk variant (rs17231506) for AMD was in line with increased HDL levels (P = 7.7 × 10 Our study suggested that HDL cholesterol is associated with increased risk of AMD and that triglycerides are negatively associated. Both show the strongest association with early AMD and drusen. 
Extra-large HDL subfractions seem to be drivers in the relationship with AMD, and variants in lipid genes play a more ambiguous role in this association. Whether systemic lipids directly influence AMD or represent lipid metabolism in the retina remains to be answered.
The Iberian Peninsula stands out as having variable levels of population admixture and isolation, making Spain an interesting setting for studying the genetic architecture of neurodegenerative disease
We sought to identify susceptibility genes for high-grade serous ovarian cancer (HGSOC) by performing a transcriptome-wide association study of gene expression and splice junction usage in HGSOC-relevant tissue types (N = 2,169) and the largest genome-wide association study available for HGSOC (N = 13,037 cases and 40,941 controls). We identified 25 transcriptome-wide association study significant genes, 7 at the junction level only, including LRRC46 at 19q21.32, (P = 1 × 10
The objective of this study was to perform a proof-of-concept experiment that validates a precision medicine approach to identify variants associated with hypertrophic cardiomyopathy (HCM). We hypothesized that whole-genome sequencing would identify variant(s) associated with HCM in two affected Maine Coon/Maine Coon cross cats when compared with 79 controls of various breeds. Two affected and two control Maine Coon/Maine Coon cross cats had whole-genome sequencing performed at approximately 30× coverage. Variants were called in these four cats and 77 cats of various breeds as part of the 99 Lives Cat Genome Sequencing Initiative ( http://felinegenetics.missouri.edu/99lives ) using Platypus v0.7.9.1 and annotated with dbSNP ID, and the variants' effects were predicted by SnpEff. Strict filtering criteria (alternate allele frequency >49%) were applied to identify homozygous-alternate or heterozygous variants in the two HCM-affected samples when compared with 79 controls of various breeds. A total of four variants were identified in the two Maine Coon/Maine Coon cross cats with HCM when compared with 79 controls after strict filtering. Three of the variants identified in genes This proof-of-concept experiment identified the previously reported
To identify genes and genetic markers associated with corneal astigmatism. A meta-analysis of genome-wide association studies (GWASs) of corneal astigmatism undertaken for 14 European ancestry (n=22,250) and 8 Asian ancestry (n=9,120) cohorts was performed by the Consortium for Refractive Error and Myopia. Cases were defined as having >0.75 diopters of corneal astigmatism. Subsequent gene-based and gene-set analyses of the meta-analyzed results of European ancestry cohorts were performed using VEGAS2 and MAGMA software. Additionally, estimates of single nucleotide polymorphism (SNP)-based heritability for corneal and refractive astigmatism and the spherical equivalent were calculated for Europeans using LD score regression. The meta-analysis of all cohorts identified a genome-wide significant locus near the platelet-derived growth factor receptor alpha ( In addition to replicating a previously identified genome-wide significant locus for corneal astigmatism near the
Genome-wide association studies (GWAS) have identified >250 loci for body mass index (BMI), implicating pathways related to neuronal biology. Most GWAS loci represent clusters of common, noncoding variants from which pinpointing causal genes remains challenging. Here we combined data from 718,734 individuals to discover rare and low-frequency (minor allele frequency (MAF) < 5%) coding variants associated with BMI. We identified 14 coding variants in 13 genes, of which 8 variants were in genes (ZBTB7B, ACHE, RAPGEF3, RAB21, ZFHX3, ENTPD6, ZFR2 and ZNF169) newly implicated in human obesity, 2 variants were in genes (MC4R and KSR2) previously observed to be mutated in extreme obesity and 2 variants were in GIPR. The effect sizes of rare variants are ~10 times larger than those of common variants, with the largest effect observed in carriers of an MC4R mutation introducing a stop codon (p.Tyr35Ter, MAF = 0.01%), who weighed ~7 kg more than non-carriers. Pathway analyses based on the variants associated with BMI confirm enrichment of neuronal genes and provide new evidence for adipocyte and energy expenditure biology, widening the potential of genetically supported therapeutic targets in obesity.
The Million Veteran Program (MVP) was established in 2011 as a national research initiative to determine how genetic variation influences the health of US military veterans. Here we genotyped 312,571 MVP participants using a custom biobank array and linked the genetic data to laboratory and clinical phenotypes extracted from electronic health records covering a median of 10.0 years of follow-up. Among 297,626 veterans with at least one blood lipid measurement, including 57,332 black and 24,743 Hispanic participants, we tested up to around 32 million variants for association with lipid levels and identified 118 novel genome-wide significant loci after meta-analysis with data from the Global Lipids Genetics Consortium (total n > 600,000). Through a focus on mutations predicted to result in a loss of gene function and a phenome-wide association study, we propose novel indications for pharmaceutical inhibitors targeting PCSK9 (abdominal aortic aneurysm), ANGPTL4 (type 2 diabetes) and PDE3B (triglycerides and coronary disease).
Migraine and major depressive disorder (MDD) are common brain disorders that frequently co-occur. Despite epidemiological evidence that migraine and MDD share a genetic basis, their overlap at the molecular genetic level has not been thoroughly investigated. Using single-nucleotide polymorphism (SNP) and gene-based analysis of genome-wide association study (GWAS) genotype data, we found significant genetic overlap across the two disorders. LD Score regression revealed a significant SNP-based heritability for both migraine (h
A novel autosomal recessive disorder characterized by pre- and postnatal growth restriction with microcephaly, distinctive craniofacial features, congenital alopecia, hypoplastic kidneys with renal insufficiency, global developmental delay, severe congenital sensorineural hearing loss, early mortality, hydrocephalus, and genital hypoplasia was observed in 4 children from 3 families of New Mexican Hispanic heritage. Three of the children died before 3 years of age from uremia and/or sepsis. Exome sequencing of the surviving individual identified a homozygous c.587T>C (p.Ile196Thr) mutation in ZPR1 Zinc Finger (ZPR1) that segregated appropriately in her family. In a second family, the identical variant was shown to be heterozygous in the affected individual's parents and not homozygous in any of her unaffected siblings. ZPR1 is a ubiquitously expressed, highly conserved protein postulated to transmit proliferative signals from the cell membrane to the nucleus. Structural modeling reveals that p.Ile196Thr disrupts the hydrophobic core of ZPR1. Patient fibroblast cells showed no detectable levels of ZPR1 and the cells showed a defect in cell cycle progression where a significant number of cells remained arrested in the G1 phase. We provide genetic and molecular evidence that a homozygous missense mutation in ZPR1 is associated with a rare and recognizable multisystem syndrome.
We screened variants on an exome-focused genotyping array in >300,000 participants (replication in >280,000 participants) and identified 444 independent variants in 250 loci significantly associated with total cholesterol (TC), high-density-lipoprotein cholesterol (HDL-C), low-density-lipoprotein cholesterol (LDL-C), and/or triglycerides (TG). At two loci (JAK2 and A1CF), experimental analysis in mice showed lipid changes consistent with the human data. We also found that: (i) beta-thalassemia trait carriers displayed lower TC and were protected from coronary artery disease (CAD); (ii) excluding the CETP locus, there was not a predictable relationship between plasma HDL-C and risk for age-related macular degeneration; (iii) only some mechanisms of lowering LDL-C appeared to increase risk for type 2 diabetes (T2D); and (iv) TG-lowering alleles involved in hepatic production of TG-rich lipoproteins (TM6SF2 and PNPLA3) tracked with higher liver fat, higher risk for T2D, and lower risk for CAD, whereas TG-lowering alleles involved in peripheral lipolysis (LPL and ANGPTL4) had no effect on liver fat but decreased risks for both T2D and CAD.
Apolipoprotein A-IV (apoA-IV) has been observed to be associated with lipids, kidney function, adiposity- and diabetes-related parameters. To assess the causal relationship of apoA-IV with these phenotypes, we conducted bidirectional Mendelian randomization (MR) analyses using publicly available summary-level datasets from GWAS consortia on apoA-IV concentrations (n = 13,813), kidney function (estimated glomerular filtration rate (eGFR), n = 133,413), lipid traits (HDL cholesterol, LDL cholesterol, triglycerides, n = 188,577), adiposity-related traits (body-mass-index (n = 322,206), waist-hip-ratio (n = 210,088)) and fasting glucose (n = 133,010). Main analyses consisted of inverse-variance weighted and multivariable MR, whereas MR-Egger regression and weighted median estimation were used as sensitivity analyses. We found that eGFR is likely to have a causal effect on apoA-IV concentrations (53 SNPs; causal effect estimate per 1-SD increase in eGFR = -0.39; 95% CI = [-0.54, -0.24]; p-value = 2.4e-07). Triglyceride concentrations were also causally associated with apoA-IV concentrations (40 SNPs; causal effect estimate per 1-SD increase in triglycerides = -0.06; 95% CI = [-0.08, -0.04]; p-value = 4.8e-07), independently of HDL-C and LDL-C concentrations (causal effect estimate from multivariable MR = -0.06; 95% CI = [-0.10, -0.02]; p-value = 0.0014). Evaluating the inverse direction of causality revealed a possible causal effect of apoA-IV on HDL cholesterol (2 SNPs; causal effect estimate per one percent increase in apoA-IV = -0.40; 95% CI = [-0.60, -0.21]; p-value = 5.5e-05).
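The inverse-variance weighted analysis named in the abstract above can be illustrated with a minimal sketch. It assumes the standard fixed-effect IVW estimator, that is, a weighted regression through the origin of SNP-outcome effects on SNP-exposure effects with weights 1/SE². The function name `ivw_estimate` and the toy summary statistics are hypothetical and are not taken from the study.

```python
from math import sqrt


def ivw_estimate(beta_exposure, beta_outcome, se_outcome):
    """Fixed-effect inverse-variance weighted MR estimate (illustrative sketch).

    beta_exposure: per-SNP effects on the exposure
    beta_outcome:  per-SNP effects on the outcome
    se_outcome:    per-SNP standard errors of the outcome effects
    Returns (causal estimate, standard error).
    """
    if not (len(beta_exposure) == len(beta_outcome) == len(se_outcome)):
        raise ValueError("all inputs must have one entry per SNP")
    # Weight each SNP by the inverse variance of its outcome effect.
    weights = [1.0 / se ** 2 for se in se_outcome]
    numerator = sum(w * bx * by
                    for w, bx, by in zip(weights, beta_exposure, beta_outcome))
    denominator = sum(w * bx * bx for w, bx in zip(weights, beta_exposure))
    estimate = numerator / denominator
    std_error = sqrt(1.0 / denominator)
    return estimate, std_error


# Hypothetical summary statistics for three SNPs (not from the study).
toy_beta_exposure = [0.1, 0.2, 0.3]
toy_beta_outcome = [-0.04, -0.08, -0.12]
toy_se_outcome = [0.01, 0.01, 0.01]

estimate, std_error = ivw_estimate(toy_beta_exposure, toy_beta_outcome, toy_se_outcome)
ci_low, ci_high = estimate - 1.96 * std_error, estimate + 1.96 * std_error
print(f"IVW estimate {estimate:.3f}, 95% CI [{ci_low:.3f}, {ci_high:.3f}]")
```

The sensitivity analyses mentioned in the abstract (MR-Egger, weighted median) relax the assumption, implicit in this sketch, that every SNP is a valid instrument.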
Genome-wide association studies have so far identified 56 loci associated with risk of coronary artery disease (CAD). Many CAD loci show pleiotropy; that is, they are also associated with other diseases or traits. This study sought to systematically test if genetic variants identified for non-CAD diseases/traits also associate with CAD and to undertake a comprehensive analysis of the extent of pleiotropy of all CAD loci. In discovery analyses involving 42,335 CAD cases and 78,240 control subjects we tested the association of 29,383 common (minor allele frequency >5%) single nucleotide polymorphisms available on the exome array, which included a substantial proportion of known or suspected single nucleotide polymorphisms associated with common diseases or traits as of 2011. Suggestive association signals were replicated in an additional 30,533 cases and 42,530 control subjects. To evaluate pleiotropy, we tested CAD loci for association with cardiovascular risk factors (lipid traits, blood pressure phenotypes, body mass index, diabetes, and smoking behavior), as well as with other diseases/traits through interrogation of currently available genome-wide association study catalogs. We identified 6 new loci associated with CAD at genome-wide significance: on 2q37 (KCNJ13-GIGYF2), 6p21 (C2), 11p15 (MRVI1-CTR9), 12q13 (LRP1), 12q24 (SCARB1), and 16q13 (CETP). Risk allele frequencies ranged from 0.15 to 0.86, and odds ratio per copy of the risk allele ranged from 1.04 to 1.09. Of 62 new and known CAD loci, 24 (38.7%) showed statistical association with a traditional cardiovascular risk factor, with some showing multiple associations, and 29 (47%) showed associations at p < 1 × 10 We identified 6 loci associated with CAD at genome-wide significance. Several CAD loci show substantial pleiotropy, which may help us understand the mechanisms by which these loci affect CAD risk.
Genetic differences in the target proteins, metabolizing enzymes and transporters that contribute to inter-individual differences in drug response are not integrated in contemporary drug development programs. Ayurveda, which has propelled many drug discovery programs, albeit in the search for new chemical entities, incorporates inter-individual variability ("Prakriti") in the development and administration of drugs in an individualized manner. The Prakriti of an individual largely determines responsiveness to the external environment, including drugs, as well as susceptibility to diseases. Prakriti has also been shown to have molecular and genomic correlates. We highlight how integration of Prakriti concepts can augment the efficiency of drug discovery and development programs through a unique initiative of the Ayurgenomics TRISUTRA consortium. Five aspects that have been carried out are (1) analysis of variability in FDA-approved pharmacogenomics genes/SNPs in exomes of 72 healthy individuals including predominant Prakriti types and matched controls from a North Indian Indo-European cohort; (2) establishment of a consortium network and development of five genetically homogeneous cohorts from diverse ethnic and geo-climatic backgrounds; (3) identification of parameters and development of uniform standard protocols for objective assessment of Prakriti types; (4) development of protocols for Prakriti evaluation and its application in more than 7500 individuals in the five cohorts; and (5) development of a data and sample repository and integrative omics pipelines for identification of genomic correlates. Highlights of the study are: (1) Exome sequencing revealed significant differences between Prakriti types in 28 SNPs of 11 FDA-approved genes of pharmacogenomic relevance, viz. CYP2C19, CYP2B6, ESR1, F2, PGR, HLA-B, HLA-DQA1, HLA-DRB1, LDLR, CFTR and CPS1. These variations are polymorphic in diverse Indian and world populations included in the 1000 Genomes Project.
(2) Based on the phenotypic attributes of Prakriti, we identified anthropometry for anatomical features, biophysical parameters for skin types, HRV for autonomic function tests, spirometry for vital capacity and gustometry for taste thresholds as objective parameters. (3) Comparison of Prakriti phenotypes across different ethnic, age and gender groups led to identification of invariant features as well as some that require weighted consideration across the cohorts. Considering the molecular and genomic differences underlying Prakriti and their relevance in disease pharmacogenomics studies, this novel integrative platform would help in the identification of differently susceptible and drug-responsive populations. Additionally, integrated analysis of phenomic and genomic variations would allow not only identification of clinical and genomic markers of Prakriti for application in personalized medicine but also its integration in drug discovery and development programs.
Over the past decade genome-wide association studies (GWAS) have been applied to aid in the understanding of the biology of traits. The success of this approach is governed by the underlying effect sizes carried by the true risk variants and the corresponding statistical power to observe such effects given the study design and sample size under investigation. Previous ASD GWAS have identified genome-wide significant (GWS) risk loci; however, these studies had only low statistical power to identify GWS loci at the lower effect sizes (odds ratio (OR) <1.15). We conducted a large-scale coordinated international collaboration to combine independent genotyping data to improve the statistical power and aid in robust discovery of GWS loci. This study uses genome-wide genotyping data from a discovery sample (7387 ASD cases and 8567 controls) followed by meta-analysis of summary statistics from two replication sets (7783 ASD cases and 11359 controls; and 1369 ASD cases and 137308 controls). We observe a GWS locus at 10q24.32 that overlaps several genes including This study is an important step in the ongoing endeavour to identify the loci which underpin the common variant signal in ASD. In addition to novel GWS loci, we have identified a significant genetic correlation with schizophrenia and association of ASD with several neurodevelopmental-related genes such as
Hand grip strength is a widely used proxy of muscular fitness, a marker of frailty, and predictor of a range of morbidities and all-cause mortality. To investigate the genetic determinants of variation in grip strength, we perform a large-scale genetic discovery analysis in a combined sample of 195,180 individuals and identify 16 loci associated with grip strength (P<5 × 10
The gut incretin hormones glucagon-like peptide-1 (GLP-1) and glucose-dependent insulinotropic peptide (GIP) have a major role in the pathophysiology of type 2 diabetes. Specific genetic and dietary factors have been found to influence the release and action of incretins. We examined the effect of interactions between seven incretin-related genetic variants in GIPR, KCNQ1, TCF7L2 and WFS1 and dietary components (whey-containing dairy, cereal fibre, coffee and olive oil) on the risk of type 2 diabetes in the European Prospective Investigation into Cancer and Nutrition (EPIC)-InterAct study. The current case-cohort study included 8086 incident type 2 diabetes cases and a representative subcohort of 11,035 participants (median follow-up: 12.5 years). Prentice-weighted Cox proportional hazard regression models were used to investigate the associations and interactions between the dietary factors and genes in relation to the risk of type 2 diabetes. An interaction (p = 0.048) between TCF7L2 variants and coffee intake was apparent, with an inverse association between coffee and type 2 diabetes present among carriers of the diabetes risk allele (T) in rs12255372 (GG: HR 0.99 [95% CI 0.97, 1.02] per cup of coffee; GT: HR 0.96 [95% CI 0.93, 0.98]; and TT: HR 0.93 [95% CI 0.88, 0.98]). In addition, an interaction (p = 0.005) between an incretin-specific genetic risk score and coffee was observed, again with a stronger inverse association with coffee in carriers with more risk alleles (0-3 risk alleles: HR 0.99 [95% CI 0.94, 1.04]; 7-10 risk alleles: HR 0.95 [95% CI 0.90, 0.99]). None of these associations were statistically significant after correction for multiple testing. Our large-scale case-cohort study provides some evidence for a possible interaction of TCF7L2 variants and an incretin-specific genetic risk score with coffee consumption in relation to the risk of type 2 diabetes. 
Further large-scale studies and/or meta-analyses are needed to confirm these interactions in other populations.
A large number of genetic loci are associated with adult body mass index. However, the genetics of childhood body mass index are largely unknown. We performed a meta-analysis of genome-wide association studies of childhood body mass index, using sex- and age-adjusted standard deviation scores. We included 35 668 children from 20 studies in the discovery phase and 11 873 children from 13 studies in the replication phase. In total, 15 loci reached genome-wide significance (P-value < 5 × 10^-8) in the joint discovery and replication analysis, of which 12 are previously identified loci in or close to ADCY3, GNPDA2, TMEM18, SEC16B, FAIM2, FTO, TFAP2B, TNNI3K, MC4R, GPR61, LMX1B and OLFM4 associated with adult body mass index or childhood obesity. We identified three novel loci: rs13253111 near ELP3, rs8092503 near RAB27B and rs13387838 near ADAM23. Per additional risk allele, body mass index increased 0.04 Standard Deviation Score (SDS) [Standard Error (SE) 0.007], 0.05 SDS (SE 0.008) and 0.14 SDS (SE 0.025), for rs13253111, rs8092503 and rs13387838, respectively. A genetic risk score combining all 15 SNPs showed that each additional average risk allele was associated with a 0.073 SDS (SE 0.011, P-value = 3.12 × 10^-10) increase in childhood body mass index in a population of 1955 children. This risk score explained 2% of the variance in childhood body mass index. This study highlights the shared genetic background between childhood and adult body mass index and adds three novel loci. These loci likely represent age-related differences in strength of the associations with body mass index.
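The genetic risk score in the abstract above can be sketched as a simple allele count. This is a minimal illustration that assumes an unweighted score (the abstract does not state how the score was constructed or scaled); the function names and both genotype vectors are hypothetical, and only the 0.073 SDS per-allele figure comes from the abstract.

```python
# Per-allele effect reported in the abstract (SDS per additional risk allele).
PER_ALLELE_EFFECT_SDS = 0.073


def risk_score(risk_allele_counts):
    """Unweighted genetic risk score: total number of risk alleles carried.

    Each entry is the count of risk alleles (0, 1 or 2) at one SNP.
    """
    for count in risk_allele_counts:
        if count not in (0, 1, 2):
            raise ValueError("each genotype must be 0, 1 or 2 risk alleles")
    return sum(risk_allele_counts)


def expected_sds_difference(score_a, score_b,
                            per_allele_effect=PER_ALLELE_EFFECT_SDS):
    """Expected BMI SDS difference between two scores under a linear model."""
    return (score_a - score_b) * per_allele_effect


# Hypothetical genotypes at 15 SNPs for two children (not study data).
child_a = [1, 2, 0, 1, 1, 2, 0, 1, 1, 0, 2, 1, 1, 0, 1]
child_b = [0, 1, 0, 1, 1, 1, 0, 1, 1, 0, 1, 1, 1, 0, 1]

difference = expected_sds_difference(risk_score(child_a), risk_score(child_b))
print(f"Expected BMI difference: {difference:.3f} SDS")
```

Because the score explained only 2% of the variance in childhood body mass index, a difference computed this way describes an average shift across a population, not a prediction for an individual child.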