Food allergy (FA) arises from a complex interplay between an individual's genetic predisposition and environmental factors, and its prevalence is increasing. Genome-wide association studies to date have been hindered by small sample sizes and varying FA definitions. We sought to identify novel FA risk loci by conducting a genome-wide association study meta-analysis in children and adults by using a multiphenotype approach to ensure a good trade-off between sufficient sample size and valid FA definitions. Analyses were conducted separately in children and adults on the basis of the following FA phenotypes: self-report, doctor diagnosis, food-specific sensitization, and doctor diagnosis plus food-specific sensitization. A meta-analysis was performed of genome-wide association studies from up to 16 cohorts of people of European ancestry including 229,426 adults and 14,234 children. Models were adjusted for sex, age, principal components, and, if applicable, further study-specific confounders. Sensitivity models were additionally adjusted for hay fever. Replication was conducted in additional external cohorts and a validation in oral food challenge-defined FA cases. Thirty-seven single nucleotide polymorphisms met suggestive significance (P < 1 × 10 This study identified 37 single nucleotide polymorphisms suggestively associated with FA and demonstrated genetic differences across phenotypes. It highlights the need for a unified FA definition and sheds light on FA's shared genetic architecture with allergies.
Acute myeloid leukemia (AML) is a complex hematologic malignancy with multiple disease subgroups defined by somatic mutations and heterogeneous outcomes. Although genome-wide association studies (GWAS) have identified a small number of common genetic variants influencing AML risk, the heritable component of this disease outside of familial susceptibility remains largely undefined. Here, we perform a meta-analysis of 4 published GWAS plus 2 new GWAS, totaling 4710 AML cases and 12 938 controls. We identify a new genome-wide significant risk locus for pan-AML at 2p23.3 (rs4665765; P = 1.35 × 10⁻⁸; EFR3B, POMC, DNMT3A, and DNAJC27), which also significantly associates with patient survival (P = 6.09 × 10⁻³). Our analysis also identifies 3 new genome-wide significant risk loci for disease subgroups, including AML with deletions of chromosome 5 and/or 7 at 1q23.3 (rs12078864; P = 7.0 × 10⁻¹⁰; DUSP23) and cytogenetically complex AML at 2q33.3 (rs12988876; P = 3.28 × 10⁻⁸; PARD3B) and 2p21 (rs79918355; P = 1.60 × 10⁻⁹; EPCAM). We also investigated loci previously associated with the risk of clonal hematopoiesis (CH) or CH of indeterminate potential and identified several variants associated with the risk of AML. Our results further inform on AML etiology and demonstrate the existence of disease subgroup-specific risk loci.
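As a rough illustration of how per-study GWAS estimates for one variant can be pooled, the sketch below applies fixed-effect inverse-variance weighting, a common meta-analysis scheme. The abstract does not state which model was used, so the choice of method, the function name `fixed_effect_meta`, and the example numbers are all assumptions made for illustration.

```python
import math
from typing import Sequence, Tuple


def fixed_effect_meta(betas: Sequence[float], ses: Sequence[float]) -> Tuple[float, float]:
    """Pool per-study effect estimates with inverse-variance weights.

    Returns the pooled effect and its standard error.
    """
    if len(betas) != len(ses) or not betas:
        raise ValueError("betas and ses must be non-empty and of equal length")
    weights = [1.0 / (se * se) for se in ses]  # w_i = 1 / SE_i^2
    total = sum(weights)
    pooled_beta = sum(w * b for w, b in zip(weights, betas)) / total
    pooled_se = math.sqrt(1.0 / total)
    return pooled_beta, pooled_se


# Hypothetical log-odds ratios and standard errors from three studies
# (invented values, not taken from the abstract).
beta, se = fixed_effect_meta([0.10, 0.30, 0.20], [0.05, 0.05, 0.10])
print(f"pooled beta = {beta:.3f}, SE = {se:.3f}")
```

Studies with smaller standard errors receive more weight, which is why large cohorts tend to dominate a pooled estimate.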
Proprotein convertase subtilisin/kexin type 9 (PCSK9) is a key player of lipid metabolism with higher plasma levels in women throughout their life. Statin treatment affects PCSK9 levels, also showing evidence of sex-differential effects. It remains unclear whether these differences can be explained by genetics. We performed genome-wide association meta-analyses (GWAS) of PCSK9 levels stratified for sex and statin treatment in six independent studies of Europeans (8936 women and 11,080 men; 14,825 statin-free and 5191 statin-treated individuals). Loci associated in one of the strata were tested for statin- and sex-interactions considering all independent signals per locus. Independent variants at the PCSK9 gene locus were then used in a stratified Mendelian Randomization analysis (cis-MR) of PCSK9 effects on low-density lipoprotein cholesterol (LDL-C) levels to detect differences of causal effects between the subgroups. We identified 11 loci associated with PCSK9 in at least one stratified subgroup (p < 1.0 × 10 We performed the first double-stratified GWAS of PCSK9 levels and identified multiple biologically plausible loci with genetic interaction effects. Our results indicate that the observed sexual dimorphism of PCSK9 and its statin-related interactions have a genetic basis. Significant differences in the causal relationship between PCSK9 and LDL-C suggest sex-specific dosages of PCSK9 inhibitors.
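To make the idea of combining several variants in a Mendelian randomization analysis concrete, here is a minimal sketch of the standard inverse-variance weighted (IVW) estimator. It assumes uncorrelated instruments, which may not hold for cis variants in linkage disequilibrium, and the abstract does not say which estimator the authors used. The function name `ivw_estimate` and all numbers are hypothetical.

```python
from typing import Sequence


def ivw_estimate(
    beta_exposure: Sequence[float],
    beta_outcome: Sequence[float],
    se_outcome: Sequence[float],
) -> float:
    """Inverse-variance weighted causal estimate across variants.

    Equivalent to a weighted regression through the origin of the
    variant-outcome effects on the variant-exposure effects.
    """
    if not (len(beta_exposure) == len(beta_outcome) == len(se_outcome)):
        raise ValueError("all inputs must have the same length")
    num = sum(bx * by / (se * se)
              for bx, by, se in zip(beta_exposure, beta_outcome, se_outcome))
    den = sum(bx * bx / (se * se)
              for bx, se in zip(beta_exposure, se_outcome))
    if den == 0:
        raise ValueError("instruments have no effect on the exposure")
    return num / den


# Invented per-variant effects on the exposure (e.g. a protein level)
# and on the outcome (e.g. a lipid level), computed within one stratum.
estimate = ivw_estimate([0.20, 0.35, 0.15], [0.08, 0.15, 0.05], [0.02, 0.03, 0.02])
print(f"IVW causal estimate: {estimate:.3f}")
```

Running the same function separately on each stratum's summary statistics would give per-subgroup estimates that could then be compared.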
Identifying genetic determinants of reproductive success may highlight mechanisms underlying fertility and identify alleles under present-day selection. Using data in 785,604 individuals of European ancestry, we identified 43 genomic loci associated with either number of children ever born (NEB) or childlessness. These loci span diverse aspects of reproductive biology, including puberty timing, age at first birth, sex hormone regulation, endometriosis and age at menopause. Missense variants in ARHGAP27 were associated with higher NEB but shorter reproductive lifespan, suggesting a trade-off at this locus between reproductive ageing and intensity. Other genes implicated by coding variants include PIK3IP1, ZFP82 and LRP4, and our results suggest a new role for the melanocortin 1 receptor (MC1R) in reproductive biology. As NEB is one component of evolutionary fitness, our identified associations indicate loci under present-day natural selection. Integration with data from historical selection scans highlighted an allele in the FADS1/2 gene locus that has been under selection for thousands of years and remains so today. Collectively, our findings demonstrate that a broad range of biological mechanisms contribute to reproductive success.
We determined the relationships between DNA sequence variation and DNA methylation using blood samples from 3,799 Europeans and 3,195 South Asians. We identify 11,165,559 SNP-CpG associations (methylation quantitative trait loci (meQTL), P < 10
Lean body mass (LM) plays an important role in mobility and metabolic function. We previously identified five loci associated with LM adjusted for fat mass in kilograms. Such an adjustment may reduce the power to identify genetic signals having an association with both lean mass and fat mass. We aimed to determine the impact of different fat mass adjustments on the genetic architecture of LM and to identify additional LM loci. We performed genome-wide association analyses for whole-body LM (20 cohorts of European ancestry with n = 38,292), measured using dual-energy X-ray absorptiometry or bioelectrical impedance analysis, adjusted for sex, age, age², and height with or without fat mass adjustments (Model 1: no fat adjustment; Model 2: adjustment for fat mass as a percentage of body mass; Model 3: adjustment for fat mass in kilograms). Seven single-nucleotide polymorphisms (SNPs) in separate loci, including one novel LM locus (TNRC6B), were successfully replicated in an additional 47,227 individuals from 29 cohorts. Based on the strengths of the associations in Model 1 vs Model 3, we divided the LM loci into those with an effect on both lean mass and fat mass in the same direction and refer to those as "sumo wrestler" loci (FTO and MC4R). In contrast, loci with an impact specifically on LM were termed "body builder" loci (VCAN and ADAMTSL3). Using existing available genome-wide association study databases, LM-increasing alleles of SNPs in "sumo wrestler" loci were associated with an adverse metabolic profile, whereas LM-increasing alleles of SNPs in "body builder" loci were associated with metabolic protection. In conclusion, we identified one novel LM locus (TNRC6B). Our results suggest that a genetically determined increase in lean mass might exert either harmful or protective effects on metabolic traits, depending on its relation to fat mass.
We screened variants on an exome-focused genotyping array in >300,000 participants (replication in >280,000 participants) and identified 444 independent variants in 250 loci significantly associated with total cholesterol (TC), high-density-lipoprotein cholesterol (HDL-C), low-density-lipoprotein cholesterol (LDL-C), and/or triglycerides (TG). At two loci (JAK2 and A1CF), experimental analysis in mice showed lipid changes consistent with the human data. We also found that: (i) beta-thalassemia trait carriers displayed lower TC and were protected from coronary artery disease (CAD); (ii) excluding the CETP locus, there was not a predictable relationship between plasma HDL-C and risk for age-related macular degeneration; (iii) only some mechanisms of lowering LDL-C appeared to increase risk for type 2 diabetes (T2D); and (iv) TG-lowering alleles involved in hepatic production of TG-rich lipoproteins (TM6SF2 and PNPLA3) tracked with higher liver fat, higher risk for T2D, and lower risk for CAD, whereas TG-lowering alleles involved in peripheral lipolysis (LPL and ANGPTL4) had no effect on liver fat but decreased risks for both T2D and CAD.
Diabetes-associated metabolites may aid the identification of new risk variants for type 2 diabetes. Using targeted metabolomics within a subsample of the German EPIC-Potsdam study (n = 2500), we tested previously published SNPs for their association with diabetes-associated metabolites and conducted an additional exploratory analysis using data from the exome chip including replication within 2,692 individuals from the German KORA F4 study. We identified a total of 16 loci associated with diabetes-related metabolite traits, including one novel association between rs499974 (MOGAT2) and a diacyl-phosphatidylcholine ratio (PC aa C40:5/PC aa C38:5). Gene-based tests on all exome chip variants revealed associations between GFRAL and PC aa C42:1/PC aa C42:0, BIN1 and SM (OH) C22:2/SM C18:0, and TFRC and SM (OH) C22:2/SM C16:1. Selecting variants for gene-based tests based on functional annotation identified one additional association between OR51Q1 and hexoses. Among single genetic variants consistently associated with diabetes-related metabolites, two (rs174550 (FADS1), rs3204953 (REV3L)) were significantly associated with type 2 diabetes in large-scale meta-analysis for type 2 diabetes. In conclusion, we identified a novel metabolite locus in single variant analyses and four genes within gene-based tests and confirmed two previously known mGWAS loci which might be relevant for the risk of type 2 diabetes.
Apolipoprotein A-IV (apoA-IV) is a major component of HDL and chylomicron particles and is involved in reverse cholesterol transport. It is an early marker of impaired renal function. We aimed to identify genetic loci associated with apoA-IV concentrations and to investigate relationships with known susceptibility loci for kidney function and lipids. A genome-wide association meta-analysis on apoA-IV concentrations was conducted in five population-based cohorts (n = 13,813) followed by two additional replication studies (n = 2,267) including approximately 10 M SNPs. Three independent SNPs from two genomic regions were significantly associated with apoA-IV concentrations: rs1729407 near APOA4 (P = 6.77 × 10
Identification of novel biomarkers for type 2 diabetes and their genetic determinants could lead to improved understanding of causal pathways and improve risk prediction. In this study, we used data from non-targeted metabolomics performed using liquid chromatography coupled with tandem mass spectrometry in three Swedish cohorts (Uppsala Longitudinal Study of Adult Men [ULSAM], n = 1138; Prospective Investigation of the Vasculature in Uppsala Seniors [PIVUS], n = 970; TwinGene, n = 1630). Metabolites associated with impaired fasting glucose (IFG) and/or prevalent type 2 diabetes were assessed for associations with incident type 2 diabetes in the three cohorts followed by replication attempts in the Cooperative Health Research in the Region of Augsburg (KORA) S4 cohort (n = 855). Assessment of the association of metabolite-regulating genetic variants with type 2 diabetes was done using data from a meta-analysis of genome-wide association studies. Out of 5961 investigated metabolic features, 1120 were associated with prevalent type 2 diabetes and IFG and 70 were annotated to metabolites and replicated in the three cohorts. Fifteen metabolites were associated with incident type 2 diabetes in the four cohorts combined (358 events) following adjustment for age, sex, BMI, waist circumference and fasting glucose. Novel findings included associations of higher values of the bile acid deoxycholic acid and monoacylglyceride 18:2 and lower concentrations of cortisol with type 2 diabetes risk. However, adding metabolites to an existing risk score improved model fit only marginally. A genetic variant within the CYP7A1 locus, encoding the rate-limiting enzyme in bile acid synthesis, was found to be associated with lower concentrations of deoxycholic acid, higher concentrations of LDL-cholesterol and lower type 2 diabetes risk. Variants in or near SGPP1, GCKR and FADS1/2 were associated with diabetes-associated phospholipids and type 2 diabetes. 
We found evidence that the metabolism of bile acids and phospholipids shares some common genetic origin with type 2 diabetes. Metabolomics data have been deposited in the Metabolights database, with accession numbers MTBLS93 (TwinGene), MTBLS124 (ULSAM) and MTBLS90 (PIVUS).
Genome-wide association studies with metabolic traits (mGWAS) uncovered many genetic variants that influence human metabolism. These genetically influenced metabotypes (GIMs) contribute to our metabolic individuality, our capacity to respond to environmental challenges, and our susceptibility to specific diseases. While metabolic homeostasis in blood is a well investigated topic in large mGWAS with over 150 known loci, metabolic detoxification through urinary excretion has only been addressed by few small mGWAS with only 11 associated loci so far. Here we report the largest mGWAS to date, combining targeted and non-targeted 1H NMR analysis of urine samples from 3,861 participants of the SHIP-0 cohort and 1,691 subjects of the KORA F4 cohort. We identified and replicated 22 loci with significant associations with urinary traits, 15 of which are new (HIBCH, CPS1, AGXT, XYLB, TKT, ETNPPL, SLC6A19, DMGDH, SLC36A2, GLDC, SLC6A13, ACSM3, SLC5A11, PNMT, SLC13A3). Two-thirds of the urinary loci also have a metabolite association in blood. For all but one of the 6 loci where significant associations target the same metabolite in blood and urine, the genetic effects have the same direction in both fluids. In contrast, for the SLC5A11 locus, we found increased levels of myo-inositol in urine whereas mGWAS in blood reported decreased levels for the same genetic variant. This might indicate less effective re-absorption of myo-inositol in the kidneys of carriers. In summary, our study more than doubles the number of known loci that influence urinary phenotypes. It thus allows novel insights into the relationship between blood homeostasis and its regulation through excretion. The newly discovered loci also include variants previously linked to chronic kidney disease (CPS1, SLC6A13), pulmonary hypertension (CPS1), and ischemic stroke (XYLB). 
By establishing connections from gene to disease via metabolic traits our results provide novel hypotheses about molecular mechanisms involved in the etiology of diseases.
Metformin is used as a first-line oral treatment for type 2 diabetes (T2D). However, the underlying mechanism is not fully understood. Here, we aimed to comprehensively investigate the pleiotropic effects of metformin. We analyzed both metabolomic and genomic data of the population-based KORA cohort. To evaluate the effect of metformin treatment on metabolite concentrations, we quantified 131 metabolites in fasting serum samples and used multivariable linear regression models in three independent cross-sectional studies (n = 151 patients with T2D treated with metformin [mt-T2D]). Additionally, we used linear mixed-effect models to study the longitudinal KORA samples (n = 912) and performed mediation analyses to investigate the effects of metformin intake on blood lipid profiles. We combined genotyping data with the identified metformin-associated metabolites in KORA individuals (n = 1,809) and explored the underlying pathways. We found significantly lower (P < 5.0E-06) concentrations of three metabolites (acyl-alkyl phosphatidylcholines [PCs]) when comparing mt-T2D with four control groups who were not using glucose-lowering oral medication. These findings were controlled for conventional risk factors of T2D and replicated in two independent studies. Furthermore, we observed that the levels of these metabolites decreased significantly in patients after they started metformin treatment during 7 years' follow-up. The reduction of these metabolites was also associated with a lowered blood level of LDL cholesterol (LDL-C). Variations of these three metabolites were significantly associated with 17 genes (including FADS1 and FADS2) and controlled by AMPK, a metformin target. Our results indicate that metformin intake activates AMPK and consequently suppresses FADS, which leads to reduced levels of the three acyl-alkyl PCs and LDL-C. Our findings suggest potential beneficial effects of metformin in the prevention of cardiovascular disease.
Coffee, a major dietary source of caffeine, is among the most widely consumed beverages in the world and has received considerable attention regarding health risks and benefits. We conducted a genome-wide (GW) meta-analysis of predominately regular-type coffee consumption (cups per day) among up to 91,462 coffee consumers of European ancestry with top single-nucleotide polymorphisms (SNPs) followed up in ~30 062 and 7964 coffee consumers of European and African-American ancestry, respectively. Studies from both stages were combined in a trans-ethnic meta-analysis. Confirmed loci were examined for putative functional and biological relevance. Eight loci, including six novel loci, met GW significance (log10 Bayes factor (BF) > 5.64) with per-allele effect sizes of 0.03-0.14 cups per day. Six are located in or near genes potentially involved in pharmacokinetics (ABCG2, AHR, POR and CYP1A2) and pharmacodynamics (BDNF and SLC6A4) of caffeine. Two map to GCKR and MLXIPL genes related to metabolic traits but lacking known roles in coffee consumption. Enhancer and promoter histone marks populate the regions of many confirmed loci and several potential regulatory SNPs are highly correlated with the lead SNP of each. SNP alleles near GCKR, MLXIPL, BDNF and CYP1A2 that were associated with higher coffee consumption have previously been associated with smoking initiation, higher adiposity and fasting insulin and glucose but lower blood pressure and favorable lipid, inflammatory and liver enzyme profiles (P<5 × 10(-8)). Our genetic findings among European and African-American adults reinforce the role of caffeine in mediating habitual coffee consumption and may point to molecular mechanisms underlying inter-individual variability in pharmacological and health effects of coffee.
Emerging technologies based on mass spectrometry or nuclear magnetic resonance enable the monitoring of hundreds of small metabolites from tissues or body fluids. Profiling of metabolites can help elucidate causal pathways linking established genetic variants to known disease risk factors such as blood lipid traits. We applied statistical methodology to dissect causal relationships between single nucleotide polymorphisms, metabolite concentrations, and serum lipid traits, focusing on 95 genetic loci reproducibly associated with the four main serum lipids (total-, low-density lipoprotein-, and high-density lipoprotein- cholesterol and triglycerides). The dataset used included 2,973 individuals from two independent population-based cohorts with data for 151 small molecule metabolites and four main serum lipids. Three statistical approaches, namely conditional analysis, Mendelian randomization, and structural equation modeling, were compared to investigate causal relationship at sets of a single nucleotide polymorphism, a metabolite, and a lipid trait associated with one another. A subset of three lipid-associated loci (FADS1, GCKR, and LPA) have a statistically significant association with at least one main lipid and one metabolite concentration in our data, defining a total of 38 cross-associated sets of a single nucleotide polymorphism, a metabolite and a lipid trait. Structural equation modeling provided sufficient discrimination to indicate that the association of a single nucleotide polymorphism with a lipid trait was mediated through a metabolite at 15 of the 38 sets, and involving variants at the FADS1 and GCKR loci. These data provide a framework for evaluating the causal role of components of the metabolome (or other intermediate factors) in mediating the association between established genetic variants and diseases or traits.
Forced vital capacity (FVC), a spirometric measure of pulmonary function, reflects lung volume and is used to diagnose and monitor lung diseases. We performed genome-wide association study meta-analysis of FVC in 52,253 individuals from 26 studies and followed up the top associations in 32,917 additional individuals of European ancestry. We found six new regions associated at genome-wide significance (P < 5 × 10(-8)) with FVC in or near EFEMP1, BMP6, MIR129-2-HSD17B12, PRDM11, WWOX and KCNJ2. Two loci previously associated with spirometric measures (GSTCD and PTCH1) were related to FVC. Newly implicated regions were followed up in samples from African-American, Korean, Chinese and Hispanic individuals. We detected transcripts for all six newly implicated genes in human lung tissue. The new loci may inform mechanisms involved in lung development and the pathogenesis of restrictive lung disease.
Nuclear magnetic resonance spectroscopy (NMR) provides robust readouts of many metabolic parameters in one experiment. However, identification of clinically relevant markers in (1)H NMR spectra is a major challenge. Association of NMR-derived quantities with genetic variants can uncover biologically relevant metabolic traits. Using NMR data of plasma samples from 1,757 individuals from the KORA study together with 655,658 genetic variants, we show that ratios between NMR intensities at two chemical shift positions can provide informative and robust biomarkers. We report seven loci of genetic association with NMR-derived traits (APOA1, CETP, CPS1, GCKR, FADS1, LIPC, PYROXD2) and characterize these traits biochemically using mass spectrometry. These ratios may now be used in clinical studies.
Adverse levels of lipoproteins are highly heritable and constitute risk factors for cardiovascular outcomes. Hitherto, genome-wide association studies revealed 95 lipid-associated loci. However, due to the small effect sizes of these associations large sample numbers (>100 000 samples) were needed. Here we show that analyzing more refined lipid phenotypes, namely lipoprotein subfractions, can increase the number of significantly associated loci compared with bulk high-density lipoprotein and low-density lipoprotein analysis in a study with identical sample numbers. Moreover, lipoprotein subfractions provide novel insight into the human lipid metabolism. We measured 15 lipoprotein subfractions (L1-L15) in 1791 samples using (1)H-NMR (nuclear magnetic resonance) spectroscopy. Using cluster analyses, we quantified inter-relationships among lipoprotein subfractions. Additionally, we analyzed associations with subfractions at known lipid loci. We identified five distinct groups of subfractions: one (L1) was only marginally captured by serum lipids and therefore extends our knowledge of lipoprotein biochemistry. During a lipid-tolerance test, L1 lost its special position. In the association analysis, we found that eight loci (LIPC, CETP, PLTP, FADS1-2-3, SORT1, GCKR, APOB, APOA1) were associated with the subfractions, whereas only four loci (CETP, SORT1, GCKR, APOA1) were associated with serum lipids. For LIPC, we observed a 10-fold increase in the variance explained by our regression models. In conclusion, NMR-based fine mapping of lipoprotein subfractions provides novel information on their biological nature and strengthens the associations with genetic loci. Future clinical studies are now needed to investigate their biomedical relevance.
Metabolomic profiling and the integration of whole-genome genetic association data has proven to be a powerful tool to comprehensively explore gene regulatory networks and to investigate the effects of genetic variation at the molecular level. Serum metabolite concentrations allow a direct readout of biological processes, and association of specific metabolomic signatures with complex diseases such as Alzheimer's disease and cardiovascular and metabolic disorders has been shown. There are well-known correlations between sex and the incidence, prevalence, age of onset, symptoms, and severity of a disease, as well as the reaction to drugs. However, most of the studies published so far did not consider the role of sexual dimorphism and did not analyse their data stratified by gender. This study investigated sex-specific differences of serum metabolite concentrations and their underlying genetic determination. For discovery and replication we used more than 3,300 independent individuals from KORA F3 and F4 with metabolite measurements of 131 metabolites, including amino acids, phosphatidylcholines, sphingomyelins, acylcarnitines, and C6-sugars. A linear regression approach revealed significant concentration differences between males and females for 102 out of 131 metabolites (p-values<3.8×10(-4); Bonferroni-corrected threshold). Sex-specific genome-wide association studies (GWAS) showed genome-wide significant differences in beta-estimates for SNPs in the CPS1 locus (carbamoyl-phosphate synthase 1, significance level: p<3.8×10(-10); Bonferroni-corrected threshold) for glycine. We showed that the metabolite profiles of males and females are significantly different and, furthermore, that specific genetic variants in metabolism-related genes depict sexual dimorphism. Our study provides new important insights into sex-specific differences of cell regulatory processes and underscores that studies should consider sex-specific effects in design and interpretation.
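The two Bonferroni-corrected thresholds quoted above are consistent with dividing a nominal level by the 131 metabolites tested. The short sketch below reproduces that arithmetic. The nominal levels of 0.05 and 5 × 10⁻⁸ are assumptions, since the abstract reports only the corrected values.

```python
N_METABOLITES = 131   # number of metabolites tested (from the abstract)
ALPHA = 0.05          # assumed nominal significance level
GWAS_ALPHA = 5e-8     # assumed conventional genome-wide threshold


def bonferroni(alpha: float, n_tests: int) -> float:
    """Bonferroni correction: divide the significance level by the number of tests."""
    return alpha / n_tests


metabolite_threshold = bonferroni(ALPHA, N_METABOLITES)
gwas_threshold = bonferroni(GWAS_ALPHA, N_METABOLITES)

print(f"{metabolite_threshold:.1e}")  # 3.8e-04
print(f"{gwas_threshold:.1e}")        # 3.8e-10
```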
Concentrations of liver enzymes in plasma are widely used as indicators of liver disease. We carried out a genome-wide association study in 61,089 individuals, identifying 42 loci associated with concentrations of liver enzymes in plasma, of which 32 are new associations (P = 10(-8) to P = 10(-190)). We used functional genomic approaches including metabonomic profiling and gene expression analyses to identify probable candidate genes at these regions. We identified 69 candidate genes, including genes involved in biliary transport (ATP8B1 and ABCB11), glucose, carbohydrate and lipid metabolism (FADS1, FADS2, GCKR, JMJD1C, HNF1A, MLXIPL, PNPLA3, PPP1R3B, SLC2A2 and TRIB1), glycoprotein biosynthesis and cell surface glycobiology (ABO, ASGR1, FUT2, GPLD1 and ST3GAL4), inflammation and immunity (CD276, CDH6, GCKR, HNF1A, HPR, ITGA1, RORA and STAT4) and glutathione metabolism (GSTT1, GSTT2 and GGT), as well as several genes of uncertain or unknown function (including ABHD12, EFHD1, EFNA1, EPHA2, MICAL3 and ZNF827). Our results provide new insight into genetic mechanisms and pathways influencing markers of liver function.
Restless legs syndrome (RLS) is a sleep related movement disorder that occurs both in an idiopathic form and in symptomatic varieties. RLS is a frequent and distressing comorbidity in end stage renal disease (ESRD). For idiopathic RLS (iRLS), genetic risk factors have been identified, but their role in RLS in ESRD has not been investigated yet. Therefore, a case-control association study of these variants in ESRD patients was performed. The study genotyped 10 iRLS associated variants at four loci encompassing the genes MEIS1, BTBD9, MAP2K5/SKOR1, and PTPRD, in two independent case-control samples from Germany and Greece using multiplex PCR and MALDI-TOF (matrix assisted laser desorption/ionisation time-of-flight) mass spectrometry. Statistical analysis was performed as logistic regression with age and gender as covariates. For the combined analysis a Cochran-Mantel-Haenszel test was applied. The study included 200 RLS-positive and 443 RLS-negative ESRD patients in the German sample, and 141 and 393 patients, respectively, in the Greek sample. In the German sample, variants in MEIS1 and BTBD9 were associated with RLS in ESRD (P(nom)≤0.004, ORs 1.52 and 1.55), whereas, in the Greek sample, there was a trend for association to MAP2K5/SKOR1 and BTBD9 (P(nom)≤0.08, ORs 1.41 and 1.33). In the combined analysis including all samples, BTBD9 was associated after correction for multiple testing (P(corrected)=0.0013, OR 1.47). This is the first demonstration of a genetic influence on RLS in ESRD patients with BTBD9 being significantly associated. The extent of the genetic predisposition could vary between different subgroups of RLS in ESRD.
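To illustrate what a combined analysis across two samples with a Cochran-Mantel-Haenszel test involves, the sketch below computes the Mantel-Haenszel pooled odds ratio and the CMH chi-square statistic (one degree of freedom, no continuity correction) for stratified 2×2 tables. The strata stand in for the two country samples; the counts, the function names, and the choice to omit the continuity correction are assumptions for illustration and are not taken from the study.

```python
from typing import Sequence, Tuple

# One 2x2 table per stratum: (a, b, c, d) =
# (exposed cases, unexposed cases, exposed controls, unexposed controls).
Table = Tuple[int, int, int, int]


def mantel_haenszel_or(tables: Sequence[Table]) -> float:
    """Mantel-Haenszel pooled odds ratio across strata."""
    num = sum(a * d / (a + b + c + d) for a, b, c, d in tables)
    den = sum(b * c / (a + b + c + d) for a, b, c, d in tables)
    return num / den


def cmh_statistic(tables: Sequence[Table]) -> float:
    """CMH chi-square statistic (1 df), without continuity correction."""
    diff = 0.0
    var = 0.0
    for a, b, c, d in tables:
        n = a + b + c + d
        expected = (a + b) * (a + c) / n  # E[a] under no association
        diff += a - expected
        var += (a + b) * (c + d) * (a + c) * (b + d) / (n * n * (n - 1))
    return diff * diff / var


# Invented toy counts for two strata (hypothetical, not study data).
strata = [(30, 70, 20, 80), (25, 75, 15, 85)]
print(f"pooled OR = {mantel_haenszel_or(strata):.2f}")
print(f"CMH chi-square = {cmh_statistic(strata):.2f}")
```

Stratifying by sample keeps differences in allele frequency or case proportion between the two populations from masquerading as an association.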
C-reactive protein (CRP) is a heritable marker of chronic inflammation that is strongly associated with cardiovascular disease. We sought to identify genetic variants that are associated with CRP levels. We performed a genome-wide association analysis of CRP in 66 185 participants from 15 population-based studies. We sought replication for the genome-wide significant and suggestive loci in a replication panel comprising 16 540 individuals from 10 independent studies. We found 18 genome-wide significant loci, and we provided evidence of replication for 8 of them. Our results confirm 7 previously known loci and introduce 11 novel loci that are implicated in pathways related to the metabolic syndrome (APOC1, HNF1A, LEPR, GCKR, HNF4A, and PTPN2) or the immune system (CRP, IL6R, NLRP3, IL1F10, and IRF1) or that reside in regions previously not known to play a role in chronic inflammation (PPP1R3B, SALL1, PABPC4, ASCL1, RORA, and BCL7B). We found a significant interaction of body mass index with LEPR (P < 2.9 × 10⁻⁶). A weighted genetic risk score that was developed to summarize the effect of risk alleles was strongly associated with CRP levels and explained ≈5% of the trait variance; however, there was no evidence for these genetic variants explaining the association of CRP with coronary heart disease. We identified 18 loci that were associated with CRP levels. Our study highlights immune response and metabolic regulatory pathways involved in the regulation of chronic inflammation.
Obesity is globally prevalent and highly heritable, but its underlying genetic factors remain largely elusive. To identify genetic loci for obesity susceptibility, we examined associations between body mass index and ~2.8 million SNPs in up to 123,865 individuals with targeted follow-up of 42 SNPs in up to 125,931 additional individuals. We confirmed 14 known obesity susceptibility loci and identified 18 new loci associated with body mass index (P < 5 × 10⁻⁸), one of which includes a copy number variant near GPRC5B. Some loci (at MC4R, POMC, SH2B1 and BDNF) map near key hypothalamic regulators of energy balance, and one of these loci is near GIPR, an incretin receptor. Furthermore, genes in other newly associated loci may provide new insights into human body weight regulation.
Higher resting heart rate is associated with increased cardiovascular disease and mortality risk. Though heritable factors play a substantial role in population variation, little is known about specific genetic determinants. This knowledge can impact clinical care by identifying novel factors that influence pathologic heart rate states, modulate heart rate through cardiac structure and function or by improving our understanding of the physiology of heart rate regulation. To identify common genetic variants associated with heart rate, we performed a meta-analysis of 15 genome-wide association studies (GWAS), including 38,991 subjects of European ancestry, estimating the association between age-, sex- and body mass-adjusted RR interval (inverse heart rate) and approximately 2.5 million markers. Results with P < 5 × 10⁻⁸ were considered genome-wide significant. We constructed regression models with multiple markers to assess whether results at less stringent thresholds were likely to be truly associated with RR interval. We identified six novel associations with resting heart rate at six loci: 6q22 near GJA1; 14q12 near MYH7; 12p12 near SOX5, c12orf67, BCAT1, LRMP and CASC1; 6q22 near SLC35F1, PLN and c6orf204; 7q22 near SLC12A9 and UfSp1; and 11q12 near FADS1. Associations at 6q22 400 kb away from GJA1, at 14q12 MYH6 and at 1q32 near CD34 identified in previously published GWAS were confirmed. In aggregate, these variants explain approximately 0.7% of RR interval variance. A multivariant regression model including 20 variants with P < 10⁻⁵ increased the explained variance to 1.6%, suggesting that some loci falling short of genome-wide significance are likely truly associated. Future research is warranted to elucidate underlying mechanisms that may impact clinical care.
Levels of circulating glucose are tightly regulated. To identify new loci influencing glycemic traits, we performed meta-analyses of 21 genome-wide association studies informative for fasting glucose, fasting insulin and indices of beta-cell function (HOMA-B) and insulin resistance (HOMA-IR) in up to 46,186 nondiabetic participants. Follow-up of 25 loci in up to 76,558 additional subjects identified 16 loci associated with fasting glucose and HOMA-B and two loci associated with fasting insulin and HOMA-IR. These include nine loci newly associated with fasting glucose (in or near ADCY5, MADD, ADRA2A, CRY2, FADS1, GLIS3, SLC2A2, PROX1 and C2CD4B) and one influencing fasting insulin and HOMA-IR (near IGF1). We also demonstrated association of ADCY5, PROX1, GCK, GCKR and DGKB-TMEM195 with type 2 diabetes. Within these loci, likely biological candidate genes influence signal transduction, cell proliferation, development, glucose-sensing and circadian regulation. Our results demonstrate that genetic studies of glycemic traits can identify type 2 diabetes risk loci, as well as loci containing gene variants that are associated with a modest elevation in glucose levels but are not associated with overt diabetes.
Serum metabolite concentrations provide a direct readout of biological processes in the human body, and they are associated with disorders such as cardiovascular and metabolic diseases. We present a genome-wide association study (GWAS) of 163 metabolic traits measured in human blood from 1,809 participants from the KORA population, with replication in 422 participants of the TwinsUK cohort. For eight out of nine replicated loci (FADS1, ELOVL2, ACADS, ACADM, ACADL, SPTLC3, ETFDH and SLC16A9), the genetic variant is located in or near genes encoding enzymes or solute carriers whose functions match the associating metabolic traits. In our study, the use of metabolite concentration ratios as proxies for enzymatic reaction rates reduced the variance and yielded robust statistical associations with P values ranging from 3 × 10⁻²⁴ to 6.5 × 10⁻¹⁷⁹. These loci explained 5.6%-36.3% of the observed variance in metabolite concentrations. For several loci, associations with clinically relevant parameters have been reported previously.
To identify loci for age at menarche, we performed a meta-analysis of 32 genome-wide association studies in 87,802 women of European descent, with replication in up to 14,731 women. In addition to the known loci at LIN28B (P = 5.4 × 10⁻⁶⁰) and 9q31.2 (P = 2.2 × 10⁻³³), we identified 30 new menarche loci (all P < 5 × 10⁻⁸) and found suggestive evidence for a further 10 loci (P < 1.9 × 10⁻⁶). The new loci included four previously associated with body mass index (in or near FTO, SEC16B, TRA2B and TMEM18), three in or near other genes implicated in energy homeostasis (BSX, CRTC1 and MCHR2) and three in or near genes implicated in hormonal regulation (INHBA, PCSK2 and RXRG). Ingenuity and gene-set enrichment pathway analyses identified coenzyme A and fatty acid biosynthesis as biological processes related to menarche timing.
Recent genome-wide association (GWA) studies of lipids have been conducted in samples ascertained for other phenotypes, particularly diabetes. Here we report the first GWA analysis of loci affecting total cholesterol (TC), low-density lipoprotein (LDL) cholesterol, high-density lipoprotein (HDL) cholesterol and triglycerides sampled randomly from 16 population-based cohorts and genotyped using mainly the Illumina HumanHap300-Duo platform. Our study included a total of 17,797-22,562 persons, aged 18-104 years and from geographic regions spanning from the Nordic countries to Southern Europe. We established 22 loci associated with serum lipid levels at a genome-wide significance level (P < 5 × 10⁻⁸), including 16 loci that were identified by previous GWA studies. The six newly identified loci in our cohort samples are ABCG5 (TC, P = 1.5 × 10⁻¹¹; LDL, P = 2.6 × 10⁻¹⁰), TMEM57 (TC, P = 5.4 × 10⁻¹⁰), CTCF-PRMT8 region (HDL, P = 8.3 × 10⁻¹⁶), DNAH11 (LDL, P = 6.1 × 10⁻⁹), FADS3-FADS2 (TC, P = 1.5 × 10⁻¹⁰; LDL, P = 4.4 × 10⁻¹³) and MADD-FOLH1 region (HDL, P = 6 × 10⁻¹¹). For three loci, effect sizes differed significantly by sex. Genetic risk scores based on lipid loci explain up to 4.8% of variation in lipids and were also associated with increased intima media thickness (P = 0.001) and coronary heart disease incidence (P = 0.04). The genetic risk score improves the screening of high-risk groups of dyslipidemia over classical risk factors.
Restless legs syndrome (RLS) is associated with common variants in three intronic and intergenic regions in MEIS1, BTBD9, and MAP2K5/LBXCOR1 on chromosomes 2p, 6p and 15q. Our study investigated these variants in 649 RLS patients and 1230 controls from the Czech Republic (290 cases and 450 controls), Austria (269 cases and 611 controls) and Finland (90 cases and 169 controls). Ten single nucleotide polymorphisms (SNPs) within the three genomic regions were selected according to the results of previous genome-wide scans. Samples were genotyped using Sequenom platforms. We replicated associations for all loci in the combined sample set (rs2300478 in MEIS1, p = 1.26 × 10⁻⁵, odds ratio (OR) = 1.47; rs3923809 in BTBD9, p = 4.11 × 10⁻⁵, OR = 1.58; and rs6494696 in MAP2K5/LBXCOR1, p = 0.04764, OR = 1.27). Analysing only familial cases against all controls, all three loci were significantly associated. Using sporadic cases only, we could confirm the association only with BTBD9. Our study shows that variants in these three loci confer consistent disease risks in patients of European descent. Among the known loci, BTBD9 seems to be the most consistent in its effect on RLS across populations and is also most independent of familial clustering.
The rapidly evolving field of metabolomics aims at a comprehensive measurement of ideally all endogenous metabolites in a cell or body fluid. It thereby provides a functional readout of the physiological state of the human body. Genetic variants that associate with changes in the homeostasis of key lipids, carbohydrates, or amino acids are not only expected to display much larger effect sizes due to their direct involvement in metabolite conversion modification, but should also provide access to the biochemical context of such variations, in particular when enzyme coding genes are concerned. To test this hypothesis, we conducted what is, to the best of our knowledge, the first GWA study with metabolomics based on the quantitative measurement of 363 metabolites in serum of 284 male participants of the KORA study. We found associations of frequent single nucleotide polymorphisms (SNPs) with considerable differences in the metabolic homeostasis of the human body, explaining up to 12% of the observed variance. Using ratios of certain metabolite concentrations as a proxy for enzymatic activity, up to 28% of the variance can be explained (p-values 10⁻¹⁶ to 10⁻²¹). We identified four genetic variants in genes coding for enzymes (FADS1, LIPC, SCAD, MCAD) where the corresponding metabolic phenotype (metabotype) clearly matches the biochemical pathways in which these enzymes are active. Our results suggest that common genetic polymorphisms induce major differentiations in the metabolic make-up of the human population. This may lead to a novel approach to personalized health care based on a combination of genotyping and metabolic characterization. These genetically determined metabotypes may subscribe the risk for a certain medical phenotype, the response to a given drug treatment, or the reaction to a nutritional intervention or environmental challenge.
Restless legs syndrome (RLS) is a frequent neurological disorder characterized by an imperative urge to move the legs during the night, unpleasant sensations in the lower limbs, disturbed sleep and increased cardiovascular morbidity. In a genome-wide association study we found highly significant associations between RLS and intronic variants in the homeobox gene MEIS1, the BTBD9 gene encoding a BTB(POZ) domain as well as variants in a third locus containing the genes encoding mitogen-activated protein kinase MAP2K5 and the transcription factor LBXCOR1 on chromosomes 2p, 6p and 15q, respectively. Two independent replications confirmed these association signals. Each genetic variant was associated with a more than 50% increase in risk for RLS, with the combined allelic variants conferring more than half of the risk. MEIS1 has been implicated in limb development, raising the possibility that RLS has components of a developmental disorder.