To delineate organ-specific and systemic drivers of metabolic dysfunction-associated steatotic liver disease (MASLD), we applied integrative causal inference across clinical, imaging, and proteomic do Show more
To delineate organ-specific and systemic drivers of metabolic dysfunction-associated steatotic liver disease (MASLD), we applied integrative causal inference across clinical, imaging, and proteomic domains in individuals with and without type 2 diabetes (T2D). Bayesian network analyses and complementary two-sample Mendelian randomization were used to quantify causal pathways linking adipose distribution, glycemia, and insulin dynamics with liver fat in the IMI-DIRECT prospective cohort study. Data included frequently sampled metabolic challenge tests, MRI-derived abdominal and hepatic fat content, serological biomarkers, and Olink plasma proteomics from 331 adults with new-onset T2D and 964 adults without diabetes, with harmonized protocols enabling replication. High basal insulin secretion rate (BasalISR), estimated via C-peptide deconvolution, emerged as the primary potential causal driver of liver fat accumulation in both cohorts. BasalISR, a clearance-independent measure of β-cell insulin output distinct from peripheral insulin levels, was independently linked to hepatic steatosis. Visceral adipose tissue exhibited bidirectional associations with liver fat, suggesting a self-reinforcing metabolic loop. Of 446 analyzed proteins, 34 mapped to these metabolic networks (27 in the non-diabetes network, 18 in the T2D network, and 11 shared). Key proteins directly associated with liver fat included GUSB, ALDH1A1, LPL, IGFBP1/2, CTSD, HMOX1, FGF21, AGRP, and ACE2. Sex-stratified analyses identified GUSB in females and LEP in males as the strongest protein predictors of liver fat. BasalISR may better capture early β-cell-driven disturbances contributing to MASLD. These findings outline a multifactorial, sex- and disease stage-specific proteo-metabolic architecture of hepatic steatosis and identify potential biomarkers or therapeutic targets. Show less
To delineate organ-specific and systemic drivers of metabolic dysfunction-associated steatotic liver disease (MASLD), we applied integrative causal inference across clinical, imaging, and proteomic do Show more
To delineate organ-specific and systemic drivers of metabolic dysfunction-associated steatotic liver disease (MASLD), we applied integrative causal inference across clinical, imaging, and proteomic domains in individuals with and without type 2 diabetes (T2D). We used Bayesian network analyses to quantify causal pathways linking adipose distribution, glycemia, and insulin dynamics with fatty liver using data from the IMI-DIRECT prospective cohort study. Measurements were made of glucose and insulin dynamics (using frequently-sampled metabolic challenge tests), MRI-derived abdominal and liver fat content, serological biomarkers, and Olink plasma proteomics from 331 adults with new-onset T2D and 964 adults free from diabetes at enrolment. The common protocols used in these two cohorts provided the opportunity for replication analyses to be performed. When the direction of the effect could not be determined with high probability through Bayesian networks, complementary two-sample Mendelian randomization (MR) was employed. High basal insulin secretion rate (BasalISR) was identified as the primary causal driver of liver fat accumulation in both diabetes and non-diabetes. Excess visceral adipose tissue (VAT) was bidirectionally associated with liver fat, indicating a self-reinforcing metabolic loop. Basal insulin clearance (Clinsb) worsened as a consequence of liver fat accumulation to a greater degree before the onset of T2D. Out of 446 analysed proteins, 34 mapped to these metabolic networks and 27 were identified in the non-diabetes network, 18 in the diabetes network, and 11 were common between the two networks. Key proteins directly associated with liver fat included GUSB, ALDH1A1, LPL, IGFBP1/2, CTSD, HMOX1, FGF21, AGRP, and ACE2. Sex-stratified analyses revealed distinct proteomic drivers: GUSB and LEP were most predictive of liver fat in females and males, respectively. Basal insulin hypersecretion is a modifiable, causal driver of MASLD, particularly prior to glycaemic decompensation. Our findings highlight a multifactorial, sex- and disease-stage-specific proteo-metabolic architecture of hepatic steatosis. Proteins such as GUSB, ALDH1A1, LPL, and IGFBPs warrant further investigation as potential biomarkers or therapeutic targets for MASLD prevention and treatment. Show less
Genome-wide association studies (GWAS) have identified >250 loci for body mass index (BMI), implicating pathways related to neuronal biology. Most GWAS loci represent clusters of common, noncoding var Show more
Genome-wide association studies (GWAS) have identified >250 loci for body mass index (BMI), implicating pathways related to neuronal biology. Most GWAS loci represent clusters of common, noncoding variants from which pinpointing causal genes remains challenging. Here we combined data from 718,734 individuals to discover rare and low-frequency (minor allele frequency (MAF) < 5%) coding variants associated with BMI. We identified 14 coding variants in 13 genes, of which 8 variants were in genes (ZBTB7B, ACHE, RAPGEF3, RAB21, ZFHX3, ENTPD6, ZFR2 and ZNF169) newly implicated in human obesity, 2 variants were in genes (MC4R and KSR2) previously observed to be mutated in extreme obesity and 2 variants were in GIPR. The effect sizes of rare variants are ~10 times larger than those of common variants, with the largest effect observed in carriers of an MC4R mutation introducing a stop codon (p.Tyr35Ter, MAF = 0.01%), who weighed ~7 kg more than non-carriers. Pathway analyses based on the variants associated with BMI confirm enrichment of neuronal genes and provide new evidence for adipocyte and energy expenditure biology, widening the potential of genetically supported therapeutic targets in obesity. Show less
Human complex metabolic traits are in part regulated by genetic determinants. Here we applied exome sequencing to identify novel associations of coding polymorphisms at minor allele frequencies (MAFs) Show more
Human complex metabolic traits are in part regulated by genetic determinants. Here we applied exome sequencing to identify novel associations of coding polymorphisms at minor allele frequencies (MAFs) >1% with common metabolic phenotypes. The study comprised three stages. We performed medium-depth (8×) whole exome sequencing in 1,000 cases with type 2 diabetes, BMI >27.5 kg/m(2) and hypertension and in 1,000 controls (stage 1). We selected 16,192 polymorphisms nominally associated (p < 0.05) with case-control status, from four selected annotation categories or from loci reported to associate with metabolic traits. These variants were genotyped in 15,989 Danes to search for association with 12 metabolic phenotypes (stage 2). In stage 3, polymorphisms showing potential associations were genotyped in a further 63,896 Europeans. Exome sequencing identified 70,182 polymorphisms with MAF >1%. In stage 2 we identified 51 potential associations with one or more of eight metabolic phenotypes covered by 45 unique polymorphisms. In meta-analyses of stage 2 and stage 3 results, we demonstrated robust associations for coding polymorphisms in CD300LG (fasting HDL-cholesterol: MAF 3.5%, p = 8.5 × 10(-14)), COBLL1 (type 2 diabetes: MAF 12.5%, OR 0.88, p = 1.2 × 10(-11)) and MACF1 (type 2 diabetes: MAF 23.4%, OR 1.10, p = 8.2 × 10(-10)). We applied exome sequencing as a basis for finding genetic determinants of metabolic traits and show the existence of low-frequency and common coding polymorphisms with impact on common metabolic traits. Based on our study, coding polymorphisms with MAF above 1% do not seem to have particularly high effect sizes on the measured metabolic traits. Show less
Obesity is globally prevalent and highly heritable, but its underlying genetic factors remain largely elusive. To identify genetic loci for obesity susceptibility, we examined associations between bod Show more
Obesity is globally prevalent and highly heritable, but its underlying genetic factors remain largely elusive. To identify genetic loci for obesity susceptibility, we examined associations between body mass index and ∼ 2.8 million SNPs in up to 123,865 individuals with targeted follow up of 42 SNPs in up to 125,931 additional individuals. We confirmed 14 known obesity susceptibility loci and identified 18 new loci associated with body mass index (P < 5 × 10⁻⁸), one of which includes a copy number variant near GPRC5B. Some loci (at MC4R, POMC, SH2B1 and BDNF) map near key hypothalamic regulators of energy balance, and one of these loci is near GIPR, an incretin receptor. Furthermore, genes in other newly associated loci may provide new insights into human body weight regulation. Show less
Glucose levels 2 h after an oral glucose challenge are a clinical measure of glucose tolerance used in the diagnosis of type 2 diabetes. We report a meta-analysis of nine genome-wide association studi Show more
Glucose levels 2 h after an oral glucose challenge are a clinical measure of glucose tolerance used in the diagnosis of type 2 diabetes. We report a meta-analysis of nine genome-wide association studies (n = 15,234 nondiabetic individuals) and a follow-up of 29 independent loci (n = 6,958-30,620). We identify variants at the GIPR locus associated with 2-h glucose level (rs10423928, beta (s.e.m.) = 0.09 (0.01) mmol/l per A allele, P = 2.0 x 10(-15)). The GIPR A-allele carriers also showed decreased insulin secretion (n = 22,492; insulinogenic index, P = 1.0 x 10(-17); ratio of insulin to glucose area under the curve, P = 1.3 x 10(-16)) and diminished incretin effect (n = 804; P = 4.3 x 10(-4)). We also identified variants at ADCY5 (rs2877716, P = 4.2 x 10(-16)), VPS13C (rs17271305, P = 4.1 x 10(-8)), GCKR (rs1260326, P = 7.1 x 10(-11)) and TCF7L2 (rs7903146, P = 4.2 x 10(-10)) associated with 2-h glucose. Of the three newly implicated loci (GIPR, ADCY5 and VPS13C), only ADCY5 was found to be associated with type 2 diabetes in collaborating studies (n = 35,869 cases, 89,798 controls, OR = 1.12, 95% CI 1.09-1.15, P = 4.8 x 10(-18)). Show less
Levels of circulating glucose are tightly regulated. To identify new loci influencing glycemic traits, we performed meta-analyses of 21 genome-wide association studies informative for fasting glucose, Show more
Levels of circulating glucose are tightly regulated. To identify new loci influencing glycemic traits, we performed meta-analyses of 21 genome-wide association studies informative for fasting glucose, fasting insulin and indices of beta-cell function (HOMA-B) and insulin resistance (HOMA-IR) in up to 46,186 nondiabetic participants. Follow-up of 25 loci in up to 76,558 additional subjects identified 16 loci associated with fasting glucose and HOMA-B and two loci associated with fasting insulin and HOMA-IR. These include nine loci newly associated with fasting glucose (in or near ADCY5, MADD, ADRA2A, CRY2, FADS1, GLIS3, SLC2A2, PROX1 and C2CD4B) and one influencing fasting insulin and HOMA-IR (near IGF1). We also demonstrated association of ADCY5, PROX1, GCK, GCKR and DGKB-TMEM195 with type 2 diabetes. Within these loci, likely biological candidate genes influence signal transduction, cell proliferation, development, glucose-sensing and circadian regulation. Our results demonstrate that genetic studies of glycemic traits can identify type 2 diabetes risk loci, as well as loci containing gene variants that are associated with a modest elevation in glucose levels but are not associated with overt diabetes. Show less
Common genetic variants influence plasma triacylglycerol, HDL-cholesterol (HDL-C) and glucose levels in cross-sectional studies. However, the longitudinal effects of these established variants have no Show more
Common genetic variants influence plasma triacylglycerol, HDL-cholesterol (HDL-C) and glucose levels in cross-sectional studies. However, the longitudinal effects of these established variants have not been studied. Our aim was to examine the longitudinal associations of four such variants in the apolipoprotein A-V (APOA5), lipoprotein lipase (LPL), and glucokinase (GCK) genes with fasting glucose or lipid levels. The individuals analysed were participants in the Busselton Health Survey (n = 4,554). Cross-sectional analyses of family data used the total association test. Longitudinal association analyses of unrelated participant data (n = 2,864) used linear mixed-effects models. The findings of cross-sectional association analyses replicated those of previous studies. We observed associations of the G and C alleles at the APOA5 single nucleotide polymorphisms (SNPs) rs662799 and rs3135506 with raised triacylglycerol levels (p = 0.0003 and p < 0.0001, respectively), the 447X allele at the LPL SNP rs328 with reduced triacylglycerol levels (p = 0.0004) and raised HDL-C levels (p = 0.0004), and the A allele of the GCK SNP rs1799884 with raised fasting glucose level (p = 0.015). Longitudinal association analyses showed that most of these associations did not change in the same individuals over an average follow-up time of 17.4 years, though there was some evidence that the association of the 447X allele of rs328 with raised HDL-C level significantly increased with age (p = 0.01), and that the association of the C allele of rs3135506 with raised triacylglycerol level significantly increased over time (p = 0.0007). The current study suggests that the effects of established gene variants on lipid and glucose traits do not tend to alter with age during adulthood or over time. Show less
The APOA5 gene variants, -1131T>C and S19W, are associated with altered triglyceride concentrations in studies of subjects of Caucasian and East Asian descent. There are few studies of these variants Show more
The APOA5 gene variants, -1131T>C and S19W, are associated with altered triglyceride concentrations in studies of subjects of Caucasian and East Asian descent. There are few studies of these variants in South Asians. We investigated whether the two APOA5 variants also show similar association with various lipid parameters in Indian population as in the UK white subjects. We genotyped 557 Indian adults from Pune, India, and 237 UK white adults for -1131T>C and S19W variants in the APOA5 gene, compared their allelic and genotype frequency and determined their association with fasting serum triglycerides, total cholesterol, HDL and LDL cholesterol levels using univariate general linear analysis. APOC3 SstI polymorphism was also analyzed in 175 Pune Indian subjects for analysis of linkage disequilibrium with the APOA5 variants. The APOA5 -1131C allele was more prevalent in Indians from Pune (Pune Indians) compared to UK white subjects (allele frequency 20% vs. 4%, p = 0.00001), whereas the 19W allele was less prevalent (3% vs. 6% p = 0.0015). Patterns of linkage disequilibrium between the two variants were similar between the two populations and confirmed that they occur on two different haplotypes. In Pune Indians, the presence of -1131C allele and the 19W allele was associated with a 19% and 15% increase respectively in triglyceride concentrations although only -1131C was significant (p = 0.0003). This effect size was similar to that seen in the UK white subjects. Analysis of the APOC3 SstI polymorphism in 175 Pune Indian subjects showed that this variant is not in appreciable linkage disequilibrium with the APOA5 -1131T>C variant (r2 = 0.07). This is the first study to look at the role of APOA5 in Asian Indian subjects that reside in India. The -1131C allele is more prevalent and the 19W allele is less prevalent in Pune Indians compared to UK Caucasians. We confirm that the APOA5 variants are associated with triglyceride levels independent of ethnicity and that this association is similar in magnitude in Asian Indians and Caucasians. The -1131C allele is present in 36% of the Pune Indian population making it a powerful marker for looking at the role of elevated triglycerides in important conditions such as pancreatitis, diabetes and coronary heart disease. Show less
Apolipoprotein AV (ApoAV) gene variant, -1131T>C, is associated with increased triglyceride concentrations in all ethnic groups studied. An MseI based RFLP analysis is the most commonly used method fo Show more
Apolipoprotein AV (ApoAV) gene variant, -1131T>C, is associated with increased triglyceride concentrations in all ethnic groups studied. An MseI based RFLP analysis is the most commonly used method for genotyping this SNP. We genotyped a large cohort comprising 1185 Asian Indians and 173 UK Caucasians for -1131T>C using an ARMS-PCR based tetra-primer method. For quality control, we re-genotyped approximately 10% random samples from this cohort utilizing the MseI RFLP, which showed a 2.9% (3/102) genotyping error rate between the two methods. To investigate further, we sequenced the 900 bp region around the -1131T>C polymorphism in 25 Asian Indians and 15 UK Caucasians and found a number of polymorphisms including the -987C>T polymorphism. Further analysis of the -987C>T SNP showed a higher rare allele frequency of 0.23 in Asian Indians (n = 158) compared to 0.09 in the UK Caucasians (n = 157). This SNP is located 4 bp from the 3' end of the RFLP forward primer and is in weak linkage disequilibrium with -1131T>C variant (r2 = 0.084 and D' = 1). Repeated RFLP analysis of seven subjects heterozygous for -987C>T (seven times), showed discordant results with the sequence at -1131T>C SNP nearly one third (15/49) of the time. We conclude that presence of -987C>T polymorphism in the forward primer of the MseI RFLP assay may lead to allelic drop-out and generate unforeseen errors in genotyping the -1131T>C polymorphism. Our results also emphasise the need for careful quality control in all molecular genetic studies, particularly while transferring genotyping methods between various ethnic groups. Show less