Article:Human genetics as a model for target validation: finding new therapies for diabetes. (5423999)
This page is the ScienceSource HTML version of the scholarly article described at https://www.wikidata.org/wiki/Q30234640. Its title is Human genetics as a model for target validation: finding new therapies for diabetes. and the publication date was 2017-06-01. The initial author is Soren K. Thomsen.
Fuller metadata can be found in the Wikidata link, which lists all authors, and may have detailed items for some or all of them. There is further information on the article in the footer below. This page is a reference version, and is protected against editing.
Human genetics as a model for target validation: finding new therapies for diabetes
- Soren K. Thomsen
- Anna L. Gloyn
Publication date (epub): 4/2017
Publication date (pmc-release): 4/2017
Publication date (ppub): /2017
Type 2 diabetes is a global epidemic with major effects on healthcare expenditure and quality of life. Currently available treatments are inadequate for the prevention of comorbidities, yet progress towards new therapies remains slow. A major barrier is the insufficiency of traditional preclinical models for predicting drug efficacy and safety. Human genetics offers a complementary model to assess causal mechanisms for target validation. Genetic perturbations are ‘experiments of nature’ that provide a uniquely relevant window into the long-term effects of modulating specific targets. Here, we show that genetic discoveries over the past decades have accurately predicted (now known) therapeutic mechanisms for type 2 diabetes. These findings highlight the potential for use of human genetic variation for prospective target validation, and establish a framework for future applications. Studies into rare, monogenic forms of diabetes have also provided proof-of-principle for precision medicine, and the applicability of this paradigm to complex disease is discussed. Finally, we highlight some of the limitations that are relevant to the use of genome-wide association studies (GWAS) in the search for new therapies for diabetes. A key outstanding challenge is the translation of GWAS signals into disease biology and we outline possible solutions for tackling this experimental bottleneck.
Electronic supplementary material
The online version of this article (doi:10.1007/s00125-017-4270-y) contains a slideset of the figures for download, which is available to authorised users.
Long-term complications of type 2 diabetes and its related disorders present a major, growing socioeconomic burden to society . Despite incremental advances in the development of therapies for diabetes, current treatments fail to provide adequate glucose control for most patients. Much-needed efforts to develop novel, first-in-class drugs are hampered by the slow rate and escalating costs of research and development programmes in the pharmaceutical industry. The staggering estimated price tag for an average new drug is approaching $3 billion, with the high attrition rate in clinical trials (>80%) imposing a cumulatively higher cost on those drugs that do make it to market [, ]. The most common reasons for failure include lack of efficacy and/or unsuitable safety profiles, even in cases where the correct target is engaged [, ]. Clearly, these observations attest to the limitations of existing preclinical models in evaluating therapeutic candidates before committing to expensive human studies.
Over the past decades, technological advances have unlocked the possibility of using human genetics as a complimentary strategy for preclinical target validation. Genetic variation offers valuable insights into the effects of manipulating specific proteins or pathways in a system that is directly relevant to human disease. Such ‘experiments of nature’ can, in principle, inform target identification, predict potential adverse long-term effects and identify suitable indications for treatment. In this review, we first discuss evidence for the benefits provided by human genetics for the treatment of diabetes within each of these three domains. Second, we focus on one of the key challenges facing this paradigm, specifically the identification of causal mechanisms for genetic variants, and provide examples of potential solutions.
‘Experiments of nature’ in drug target discovery
Nomination of a therapeutic candidate for clinical testing is based on the expectation that modulating a specific target will result in a net benefit for patients, taking into account both desired and adverse effects. Supporting evidence is usually derived from extensive preclinical testing, including studies in animal and cellular models, as well as data from observational epidemiology . Importantly, these sources are often unsuitable for establishing definitive proof of causality in humans, with the specific models either lacking direct relevance or being unable to confidently distinguish cause and effect . In both of these areas, human genetics can complement existing lines of evidence with a relevant window into the chain of causality. A genetic perturbation (an inherited mutation in or near an affected protein) is constant from birth and thus precedes the disease state rather than being affected by the disease environment. Unlike other molecular phenotypes, such as metabolite or protein levels, genetic variants are, therefore, not prone to be confounded by reverse causation. Moreover, in well-mixed populations, genotypes are randomly assigned at conception, thus acting, in effect, as natural versions of a randomised controlled trial . Taking advantage of these properties, genetic epidemiology can intersect with other types of preclinical evidence to establish a powerful framework for target discovery and validation (Fig. 1).Fig. 1
Using human genetics as a model for drug target validation. GWAS into the heritability of type 2 diabetes (T2D) have identified a large number of variants that are robustly associated with disease risk (a, b). Nevertheless, establishing the underlying causal mechanisms has proven to be a major experimental bottleneck. The process usually involves an array of approaches, including in vitro and in vivo studies in animal and cellular models, as well as genetic and physiological follow-up studies of risk-allele carriers. Once a causal gene has been identified (c), the encoded protein may be taken forward for further validation as a potential drug target. Genetic alleles within the causal gene can be interrogated for links to other phenotypes using PheWAS, which can highlight likely adverse or beneficial effects of long-term treatment (d). For candidate genes harbouring multiple, independent alleles, effects on disease risk can be correlated against their known impact on protein function (e). Some perturbations, such as protein-truncating variants, have predictable effects, while most alleles require extensive experimental follow-up to reliably ascertain their functional impacts. If an allelic series has been established, their phenotypic associations can be used to generate the genetic equivalent of a dose–response curve (e). The therapeutic window (TW) marks the range of perturbations that produce a suitable ratio between desirable effects (i.e. type 2 diabetes protection) and adverse effects (e.g. raised lipid levels). In cases where a potential treatment is not predicted to result in a net patient benefit, the target is considered unsuitable and the process can be repeated for a different candidate. However, if an appropriate TW has been identified, the target can be taken forward for drug development on the basis of this human genetic validation
Though experiments of nature can be highly relevant tools for prioritising drug targets, a number of challenges limit the applicability of this strategy. First, many disease-causing variants are located in regions of the genome that do not code for proteins, making identification of causal mechanisms a substantial challenge (this is discussed in more detail later) . Further, the impact of individual mutations on protein activity often cannot be determined a priori, which necessitates extensive in vitro work to establish both statistical associations and directions of effect . Once causal mechanisms have been established, these insights can then be used to focus research and development efforts on producing a desirable therapeutic effect. For instance, if loss-of-function variants in a particular gene are linked to protective effects, modulating the encoded protein by antagonists would be an attractive target (and often a more tractable goal for medicinal chemists than protein activation).
Ultimately, genetic epidemiology is reliant upon the occurrence of natural variation (either single-nucleotide and/or structural variants) in genomic regions relevant to disease. Since the penetrance of disease-associated variants tends to be inversely related to their frequency in the population (because of negative selection), most common variants have small effect sizes, while more deleterious mutations are rare [–]. As a result, sequencing of large-scale case–control cohorts is required to establish associations across the frequency spectrum [, ]. Phenotypic selection and familial studies (e.g. focusing on extreme forms of diabetes) can enrich for high-penetrance mutations, but assigning pathogenicity and estimating effect sizes from such studies can be a challenge [, , ]. In the ideal scenario, multiple independent alleles with different degrees of effect on a phenotype can be used to calibrate a genetic ‘dose–response’ curve [, ]. Not only does this build confidence in a specific target but the allelic series can also be used to predict the magnitude of effect required to produce a therapeutic response in vivo (Fig. 1). It is worth noting, however, that not all mutations produce effects that can be linearly mapped to a simple dose–response curve. More subtle perturbations could, for example, lead to aspects of both gain- and loss-of-function.
Genetic studies for the validation of drug mechanisms
Over the past 10 years, genome-wide association studies (GWAS) have made significant progress towards mapping the genetic heritability of many complex diseases . In the context of drug development, however, these advances are, broadly speaking, too recent to assess their impact on prospective target discovery. Still, attempts can be made to validate the use of GWAS associations for target discovery based on existing drugs. One study looked at the proportion of drug mechanisms (defined as a target paired with its approved indication) supported by genetic evidence at the various stages of the drug development pipeline . This proportion was found to increase from 2% at the preclinical stage to 8.2% for approved drugs, with the single largest increase occurring between phase II and phase III of clinical trials. Remarkably, based on such historical data, it was estimated that the rate of success was around twofold higher for a target–indication pair supported by GWAS or other human genetic data compared with pairs with no support. While any retrospective study will certainly have limitations (e.g. successful drug mechanisms might spur genetic research into particular targets), two observations support the overall conclusion of this study: first, the same correlation was found for GWAS data alone (success ratio: 1.8 for supported vs unsupported mechanisms), which is unlikely to be influenced by known biological mechanisms ; second, most potential confounding factors (e.g. unknown causal mechanisms for GWAS signals and incomplete mapping of genetic heritability) actually tend to bias observations towards the null hypothesis.
For the treatment of type 2 diabetes, thiazolidinediones (TZDs) provide an instructive example of a drug mechanism that has been corroborated by genetic evidence since first being discovered. TZDs are a class of commonly used drugs that act primarily through activation of the peroxisome proliferator-activated receptor γ (PPARγ) to improve insulin sensitivity . Within a few years after obtaining market approval in 1996, the gene encoding PPARγ (PPARG) was found to contain a missense variant (Pro12Ala) that associates with type 2 diabetes susceptibility [–]. Though the functional impact of this common variant remains uncertain, the subsequent discovery of rare, loss-of-function variants associated with disease risk have established direction-of-effect at this locus . Genetic evidence thus points to a therapeutic benefit of PPARγ agonists, fully consistent with the clinical effects observed from TZDs. The Pro12Ala association has also since been replicated by GWAS for type 2 diabetes risk, despite the relatively small effect size of the risk allele (OR 1.16) . As illustrated by this case, the measured effect size of a single genetic variant is not necessarily a useful predictor of therapeutic opportunities.
This lesson is further reinforced by insights from genetic studies on the ATP-sensitive potassium channel (KATP), which couples glucose metabolism to insulin secretion in pancreatic beta cells. As early as 1942, sulfonylureas inhibiting the channel were found to display hypoglycaemic effects in animal studies . Around 60 years later, genetic studies in humans identified a type 2 diabetes association signal that overlaps two genes, KCNJ11 and ABCC8, which encode subunits of the KATP (OR 1.1–1.2) [–]. Subsequent molecular studies have confirmed the risk haplotype to produce a channel that is less sensitive to ATP inhibition, thus reducing insulin secretion . In contrast, sulfonylureas promote closure of the channel to depolarise the beta cell and mobilise insulin granules . Thus, these findings demonstrate how genetic discovery can successfully predict the therapeutic potential of a known target based on genetic variants with moderate effect.
Although no validated drug targets have emerged from type 2 diabetes GWAS to date, recently identified coding variants have highlighted plausible candidates. One candidate that has been the focus of particular interest is the SLC30A8 gene, encoding the zinc transporter 8 (ZnT8), which is expressed in insulin secretory granules. Initially, common risk variants of unknown functional importance had spurred commercial interest in the development of agonists, based on the assumption of a negative correlation between activity levels of ZnT8 and diabetes risk [, ]. This notion was challenged by a more recent study that focused on protein-truncating variants in SLC30A8 to determine the effect of loss-of-function on type 2 diabetes susceptibility . Strikingly, the study found that carriers that were haploinsufficient for ZnT8 were protected from type 2 diabetes, with a 65% reduction in disease risk. These observations provide strong evidence in favour of a therapeutic strategy based on ZnT8 inhibition. More broadly, this example also illustrates the value in considering use of an extended allelic series to more fully explore the effects of target modulation at various levels of inhibition and/or activation.
Predicting adverse effects of new therapies
The suitability of a drug candidate is ultimately dependent on whether the therapeutic effect is expected to outweigh any on- and off-target adverse effects. These can be difficult to predict, especially if caused by unintended drug promiscuity, but attempts can be made to anticipate on-target side effects. In an analogous fashion to the use of GWAS for target identification, phenome-wide association studies (PheWAS) provide a tool for determining the long-term consequences of manipulating a target . Rather than seeking to identify variants associated with a particular disease, a PheWAS is designed to systematically identify the diseases or traits associated with a particular variant (or multiple variants within a gene of interest). In the same way that genetic perturbations can pinpoint target proteins, the detected phenotypes are a consequence of life-long experiments of nature. Any pleiotropy thus raises the possibility of additional on-target effects from long-term target modulation (Fig. 1).
As for GWAS, there are a number of practical and conceptual limitations that apply to the PheWAS paradigm in the context of target validation. First, it is clear that the identified phenotypes (both therapeutic benefits and on-target adverse effects) may be restricted to a perturbation that is imposed over many years or is present at a specific stage of disease progression. For instance, in the case of type 1 diabetes, the identified association signals have primarily uncovered genes implicated in the immune system. Nevertheless, modulating immune function in individuals with type 1 diabetes is unlikely to be an effective therapy, since autoimmune beta cell destruction has already occurred. For such diseases, the therapeutic pathways for treating symptoms (e.g. insulin or beta cell replacement for type 1 diabetes) may be different from the susceptibility pathways (uncovered by genetics) that are relevant to preventing disease. More generally, the lifetime exposure of a genetic defect might also produce long-term secondary effects (e.g. through compensatory mechanisms) that are not directly predictive of acute therapeutic interventions.
A second limitation of PheWAS is the requirement for access to diverse, deeply phenotyped cohorts or electronic medical records with genotyping information. Though population-wide biobanks and large, industry-led cohorts with sequencing data are now taking form, systematic PheWAS have previously been impractical . Studies of this nature have, therefore, been more akin to traditional candidate gene association studies, focusing on a specific hypothesis concerning a target gene and a selected outcome. Nonetheless, recent examples of this approach being applied to the development of new treatments for type 2 diabetes provide insights into the potential value of PheWAS.
Glucokinase and glucokinase regulatory protein
Glucokinase (encoded by GCK) is a key glycolytic enzyme involved in sensing the energy status of the body’s major organs. The protein is regulated in the liver by glucokinase regulatory protein (GKRP; encoded by GCKR), which sequesters glucokinase during fasting . Genetic variation in both GCK and GCKR has been implicated in type 2 diabetes susceptibility, and the proteins are both targets of ongoing drug development efforts to modulate this pathway [–]. While increasing glucokinase activity (e.g. through GKRP inhibition or allosteric activation) could lower plasma glucose to reduce the risk of type 2 diabetes, genetic evidence also points to the possibility of likely adverse effects [–]. Several studies of deleterious variants in GCKR have found increased risk of hypertriacylglycerolaemia, probably as a consequence of elevated substrate availability for hepatic lipogenesis [–]. Interestingly, in clinical trials of one glucokinase activator, mild dyslipidaemia was reported in treatment groups, providing preliminary confirmation of this potential adverse effect . Similar results were reported across different classes of glucokinase activators in rodents, arguing for an effect that is independent of the specific chemical compound . In light of corroborating genetic and molecular data, it is clear that monitoring lipid levels for therapies targeting glucokinase/GKRP is essential.
Sodium–glucose cotransporter 2
In a similar way, genetic evidence has been able to shine light on the clinical use of sodium–glucose cotransporter 2 (SGLT2) inhibitors, an emerging class of glucose-lowering drugs that act through increased renal clearance of glucose . A naturally occurring inhibitor of SGLT2 (phlorizin) had been known for some time, spurring the development of synthetic analogues for use in humans . Nevertheless, the discovery that familial renal glycosuria is caused by genetic variants in the gene encoding SGLT2 (SLC5A2) provided an opportunity to test for any side effects of long-term perturbations [, ]. Individuals carrying loss-of-function alleles in SLC5A2 have reduced ability to reabsorb glucose in the kidney but display otherwise normal renal function and no or few additional clinical features (www.omim.org/entry/233100, accessed 1 March 2017). These observations suggest that selective targeting of SGLT2, even for prolonged periods of time, is not associated with any significant complications.
Another rare genetic disorder, known as Cowden’s syndrome, has offered new clues into a possible link between type 2 diabetes, obesity and cancer, as initially suggested by epidemiological data . The majority of patients suffering from the cancer predisposition syndrome carry germline loss-of-function mutations in the PTEN gene . The protein encoded by PTEN (phosphatase and tensin homolog; PTEN) is a known tumour suppressor and a critical inhibitor of the phosphatidylinositol (3,4,5)-trisphosphate (PIP3) branch of insulin signalling. On this basis, individuals would be expected to display improved insulin sensitivity, with a concomitant increase in cell growth and metabolism. Indeed, a recent study found individuals with Cowden’s syndrome to be profoundly insulin sensitive, even in the face of obesity . This provides a dramatic example of the sometimes overlapping effects of intracellular signalling pathways involved in the regulation of metabolic and cell cycle-related processes.
Mendelian randomisation as a tool for predicting adverse effects of risk factor modulation
Among studies within genetic epidemiology, a subset are based on a particular design known as Mendelian randomisation. The aim of Mendelian randomisation studies is to establish causal relationships between an environmental exposure and disease status . More specifically, genetic variants are used as proxies for a modifiable exposure, which in turn may influence the outcome phenotype. As for other genetic association studies, the Mendelian randomisation design rests on the assumptions that genetic variants are fixed in time (not prone to reverse causation) and subject to independent assortment at conception (hence, more likely to produce unbiased estimates of a causal effect). In addition, Mendelian randomisation studies require that the selected variants influence disease status exclusively through the exposure of interest, and that they are not in linkage disequilibrium with any variants that could confound results . If these conditions are satisfied, the paradigm can provide a powerful tool for causal inference without many of the confounding influences of conventional observational epidemiology. Most obviously, Mendelian randomisation studies can be used to define the role of environmental influences in disease aetiology, and thereby determine behavioural or molecular traits that can be modified to minimise risk.
Within a framework of drug target validation, Mendelian randomisation can be a useful strategy for exploring possible adverse effects of a proposed treatment. Unlike conventional GWAS/PheWAS, which seek to predict the side effects of drugs that modify a particular target, the aim of Mendelian randomisation is in doing so for any intervention that targets a particular risk factor. Recently, for example, Mendelian randomisation studies were used to delineate a clinically relevant link between treatments for cardiovascular disease and type 2 diabetes risk [, ]. Alleles in the genes PCSK9 and HMGCR, known to predispose individuals to lower plasma LDL-cholesterol, were associated with the expected protective effect against cardiovascular events, but also with an inverse effect on type 2 diabetes risk. As the variants have no (known) pleiotropic effects, the results indicate a causal role of reduced LDL-cholesterol in type 2 diabetes susceptibility (among individuals that already have impaired glucose tolerance). Thus, the findings not only have implications for our understanding of current therapies targeting HMGCR (statins) and PCSK9 (proprotein convertase subtilisin/kexin type 9 [PCSK9] inhibitors) but they also show that the same undesirable effect may turn out to be a general feature of any treatment that lowers LDL-cholesterol.
Finding therapeutic indications based on pharmacogenomics and precision medicine
When balancing the expected effects of a drug candidate (both adverse and beneficial) any therapeutic hypothesis should be formulated in the context of an intended target population. Since not all individuals will benefit equally from a given treatment, identifying the most appropriate indication is critical to success in clinical trials. Clearly, the genetic associations identified during target discovery can be used to immediately propose broad indications for a candidate drug. By extension, PheWAS data can be used to search a larger phenotype space for any association that indicates a likely therapeutic or adverse effect.
The application of this principle extends beyond novel therapeutics and could be a powerful method for repositioning existing drugs for new indications . One report overlapped known drug mechanisms with GWAS associations for each target . Interestingly, around 40% of the associated traits matched the corresponding drug indication (e.g. the use of statins for hypercholesterolaemia is accurately predicted by HMGCR variants that are associated with LDL-cholesterol). Though this type of analysis will necessarily be limited to studying drugs that have known targets harbouring genetic variants associated with disease, the substantial overlap provides a validation of the approach and adds confidence to those indications corroborated by genetic evidence [–]. Still, more than half of the studied targets were associated with a different GWAS trait from that suggested by the indication for the drug. Some of these are likely a consequence of methodological limitations (e.g. difficulties translating GWAS signals), but the mismatches also highlight examples with additional supporting evidence. These represent plausible drug-repositioning opportunities.
Pharmacogenomics: application in monogenic vs polygenic diabetes
The indications proposed by genetic associations are generally broad phenotypic labels. Within the field of type 2 diabetes, the heterogeneous nature of the disorder is often alluded to, sometimes with the implication that genetics could be used to inform more precise diagnostic categories. If clinically meaningful subtypes did exist, such diagnostic labels could likely improve treatment efficacy. Certainly, there are individuals carrying mutations with high, if not complete penetrance in specific genes. These individuals may either suffer from a monogenic form of diabetes or exist somewhere on the spectrum between complex type 2 diabetes and a Mendelian disorder [, , ]. Since disease progression in such individuals is determined by perturbations in a very limited number of pathways, genetic testing could in theory enable precision medicine.
Proof of concept has been provided by a life-changing treatment for individuals with permanent neonatal diabetes mellitus (PNDM). Genetic studies on PNDM has led to the realisation that a subset of individuals harbour mutations in the genes encoding the KATP channel . Similar to the type 2 diabetes risk haplotype at this locus, the mutations were found to promote opening of the channel, suggesting that sulfonylureas could provide a disease-modifying therapy. This was confirmed in follow-up studies that demonstrated sustained efficacy in individuals with PNDM [, ]. Remarkably, most participants were able to discontinue insulin treatment, switching to oral therapy with improved metabolic control. Sulfonylureas have also been successful in disease management for certain forms of MODY. It was found that individuals with MODY carrying mutations in the HNF1A or HNF4A genes are sensitive to low-dose sulfonylureas, though the mechanism is incompletely understood [–]. The examples above, all of which are diseases with a defined genetic aetiology, provide compelling demonstrations that taking a pharmacogenetics approach can improve quality of life.
An interesting question pertains to whether such pharmacogenomic principles can be generalised to more complex forms of diabetes. In other words, can genetic testing identify subgroups of individuals with type 2 diabetes that are more likely to benefit from particular treatments than others? This would likely be the case if the underlying reality of diabetes was a collection of distinct subtypes, each dominated by defects in different pathways. As mentioned, it is clear that some individuals with type 2 diabetes do carry genetic variants with intermediate to high effect sizes that may be suggestive of increased sensitivity to drugs targeting the particular pathways affected. However, available evidence from genetic studies has shown that such individuals are in the minority and that the bulk of the genetic susceptibility for type 2 diabetes is carried by a very large number of common variants, each with small effect sizes. Equally, non-genetic factors, though less well understood, appear to be characterised by pervasive environmental perturbations. Individual risk profiles are thus dominated by exposures that are mostly common and widely shared, arguing against a model for disease architecture based on a set of distinct pathologies.
Emerging from the notion that existing disease models may be poorly suited for our current understanding of diabetes, an alternative taxonomy has recently been proposed [, ]; referred to as the ‘palette’ model (as opposed to a subtype-oriented model), it posits that diabetes is caused by a large number of small perturbations (environmental and genetic) across the component pathways of disease (e.g. beta cell function, insulin sensitivity, autoimmunity). Individually, the phenotypic impact of each perturbation is limited, but in aggregate will push a person on a path away from metabolic homeostasis. By analogy to colours combined in different hues and saturation, the palette taxonomy proposes an unlimited spectrum of disease manifestations. Monogenic and autoimmune forms of diabetes are represented in the extremes of this continuum . It is thus an implication of this model that the majority of individuals are not dominated by defects in single or few processes . These individuals cannot meaningfully be categorised into subtypes and, thus, attempts at delivering precision medicine will be challenging. It may be that biomarkers for specific processes can be used to glean insights into the pathways that are driving disease progression at any given time [, ]. As more process-modulating therapies become available, these could be used to encourage individuals along an appropriate trajectory, towards health. In the near future, however, targeting people with high-impact mutations (those at the extremes of the diabetes spectrum) are likely to be a more tractable aim for precision medicine.
Experimental challenges for drug target validation using human genetics
A key aspect of translating GWAS signals into target validation naturally centres on the identification of the causal genes (or ‘effector transcripts’) driving disease susceptibility (Fig. 1). Despite advances in broadly understanding molecular and regulatory mechanisms involved in type 2 diabetes pathogenesis, progress on individual loci has been slow. A minority of the >100 independent association signals for disease risk have produced a single high-confidence candidate gene. As a result, the therapeutic value of GWAS for target discovery is still limited by this experimental bottleneck, especially since follow-up studies have tended to focus on the ‘low-hanging fruits’ supported by existing lines of evidence . In the last few years, a number of different approaches have been taken to tackle this issue, providing complementary lines of evidence to enhance our understanding of causal mechanisms. The methods used for identifying effector transcripts broadly fall into three categories:
The identification of coding risk variants to directly pinpoint effector transcripts This approach has been facilitated by a recent shift in the attention of GWAS efforts towards low frequency and rare variants with higher penetrance [, ]. Even in regions with existing regulatory variants, coding variants can be used to direct experimental efforts towards particular candidates. This is illustrated by the G6PC2/ABCC11 locus, which contained two strong candidate causal genes near a non-coding association signal identified for fasting glucose (an intermediate trait for type 2 diabetes susceptibility) [, ]. A more recent effort to map coding variation for glycaemic control found coding variants within the G6PC2 gene [, ]. Follow-up experimental studies have since explored the effect of the variants to show a functional impact on the encoded glucose-6-phosphatase subunit . An added benefit of finding causal variants in coding regions is the offer of a more straight-forward interpretation for therapeutic strategies. Non-coding variation is subject to the context-dependent activity of cis-regulation and the effects can be restricted to specific tissues or developmental stages . As a consequence, drugs that target the affected gene could cause unexpected adverse effects by producing a more global phenotype. Though coding variants can also be subject to such context-dependency (e.g. through tissue-specific isoforms), the effector transcripts are often affected more widely [, ].
Integration of genetic and genomic data to establish direct links between regulatory variation, genomic annotation and regional genes One powerful approach within this category attempts to identify variants that affect the expression level of nearby genes, so-called cis-expression quantitative trait loci (cis-eQTLs). In cases where the association signal overlaps a cis-eQTL in a disease-relevant tissue, this can reveal both the target gene and the direction of effect for the risk variant. While many cis-eQTLs are shared across tissues, others appear to show more restricted effects that are specific to one or more tissues . Since physiological characterisations of carriers of type 2 diabetes risk variants have implied a central role for islet dysfunction in disease susceptibility, several cis-eQTL studies have focused on pancreatic islets [, ]. Though the power to detect associations has been limited by the availability of islets from donors, the approach has successfully highlighted candidate effector transcripts with previously unknown roles in disease pathogenesis [, ]; this is the case for the poorly characterised ZMIZ1 gene that was identified in a recent study . In vitro work subsequently confirmed a role for ZMIZ1 in islet function following functional studies.
Intersecting human genetics with genomic annotation can also be used to define common regulatory themes that underlie causal mechanisms at multiple loci. A recent study, for example, demonstrated an enrichment of islet and liver binding sites for the forkhead box protein A2 (FOXA2) transcription factor among type 2 diabetes association signals . These results suggest a shared role of FOXA2 across a subset of risk loci and highlight the potential to identify specific causal variants. In one instance, at the MTNR1B locus, where the association signal has been collapsed to a single variant through fine-mapping, the FOXA2 binding event was shown to be a marker for binding of another transcription factor, neurogenic differentiation 1 (NEUROD1). It was found that the risk allele creates a NEUROD1 binding site, leading to increased expression of MTNR1B in beta cells. This is in line with a cis-eQTL that was previously identified for this variant in islets, and adds support to a mechanism for this non-coding risk allele being mediated via elevated melatonin receptor 1B (MTRN1B; encoded by MTNR1B) activity [, ].
Interestingly, a different direction of effect for the MTRN1B gene has been proposed by coding loss-of-function variants, which have also been associated with elevated risk of type 2 diabetes . One potential explanation is suggested by the observation that the regulatory variant appears to exhibit tissue-specificity in its activity . It is thus possible that the discrepancy between coding and non-coding variants could reflect differences between global and local roles of MTRN1B. Other explanations are possible and it remains to be seen whether increased MTNR1B transcript levels translate into higher protein expression. MTNR1B, a G-protein-coupled receptor, has received considerable attention as a potentially ‘druggable’ target. Addressing the inconsistencies in genetic data will thus provide insights into the suitability of MTNR1B as a drug target and inform any potential therapeutic strategies.
Indirect prioritisation of genes based on known biology Last among the methods for identifying causal mechanisms, a third category aims to indirectly prioritise genes based on known biology. For instance, a number of type 2 diabetes loci harbour genes implicated in monogenic forms of diabetes. Given the overlapping aetiologies between the diseases, monogenic diabetes genes can also be prioritised as causal for complex diabetes. Though this type of evidence is circumstantial, it could be a useful method for limiting the search space of genes to be studied. However, the number of loci for which current evidence favours one candidate over others is limiting and tends to be biased towards previously studied genes. One recently developed method aimed to sidestep this limitation by performing high-throughput functional characterisation of positional candidates for type 2 diabetes GWAS signals . The screen successfully replicated known mechanisms of beta cell dysfunction and pointed to several unknown candidate causal genes. While any such study will be limited to a particular cell state, focusing on those tissues with high expected relevance to disease are likely to be most informative. Emerging genetic tools, such as clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR associated protein 9 (Cas9) and induced pluripotent stem cells, will make surveying a multitude of relevant phenotypes across tissue types and developmental stages an increasingly tractable goal.
Available models for preclinical target validation have limited ability to assess causal relationships with direct relevance to humans. Advances in genetics and genomics hold the promise to bring down the cost of industry research and development pipelines by complementing these approaches. Through genotype–phenotype associations, ‘experiments of nature’ can, in principle, facilitate drug target validation. It is still too early to assess the impact of GWAS findings on prospective target discovery for diabetes treatment but the genes identified to date have successfully predicted known therapeutic mechanisms. These encouraging findings suggest that translating uncharacterised loci into pathophysiological mechanisms could point to novel drug targets. Increasingly, the uncovered therapeutic mechanisms may enable modes of precision medicine in diabetes for individuals with moderate- to high-penetrance mutations. For more common forms of diabetes, the extent to which pharmacogenetics will prove a relevant paradigm is still uncertain. Recent genetic insights have argued against a subtype-oriented taxonomy of disease, and more precise indications for type 2 diabetes therapies may not be a realistic target for the near future. Even so, human genetics could pave the way for new disease-modifying treatments that can benefit both common and rare forms of diabetes.
Electronic supplementary material
(PPTX 169 kb)
This review article was based on the Minkowski 2014 lecture delivered by ALG at the 49th EASD Annual Meeting in Vienna and consequently focuses predominantly on her contribution to the field. It has been updated by ALG and SKT to reflect additional relevant studies reported since the delivery of the lecture. ALG would like to thank her mentors, colleagues, research team and all study participants over the years for their support and guidance and the opportunity to be part of international collaborative science.
- IDF (2015) International diabetes federation diabetes atlas, 7th edn. International Diabetes Federation, Belgium. Available from http://www.diabetesatlas.org. Accessed 1 March 2017
- JA DiMasiL FeldmanA SecklerA WilsonTrends in risks associated with new drug development: success rates for investigational drugsClin Pharmacol Ther20108727227710.1038/clpt.2009.29520130567
- JA DiMasiHG GrabowskiRW HansenInnovation in the pharmaceutical industry: new estimates of R&D costsJ Health Econ201647203310.1016/j.jhealeco.2016.01.01226928437
- J ArrowsmithP MillerTrial watch: phase II and phase III attrition rates 2011-2012Nat Rev Drug Discov20131256910.1038/nrd409023903212
- D CookD BrownR AlexanderLessons learned from the fate of AstraZeneca's drug pipeline: a five-dimensional frameworkNat Rev Drug Discov20141341943110.1038/nrd430924833294
- M WehlingAssessing the translatability of drug projects: what needs to be scored to predict success?Nat Rev Drug Discov2009854154610.1038/nrd289819543224
- RM PlengeEM ScolnickD AltshulerValidating therapeutic targets through human geneticsNat Rev Drug Discov20131258159410.1038/nrd405123868113
- JC BarrettI DunhamE BirneyUsing human genetics to make new medicinesNat Rev Genet20151656156210.1038/nrg399826370900
- LA HindorffP SethupathyHA JunkinsPotential etiologic and functional implications of genome-wide association loci for human diseases and traitsProc Natl Acad Sci U S A20091069362936710.1073/pnas.090310310619474294
- DG MacArthurTA ManolioDP DimmockGuidelines for investigating causality of sequence variants in human diseaseNature201450846947610.1038/nature1312724759409
- GV KryukovLA PennacchioSR SunyaevMost rare missense alleles are deleterious in humans: implications for complex disease and association studiesAm J Hum Genet20078072773910.1086/51347317357078
- DB GoldsteinA AllenJ KeeblerSequencing studies in human genetics: design and interpretationNat Rev Genet20131446047010.1038/nrg345523752795
- C FuchsbergerJ FlannickTM TeslovichThe genetic architecture of type 2 diabetesNature2016536414710.1038/nature1864227398621
- O ZukSF SchaffnerK SamochaSearching for missing heritability: designing rare variant association studiesProc Natl Acad Sci U S A2014111E455E46410.1073/pnas.132256311124443550
- J FlannickNL BeerAG BickAssessing the phenotypic effects in the general population of rare variants in genes for a dominant Mendelian form of diabetesNat Genet2013451380138510.1038/ng.279424097065
- CB BeggOn the use of familial aggregation in population-based case probands for calculating penetranceJ Natl Cancer Inst2002941221122610.1093/jnci/94.16.122112189225
- K ZhouHK PedersenAY DawedER PearsonPharmacogenomics in diabetes mellitus: insights into drug action and drug discoveryNat Rev Endocrinol20161233734627062931
- AL PriceCC SpencerP DonnellyProgress and promise in understanding the genetic basis of common diseasesProc R Soc B20152822015168410.1098/rspb.2015.168426702037
- MR NelsonH TipneyJL PainterThe support of human genetic evidence for approved drug indicationsNat Genet20154785686010.1038/ng.331426121088
- Hauner H (2002) The mode of action of thiazolidinediones. Diabetes Metab Res Rev 18(Suppl 2):S10–S15
- D AltshulerJN HirschhornM KlannemarkThe common PPARgamma Pro12Ala polymorphism is associated with decreased risk of type 2 diabetesNat Genet200026768010.1038/7983910973253
- SS DeebL FajasM NemotoA Pro12Ala substitution in PPARgamma2 associated with decreased receptor activity, lower body mass index and improved insulin sensitivityNat Genet19982028428710.1038/30999806549
- CJ YenBA BeamerC NegriMolecular scanning of the human peroxisome proliferator activated receptor gamma (hPPAR gamma) gene in diabetic Caucasians: identification of a Pro12Ala PPAR gamma 2 missense mutationBiochem Biophys Res Commun199724127027410.1006/bbrc.1997.77989425261
- AR MajithiaJ FlannickP ShahinianRare variants in PPARG with decreased activity in adipocyte differentiation are associated with increased risk of type 2 diabetesProc Natl Acad Sci U S A2014111131271313210.1073/pnas.141042811125157153
- A MahajanGenome-wide trans-ancestry meta-analysis provides insight into the genetic architecture of type 2 diabetes susceptibilityNat Genet20144623424410.1038/ng.289724509480
- M JanbonJ ChaptalA VedelJ SchaapAccidents hypoglycémiques graves par un sulfamidothiodiazol (le VK 57 ou 2254 RP)Montp Med19424412122
- AL GloynMN WeedonKR OwenLarge-scale association studies of variants in genes encoding the pancreatic beta-cell KATP channel subunits Kir6.2 (KCNJ11) and SUR1 (ABCC8) confirm that the KCNJ11 E23K variant is associated with type 2 diabetesDiabetes20035256857210.2337/diabetes.52.2.56812540637
- EH HaniP BoutinE DurandMissense mutations in the pancreatic islet beta cell inwardly rectifying K+ channel gene (KIR6.2/BIR): a meta-analysis suggests a role in the polygenic basis of type II diabetes mellitus in CaucasiansDiabetologia1998411511151510.1007/s0012500510989867219
- AL GloynY HashimSJ AshcroftR AshfieldS WiltshireRC TurnerAssociation studies of variants in promoter and coding regions of beta-cell ATP-sensitive K-channel genes SUR1 and Kir6.2 with type 2 diabetes mellitus (UKPDS 53)Diabet Med20011820621210.1046/j.1464-5491.2001.00449.x11318841
- KS HammingD SolimanLC MatemiszCoexpression of the type 2 diabetes susceptibility gene variants KCNJ11 E23K and ABCC8 S1369A alter the ATP and sulfonylurea sensitivities of the ATP-sensitive K(+) channelDiabetes2009582419242410.2337/db09-014319587354
- Proks P, Reimann F, Green N, Gribble F, Ashcroft F (2002) Sulfonylurea stimulation of insulin secretion. Diabetes 51(Suppl 3):S368–S376
- R SladekG RocheleauJ RungA genome-wide association study identifies novel risk loci for type 2 diabetesNature200744588188510.1038/nature0561617293876
- GA RutterF ChimientiSLC30A8 mutations in type 2 diabetesDiabetologia201558313610.1007/s00125-014-3405-725287711
- J FlannickG ThorleifssonNL BeerSB JacobsLoss-of-function mutations in SLC30A8 protect against type 2 diabetesNat Genet20144635736310.1038/ng.291524584071
- WS BushMT OetjensDC CrawfordUnravelling the human genome-phenome relationship using phenome-wide association studiesNat Rev Genet20161712914510.1038/nrg.2015.3626875678
- C ShiotaJ CoffeyJ GrimsbyJF GrippoMA MagnusonNuclear import of hepatic glucokinase depends upon glucokinase regulatory protein, whereas export is due to a nuclear export signal sequence in glucokinaseJ Biol Chem1999274371253713010.1074/jbc.274.52.3712510601273
- J DupuisC LangenbergI ProkopenkoNew genetic loci implicated in fasting glucose homeostasis and their impact on type 2 diabetes riskNat Genet20104210511610.1038/ng.52020081858
- DJ LloydDJ St Jean JrRJ KurzejaAntidiabetic effects of glucokinase regulatory protein small-molecule disruptorsNature201350443744010.1038/nature1272424226772
- FM MatschinskyAssessing the potential of glucokinase activators in diabetes therapyNat Rev Drug Discov2009839941610.1038/nrd285019373249
- R SaxenaBF VoightV LyssenkoGenome-wide association analysis identifies loci for type 2 diabetes and triglyceride levelsScience20073161331133610.1126/science.114235817463246
- M Orho-MelanderO MelanderC GuiducciCommon missense variant in the glucokinase regulatory protein gene is associated with increased plasma triglyceride and C-reactive protein but lower fasting glucose concentrationsDiabetes2008573112312110.2337/db08-051618678614
- CT JohansenJ WangMB LanktreeExcess of rare variants in genes identified by genome-wide association study of hypertriglyceridemiaNat Genet20104268468710.1038/ng.62820657596
- NL BeerND TribbleLJ McCullochThe P446L variant in GCKR associated with fasting plasma glucose and triglyceride levels exerts its effect through increased glucokinase activity in liverHum Mol Genet2009184081408810.1093/hmg/ddp35719643913
- MG ReesS WincovitchJ SchultzCellular characterisation of the GCKR P446L variant associated with type 2 diabetes riskDiabetologia20125511412210.1007/s00125-011-2348-522038520
- MG ReesD NgS RuppertCorrelation of rare coding variants in the gene encoding human glucokinase regulatory protein with phenotypic, cellular, and kinetic outcomesJ Clin Invest201212220521710.1172/JCI4642522182842
- MG ReesA RaimondoJ WangInheritance of rare functional GCKR variants and their contribution to triglyceride levels in familiesHum Mol Genet2014235570557810.1093/hmg/ddu26924879641
- GE MeiningerR ScottM AlbaEffects of MK-0941, a novel glucokinase activator, on glycemic control in insulin-treated patients with type 2 diabetesDiabetes Care2011342560256610.2337/dc11-120021994424
- F De CeuninckC KargarC IlicSmall molecule glucokinase activators disturb lipid homeostasis and induce fatty liver in rodents: a warning for therapeutic applications in humansBr J Pharmacol201316833935310.1111/j.1476-5381.2012.02184.x22925001
- EC ChaoSGLT-2 inhibitors: a new mechanism for glycemic controlClin Diabetes20143241110.2337/diaclin.32.1.426246672
- JR EhrenkranzNG LewisCR KahnJ RothPhlorizin: a reviewDiabetes Metab Res Rev200521313810.1002/dmrr.53215624123
- Y KanaiWS LeeG YouD BrownMA HedigerThe human kidney low affinity Na+/glucose cotransporter SGLT2. Delineation of the major renal reabsorptive mechanism for D-glucoseJ Clin Invest19949339740410.1172/JCI1169728282810
- LP van den HeuvelK AssinkM WillemsenL MonnensAutosomal recessive renal glucosuria attributable to a mutation in the sodium glucose cotransporter (SGLT2)Hum Genet200211154454710.1007/s00439-002-0820-512436245
- M TancrediA RosengrenAM SvenssonExcess mortality among persons with type 2 diabetesN Engl J Med20153731720173210.1056/NEJMoa150434726510021
- DJ MarshPL DahiaS CaronGermline PTEN mutations in Cowden syndrome-like familiesJ Med Genet19983588188510.1136/jmg.35.11.8819832031
- A PalTM BarberM Van de BuntPTEN mutations as a cause of constitutive insulin sensitivity and obesityN Engl J Med20123671002101110.1056/NEJMoa111396622970944
- GD SmithS Ebrahim‘Mendelian randomization’: can genetic epidemiology contribute to understanding environmental determinants of disease?Int J Epidemiol20033212210.1093/ije/dyg07012689998
- GD SmithS EbrahimMendelian randomization: prospects, potentials, and limitationsInt J Epidemiol200433304210.1093/ije/dyh13215075143
- AF SchmidtDI SwerdlowMV HolmesPCSK9 genetic variants and risk of type 2 diabetes: a mendelian randomisation studyLancet Diabetes Endocrinol201659710510.1016/S2213-8587(16)30396-527908689
- BA FerenceJG RobinsonRD BrookVariation in PCSK9 and HMGCR and risk of cardiovascular disease and diabetesN Engl J Med20163752144215310.1056/NEJMoa160430427959767
- M Rastegar-MojaradZ YeJM KolesarSJ HebbringSM LinOpportunities for drug repositioning from phenome-wide association studiesNat Biotechnol20153334234510.1038/nbt.318325850054
- P SanseauP AgarwalMR BarnesUse of genome-wide association studies for drug repositioningNat Biotechnol20123031732010.1038/nbt.215122491277
- P SanseauP AgarwalMR BarnesReply to rational drug repositioning by medical geneticsNat Biotechnol201331108210.1038/nbt.276924316642
- ZY WangHY ZhangRational drug repositioning by medical geneticsNat Biotechnol2013311080108210.1038/nbt.275824316641
- S AlthariAL GloynWhen is it MODY? Challenges in the interpretation of sequence variants in MODY genesRev Diabet Stud20151233034810.1900/RDS.2015.12.33027111119
- AL GloynER PearsonJF AntcliffActivating mutations in the gene encoding the ATP-sensitive potassium-channel subunit Kir6.2 and permanent neonatal diabetesN Engl J Med20043501838184910.1056/NEJMoa03292215115830
- ER PearsonI FlechtnerPR NjolstadSwitching from insulin to oral sulfonylureas in patients with diabetes due to Kir6.2 mutationsN Engl J Med200635546747710.1056/NEJMoa06175916885550
- JV SagenH RaederE HathoutPermanent neonatal diabetes due to mutations in KCNJ11 encoding Kir6.2: patient characteristics and initial response to sulfonylurea therapyDiabetes2004532713271810.2337/diabetes.53.10.271315448106
- M ShepherdER PearsonJ HoughtonG SaltS EllardAT HattersleyNo deterioration in glycemic control in HNF-1alpha maturity-onset diabetes of the young following transfer from long-term insulin to sulphonylureasDiabetes Care2003263191319210.2337/diacare.26.11.3191-a14578267
- ER PearsonS PruhovaCJ TackMolecular genetics and phenotypic characteristics of MODY caused by hepatocyte nuclear factor 4alpha mutations in a large European collectionDiabetologia20054887888510.1007/s00125-005-1738-y15830177
- ER PearsonBJ StarkeyRJ PowellFM GribblePM ClarkAT HattersleyGenetic cause of hyperglycaemia and response to treatment in diabetesLancet20033621275128110.1016/S0140-6736(03)14571-014575972
- McCarthy MI (2017) Painting a new picture of personalised medicine for diabetes. Diabetologia 60:793–799
- EAM GaleDeclassifying diabetesDiabetologia2006491989199510.1007/s00125-006-0348-716821044
- PW FranksMI McCarthyExposing the exposures responsible for type 2 diabetes and obesityScience2016354697310.1126/science.aaf509427846494
- SK ThomsenAL GloynThe pancreatic beta cell: recent insights from human geneticsTrends Endocrinol Metab20142542543410.1016/j.tem.2014.05.00124986330
- V SteinthorsdottirG ThorleifssonP SulemIdentification of low-frequency and rare sequence variants associated with elevated or reduced risk of type 2 diabetesNat Genet20144629429810.1038/ng.288224464100
- N Bouatia-NajiG RocheleauL Van LommelA polymorphism within the G6PC2 gene is associated with fasting plasma glucose levelsScience20083201085108810.1126/science.115684918451265
- WM ChenMR ErdosAU JacksonVariations in the G6PC2/ABCB11 genomic region are associated with fasting glucose levelsJ Clin Invest20081182620262818521185
- A MahajanX SimHJ NgIdentification and functional characterization of G6PC2 coding variants influencing glycemic traits define an effector transcript at the G6PC2-ABCB11 locusPLoS Genet201511e100487610.1371/journal.pgen.100487625625282
- J WesselAY ChuSM WillemsLow-frequency and rare exome chip variants associate with fasting glucose and type 2 diabetes susceptibilityNat Commun20156589710.1038/ncomms689725631608
- SK ThomsenMI McCarthyAL GloynThe importance of context: uncovering species- and tissue-specific effects of genetic risk variants for type 2 diabetesFront Endocrinol2016711210.3389/fendo.2016.00112
- I MoltkeN GrarupME JorgensenA common Greenlandic TBC1D4 variant confers muscle insulin resistance and type 2 diabetesNature201451219019310.1038/nature1342525043022
- GTEx ConsortiumHuman genomics. The genotype-tissue expression (GTEx) pilot analysis: multitissue gene regulation in humansScience201534864866010.1126/science.126211025954001
- BF VoightLJ ScottV SteinthorsdottirTwelve type 2 diabetes susceptibility loci identified through large-scale association analysisNat Genet20104257958910.1038/ng.60920581827
- AS DimasV LagouA BarkerImpact of type 2 diabetes susceptibility variants on quantitative glycemic traits reveals mechanistic heterogeneityDiabetes2014632158217110.2337/db13-094924296717
- J FadistaP VikmanEO LaaksoGlobal genomic and transcriptomic analysis of human pancreatic islets reveals novel genes influencing glucose metabolismProc Natl Acad Sci U S A2014111139241392910.1073/pnas.140266511125201977
- M van de BuntJE Manning FoxX DaiTranscript expression data from human islets links regulatory signals from genome-wide association studies for type 2 diabetes and glycemic traits to their downstream effectorsPLoS Genet201511e100569410.1371/journal.pgen.100569426624892
- KJ GaultonT FerreiraY LeeGenetic fine mapping and genomic annotation defines causal mechanisms at type 2 diabetes susceptibility lociNat Genet2015471415142510.1038/ng.343726551672
- T TuomiCL NagornyP SinghIncreased melatonin signaling is a risk factor for type 2 diabetesCell Metab2016231067107710.1016/j.cmet.2016.04.00927185156
- A BonnefondN ClementK FawcettRare MTNR1B variants impairing melatonin receptor 1B function contribute to type 2 diabetesNat Genet20124429730110.1038/ng.105322286214
- SK ThomsenA CeroniM van de BuntSystematic functional characterization of candidate causal genes for type 2 diabetes risk variantsDiabetes2016653805381110.2337/db16-036127554474