Identification of potential protein targets for CoViD

In this module, we will focus on the identification of promising protein targets for drugs intended to treat coronavirus infections.
The analysis is based in a machine learning exercise carried out on SARS viruses experimental records stored in ChEMBL and PubChem DBs and described in the Prediction of activity section in this web site. 

Predicted activities have been aggregated at the protein level into different scores attending to their potency, frequency of protein-molecule interactions, and specificity based on positive records vs total number of active records. Network graphs show all ChEMBL  interactions of predicted active molecules. Barcharts and scatter plots show the aggregated potency against SARS-CoV (SARS_Score) vs the number of distinct molecular interactions for each protein (expressed as logged degree). Most interesting proteins attending to any of these values have been flagged in the scatter plots. Additional details as described in respective captions. 

Network interaction graphs of proteins predicted as potential targets against coronavirus.

Click on the graphs to expand view. Click on the links to access to specific charts for each target class.

Scatter plot ilustrating target activity and interactions scores. Best scored targets highlighted.

Aggregated SARS scores at the protein level: Log10(degree): logged number of interactions for each protein. SARS_Score: Aggregated score against coronavirus infection per protein. Size by Degree/CountOfScores represents the number of SARS active interactions vs total records per protein in ChEMBL. Color by condition of elements involved in score aggregation; they can be built from actual, predicted values or a combination of both. Shape indicates the organism type as described in the legend.

We can also see most relevant protein targets ranked by a combination of the three relevant scores: degree, SARS_Score and degre vs total number of records per protein (Degree/CountOfScores). 

Log10(degree): logged number of interactions for each protein. SARS_Score: Aggregated score against coronavirus infection per protein. Size by Degree/CountOfScores represents the number of SARS active interactions vs total records per protein in ChEMBL.

And now we can look at to all the interactions between predicted active molecules and protein targets in ChEMBL to determine main nodes of activity (bottom left interactions graph) or clusters/modules of specific interactions
(bottom righ interactions graph). 

And use these interactions maps to classify activities in different groups based on closest relations between compounds and proteins. We can highlight protein groups with relevant scores for SASRS infection: Proteases, GPCRs, ion channels, kinases, transporters, nucleic acids binding proteins, steroid metabolism enzymes, ubiquitin-transference  elements, bromodomain proteins, histone deacetylases and more.  See network graphs above. 
Interestingly, Riva and co-workers identify a good number of these targets from a gene enrichment approach. 

Host proteases, ABC and SLC transporters, voltage-gated potassium channels, TRPV1 channel, numerous G-protein coupled receptors (chemokine, LTB4, CRHR1, arginine-vasopressin, angiotensin, serotonin, dopamine and opioid receptors), sterol synthesis proteins (FNTA), tyrosine protein kinases (IGF1R, ABL1), steroid hormone receptors (ESR1, ESR2, PGR) and ser/thr-kinases in the MAPK/RAS/BRAF pathway (MAPK14, BRAF & RAS1) identified as potential targets for CoViD have emerged from these study as well. See graphical summary and reference below. 

Discovery of SARS-CoV-2 antiviral drugs through large-scale compound repurposing

[…]

The emergence of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) in 2019 has triggered an ongoing global pandemic of the severe pneumonia-like disease coronavirus disease 2019 (COVID-19)1. The development of a vaccine is likely to take at least 12–18 months, and the typical timeline for approval of a new antiviral therapeutic agent can exceed 10 years. Thus, repurposing of known drugs could substantially accelerate the deployment of new therapies for COVID-19. Here we profiled a library of drugs encompassing approximately 12,000 clinical-stage or Food and Drug Administration (FDA)-approved small molecules to identify candidate therapeutic drugs for COVID-19. We report the identification of 100 molecules that inhibit viral replication of SARS-CoV-2, including 21 drugs that exhibit dose–response relationships. Of these, thirteen were found to harbour effective concentrations commensurate with probable achievable therapeutic doses in patients, including the PIKfyve kinase inhibitor apilimod2,3,4 and the cysteine protease inhibitors MDL-28170, Z LVG CHN2, VBY-825 and ONO 5334. Notably, MDL-28170, ONO 5334 and apilimod were found to antagonize viral replication in human pneumocyte-like cells derived from induced pluripotent stem cells, and apilimod also demonstrated antiviral efficacy in a primary human lung explant model. Since most of the molecules identified in this study have already advanced into the clinic, their known pharmacological and human safety profiles will enable accelerated preclinical and clinical evaluation of these drugs for the treatment of COVID-19.

Cell map of the protein targets identified by Riva and co-workers.

Some interesting matches can also be found in Gordon’s et al approach, based on the MS measurement of interactions between viral and host proteins:

Sigma receptors, ABC transporters (ABCC1), bromodomain proteins (BRD4) and proteases (PCSK6, PSEN) are most relevant matches. 

A SARS-CoV-2 protein interaction map reveals targets for drug repurposing

David E. Gordon, Gwendolyn M. Jang, […]Nevan J. Krogan

Nature volume 583, pages459–468(2020)Cite this article

Abstract

A newly described coronavirus named severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), which is the causative agent of coronavirus disease 2019 (COVID-19), has infected over 2.3 million people, led to the death of more than 160,000 individuals and caused worldwide social and economic disruption1,2. There are no antiviral drugs with proven clinical efficacy for the treatment of COVID-19, nor are there any vaccines that prevent infection with SARS-CoV-2, and efforts to develop drugs and vaccines are hampered by the limited knowledge of the molecular details of how SARS-CoV-2 infects cells. Here we cloned, tagged and expressed 26 of the 29 SARS-CoV-2 proteins in human cells and identified the human proteins that physically associated with each of the SARS-CoV-2 proteins using affinity-purification mass spectrometry, identifying 332 high-confidence protein–protein interactions between SARS-CoV-2 and human proteins. Among these, we identify 66 druggable human proteins or host factors targeted by 69 compounds (of which, 29 drugs are approved by the US Food and Drug Administration, 12 are in clinical trials and 28 are preclinical compounds). We screened a subset of these in multiple viral assays and found two sets of pharmacological agents that displayed antiviral activity: inhibitors of mRNA translation and predicted regulators of the sigma-1 and sigma-2 receptors. Further studies of these host-factor-targeting agents, including their combination with drugs that directly target viral enzymes, could lead to a therapeutic regimen to treat COVID-19.

https://www.nature.com/articles/s41586-020-2286-9

 

 

332 high-confidence interactions between 26 SARS-CoV-2 proteins (red diamonds) and human proteins (circles; drug targets: orange; protein complexes: yellow; proteins in the same biological process: blue). Edge colour proportional to MiST score; edge thickness proportional to spectral counts. Physical interactions among host proteins (thin black lines) were curated from CORUM, IntAct, and Reactome. An interactive protein–protein interaction map can be found at kroganlab.ucsf.edu/network-maps. ECM, extracellular matrix; ER, endoplasmic reticulum; snRNP, small nuclear ribonucleoprotein. n = 3 biologically independent samples.Gordon, D.E., Jang, G.M., Bouhaddou, M. et al. A SARS-CoV-2 protein interaction map reveals targets for drug repurposing. Nature 583, 459–468 (2020). https://doi.org/10.1038/s41586-020-2286-9

Bojkova and co-workers identify relevant proteins by evaluation of protein content and protein translation levels upon coronavirus infection. 

Matching with DrTarget results, here we find adhesion proteins (CD47), heat shock proteins (HSP90s), the telomeric repeat-binding factor 2-interacting protein 1 (TERF2IP), Psmb5 protease, a populated set of ribosomal proteins (RPLs & RPSs), ser/thre protein kinases (AKT1, BCR, MAPK14), Sigma receptors, the glucocorticoid receptor (NR3C1), cholesterol metabolism enzymes (ERG7, Ebp), tyrosine kinases ( MET, SRC, EGFR, IGF1R), bromodomain proteins (BRD4) and Ub-transfer enzyme components (ELOB, UBE2V1) among others.

Proteomics of SARS-CoV-2-infected host cells reveals therapy targets

Denisa Bojkova, Kevin Klann, Benjamin Koch, Marek Widera, David Krause, Sandra Ciesek, Jindrich Cinatl & Christian Münch

Nature volume 583, pages469–472(2020)

Abstract

A new coronavirus was recently discovered and named severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). Infection with SARS-CoV-2 in humans causes coronavirus disease 2019 (COVID-19) and has been rapidly spreading around the globe1,2. SARS-CoV-2 shows some similarities to other coronaviruses; however, treatment options and an understanding of how SARS-CoV-2 infects cells are lacking. Here we identify the host cell pathways that are modulated by SARS-CoV-2 and show that inhibition of these pathways prevents viral replication in human cells. We established a human cell-culture model for infection with a clinical isolate of SARS-CoV-2. Using this cell-culture system, we determined the infection profile of SARS-CoV-2 by translatome3 and proteome proteomics at different times after infection. These analyses revealed that SARS-CoV-2 reshapes central cellular pathways such as translation, splicing, carbon metabolism, protein homeostasis (proteostasis) and nucleic acid metabolism. Small-molecule inhibitors that target these pathways prevented viral replication in cells. Our results reveal the cellular infection profile of SARS-CoV-2 and have enabled the identification of drugs that inhibit viral replication. We anticipate that our results will guide efforts to understand the molecular mechanisms that underlie the modulation of host cells after infection with SARS-CoV-2. Furthermore, our findings provide insights for the development of therapies for the treatment of COVID-19.

https://www.nature.com/articles/s41586-020-2332-7#Fig8

Network of proteins that are decreased during SARS-CoV-2 infection

a, Proteins belonging to cluster I in Fig. 3a were used for the creation of a functional interaction network. Lines indicate functional interactions. The network was created using the ReactomeFI plugin in Cytoscape, protein names were added in the plugin and the network adjusted by the yFiles Layout algorithm. b, ReactomeFI network analysis of proteins downregulated in total protein levels. Circle size represents number of proteins found in the pathway, colour shows the FDR for enrichment. Bojkova, D., Klann, K., Koch, B. et al. Proteomics of SARS-CoV-2-infected host cells reveals therapy targets. Nature 583, 469–472 (2020). https://doi.org/10.1038/s41586-020-2332-7.

Network of proteins that are increased during SARS-CoV-2 infection

a, Network of proteins increased during SARS-CoV-2 infection with spliceosome annotation in reactome pathway analysis. Lines indicate functional interactions. b, Table summarizing viral proteins (from different coronaviruses) interacting with various spliceosome components13,14,15,16,42,43. c, d, Cytotoxicity assays for different concentrations of pladienolide B (c) and 2-deoxy-D-glucose (d) relative to control. Mean values ± s.d. (n = 3 independent biological samples). Line represents 100% viable cells. e, Protein network showing increased proteins during infection with annotation to the molecular function of ‘unfolded protein binding’. Lines indicate functional interactions.. Bojkova, D., Klann, K., Koch, B. et al. Proteomics of SARS-CoV-2-infected host cells reveals therapy targets. Nature 583, 469–472 (2020). https://doi.org/10.1038/s41586-020-2332-7

Messina and collaborators perform a network analysis based on protein-protein interactions (PPI) and coexpression (COEX), finding interesting matches with drTarget preferred proteins. 

Common proteins identified by machine learning (drTarget) and PPI-COEX analysis.

COVID-19: viral–host interactome analyzed by network based-approach model to study pathogenesis of SARS-CoV-2 infection

Francesco Messina, Emanuela Giombini, Chiara Agrati, Francesco Vairo, Tommaso Ascoli Bartoli, Samir Al Moghazi, Mauro Piacentini, Franco Locatelli, Gary Kobinger, Markus Maeurer, Alimuddin Zumla, Maria R. Capobianchi, Francesco Nicola Lauria, Giuseppe Ippolito & COVID 19 INMI Network Medicine for IDs Study Group

Journal of Translational Medicine volume 18, Article number: 233 (2020) Cite this article

Abstract

Background

Epidemiological, virological and pathogenetic characteristics of SARS-CoV-2 infection are under evaluation. A better understanding of the pathophysiology associated with COVID-19 is crucial to improve treatment modalities and to develop effective prevention strategies. Transcriptomic and proteomic data on the host response against SARS-CoV-2 still have anecdotic character; currently available data from other coronavirus infections are therefore a key source of information.

Methods

We investigated selected molecular aspects of three human coronavirus (HCoV) infections, namely SARS-CoV, MERS-CoV and HCoV-229E, through a network based-approach. A functional analysis of HCoV–host interactome was carried out in order to provide a theoretic host–pathogen interaction model for HCoV infections and in order to translate the results in prediction for SARS-CoV-2 pathogenesis. The 3D model of S-glycoprotein of SARS-CoV-2 was compared to the structure of the corresponding SARS-CoV, HCoV-229E and MERS-CoV S-glycoprotein. SARS-CoV, MERS-CoV, HCoV-229E and the host interactome were inferred through published protein–protein interactions (PPI) as well as gene co-expression, triggered by HCoV S-glycoprotein in host cells.

Results

Although the amino acid sequences of the S-glycoprotein were found to be different between the various HCoV, the structures showed high similarity, but the best 3D structural overlap shared by SARS-CoV and SARS-CoV-2, consistent with the shared ACE2 predicted receptor. The host interactome, linked to the S-glycoprotein of SARS-CoV and MERS-CoV, mainly highlighted innate immunity pathway components, such as Toll Like receptors, cytokines and chemokines.

Conclusions

In this paper, we developed a network-based model with the aim to define molecular aspects of pathogenic phenotypes in HCoV infections. The resulting pattern may facilitate the process of structure-guided pharmaceutical and diagnostic research with the prospect to identify potential new biological targets.

https://translational-medicine.biomedcentral.com/articles/10.1186/s12967-020-02405-w#Fig2

PPI-COEX multilayer analysis, based on human PPI interactome and COEX network, with top 50 closest proteins/genes identified by RWR, using S-glycoprotein of HCoV-229E. Edges in blue represent protein-protein interactions, while red edges are coexpressions

Messina, F., Giombini, E., Agrati, C. et al. COVID-19: viral–host interactome analyzed by network based-approach model to study pathogenesis of SARS-CoV-2 infection. J Transl Med 18, 233 (2020). https://doi.org/10.1186/s12967-020-02405-w

PPI-COEX multilayer analysis, based on human PPI interactome and COEX network, with top 50 closest proteins/genes identified by RWR, using S-glycoprotein of SARS-CoV. Edges in blue represent protein-protein interactions, while red edges are coexpressions

Messina, F., Giombini, E., Agrati, C. et al. COVID-19: viral–host interactome analyzed by network based-approach model to study pathogenesis of SARS-CoV-2 infection. J Transl Med 18, 233 (2020). https://doi.org/10.1186/s12967-020-02405-w

PPI-COEX multilayer analysis, based on human PPI interactome and COEX network, with top 50 closest proteins/genes identified by RWR, using S-glycoprotein of MERS-CoV. Edges in blue represent protein-protein interactions, while red edges are coexpressions

Messina, F., Giombini, E., Agrati, C. et al. COVID-19: viral–host interactome analyzed by network based-approach model to study pathogenesis of SARS-CoV-2 infection. J Transl Med 18, 233 (2020). https://doi.org/10.1186/s12967-020-02405-w