Identification of Significant Proteins in Coronavirus Disease 2019 Protein-Protein Interaction Using Principal Component Analysis and ClusterONE


Centrality measurements, ClusterONE, COVID-19, Overall centrality, Protein-protein interaction, Significant protein

How to Cite

Ananta Kusuma, W., Farhan Ramadhani , H. . and Annisa (2021) “Identification of Significant Proteins in Coronavirus Disease 2019 Protein-Protein Interaction Using Principal Component Analysis and ClusterONE”, Bioinformatics and Biomedical Research Journal, 3(2), pp. 25–34. Available at: (Accessed: 16October2021).


Coronavirus Disease 2019 (COVID-19) will cause disease complications and organ damage due to excessive inflammatory reactions if left untreated. Computational analysis of protein-protein interactions can be carried out in various ways, including topological analysis and clustering of protein-protein interaction networks. Topological analysis can identify significant proteins by measuring the most important nodes with centrality measurements. By using Principal Component Analysis (PCA), the types of centrality measures were extracted into the overall centrality value. The study aimed to found significant proteins in COVID-19 protein-protein interactions using PCA and ClusterONE. This study used 57 proteins associated with COVID-19 to obtain protein networks. All of these proteins are homo sapiens organism. The number of proteins and the number of interactions from 57 proteins were 357 proteins and 1686 interactions. The results of this study consisted of two clusters; the best cluster was the first cluster with a lower p-value but had an average overall centrality value that closed to the second cluster. There are twenty important proteins in that cluster, and all of these proteins are related to COVID-19. These proteins are expected to be used in the process of discovering medicinal compounds in COVID-19.



Lee PI, Hsueh PR (2020) Emerging threats from zoonotic coronaviruses-from SARS and MERS to 2019-nCoV. J Mi-crobiol Immunol Infect. 53:365–367. doi: 10.1016/j.jmii.2020.02.001.

Rothan HA, Byrareddy SN (2020) The epidemiology and pathogenesis of coronavirus disease (COVID-19) outbreak. J. Autoimmun. 109:102433. doi: 10.1016/j.jaut.2020.102433.

Ren LL, Wang YM, Wu ZQ, Xiang ZC, Guo L, Xu T, et al. (2020) Identification of a novel coronavirus causing severe pneumonia in human: a descriptive study. Chinese Med J. 133:1015–1024. doi: 10.1097/CM9.0000000000000722.

Huang C, Wang Y, Li X, Ren L, Zhao J, Hu Y, et al. (2020) Clinical features of patients infected with 2019 novel corona-virus in Wuhan, China. Lancet. 395:497–506. doi: 10.1016/S0140-6736(20)30183-5

Khaerunnisa S, Kurniawan H, Awaluddin R, Suhartati S, Soetjipto S (2020) Potential inhibitor of COVID-19 main pro-tease (Mpro) from several medicinal plant compounds by mo-lecular docking study. Preprints. 2020:1-14.

Gorbalenya AE, Baker SC, Baric RS, de Groot RJ, Drosten C, Gulyaeva AA, et al. (2020) The species Severe acute respirato-ry syndrome-related coronavirus: classifying 2019-nCoV and naming it SARS-CoV-2. Nat Microbiol. 5:536–544. doi: 10.1038/s41564-020-0695-z

Susilo A, Rumende CM, Pitoyo CW, Santoso WD, Yulianti M, Herikurniawan H, et al. (2020) Coronavirus disease 2019: Tinjauan literatur terkini. Jurnal Penyakit Dalam Indonesia. 7:1-45.

Buana DR (2020) Analisis perilaku masyarakat indonesia dalam menghadapi pandemi COVID-19 dan kiat menjaga kesejahteraan jiwa. Jurnal Sosial Dan Budaya Syar-I. 7:217–226.

Yuliana Y (2020) Corona virus diseases (Covid-19); sebuah tinjauan literatur. Wellness Healthy Mag. 2:187–192. doi: 10.30604/well.95212020.

Worldometer. Covid-19 Coronavirus Pandemic [Retrieved 12 Desember 2020]. Available from:

Rodríguez-Morales AJ, MacGregor K, Kanagarajah S, Patel D, Schlagenhauf P (2020) Going global – Travel and the 2019 novel coronavirus. Travel Med Infect Dis. 33:101578. doi: 10.1016/j.tmaid.2020.101578.

Sukmana M, Yuniarti FA (2020) The pathogenesis characteris-tics and symptom of COVID-19 in the context of establishing a nursing diagnosis. J. Kesehatan Pasak Bumi Kalimantan. 3:21–28.

Ramadhani N, Sumiwi SA (2016) Aktivitas antiinflamasi berbagai tanaman diduga berasal dari flavonoid. Farmaka. 14:111–123.

Chang JW, Zhou YQ, Ul Qamar MT, Chen LL, Ding YD (2016) Prediction of protein–protein interactions by evidence combining methods. Int J Mol Sci. 17:1946. doi: 10.3390/ijms17111946

Acencio ML, Lemke N (2009) Towards the prediction of essential genes by integration of network topology, cellular lo-calization and biological process information. BMC Bioinfor-matic. 10:1-18. doi: 10.1186/1471-2105-10-290

Liu W, Wu A, Pellegrini M, Wang X (2015) Integrative analy-sis of human protein, function and disease networks. Sci Rep. 5:14344. doi: 10.1038/srep14344

Singh M, Singh G (2011) Cluster analysis technique based on bipartite graph for human protein class prediction. Int J Com-put Appl. 20:22–27. doi: 10.5120/2414-3226

Pan A, Lahiri C, Rajendiran A, Shanmugham B (2016) Com-putational analysis of protein interaction networks for infec-tious diseases. Brief Bioinform. 17:517–526.

Lei X, Yang X (2018) A new method for predicting essential proteins based on participation degree in protein complex and subgraph density. PLOS ONE. 13:e0198998. doi: 10.1371/journal.pone.0198998

Jollife IT, Cadima J (2016) Principal component analysis: A review and recent developments. Philos Trans R Soc A. 374:1-16. doi: 10.1098/rsta.2015.0202

Nepusz T, Yu H, Paccanaro A (2012) Detecting overlapping protein complexes in protein-protein interaction networks. Nat Methods. 9:471–472. doi: 10.1038/nmeth.1938

Usman MS, Kusuma WA, Afendi FM, Heryanto R (2019) Identification of significant proteins associated with diabetes mellitus using network analysis of protein-protein interactions. Comput Eng Appl J. 8:41–52.

Zulfikar AA, Diansyah MR, Rizqa AA, Kusuma WA (2018) Identification of significant protein Diabetes Mellitus Type 2 with Fuzzy C- Means and topological analysis. 2018 1st Int Conf Bioinform Biotechnol Biomed Eng. 1:1–5.

Diansyah MR, Kusuma WA, Annisa A (2019) Analysis of protein-protein interaction using skyline query on parkinson disease. 2019 Int Conf Adv Comput Sci Inform Syst. 2019:175–180. doi: 10.1109/ICACSIS47736.2019.8979892

Rezaei-Tavirani M, Rezaei-Tavirani S, Mansouri V, Rostami-Nejad M, Rezaei-Tavirani M (2017) Protein-protein interaction network analysis for a biomarker panel related to human esophageal adenocarcinoma. Asian Pac J Cancer Prev. 18:3357–3363. doi: 10.22034/APJCP.2017.18.12.3357

Khorsand B, Savadi A, Naghibzadeh M (2020) SARS-CoV-2-human protein-protein interaction network. Informatics in Med-icine Unlocked. 20:100413. doi: 10.1016/j.imu.2020.100413.

Kermali M, Khalsa RK, Pillai K, Ismail Z, Harky A (2020) The role of biomarkers in diagnosis of COVID-19–A systematic review. Life Sci. 254:1-9. doi: 10.1016/j.lfs.2020.117788.

Kuleshov M, Clarke DJB, Kropiwnicki E, Jagodnik K, Bartal A, Evangelista JE, et al. (2020) The COVID-19 gene and drug set library. SSRN Electronic J. 1:1-1.

Zhang L, Guo H (2020) Biomarkers of COVID-19 and tech-nologies to combat SARS-CoV-2. Adv Biomarker Sci Technol. 2:1–23. doi: 10.1016/j.abst.2020.08.001.

Zhu M, Gao L, Li X, Liu Z, Xu C, Yan Y, et al. (2009) The analysis of the drug-targets based on the topological properties in the human protein-protein interaction network. J Drug Tar-get. 17:524–532. doi: 10.1080/10611860903046610.

Mistry D, Wise RP, Dickerson JA (2017) DiffSLC: A graph centrality method to detect essential proteins of a protein-protein interaction network. PLOS ONE. 12:e0187091. doi: 10.1371/journal.pone.0187091

Ashtiani M, Salehzadeh-Yazdi A, Razaghi-Moghadam Z, Hennig H, Wolkenhauer O, Mirzaie M, et al. (2018) A system-atic survey of centrality measures for protein-protein interac-tion networks. BMC Syst Biol. 12:1-17. doi: 10.1186/s12918-018-0598-2

Tang Y, Li M, Wang J, Pan Y, Wu FX (2015) CytoNCA: A cytoscape plugin for centrality analysis and evaluation of pro-tein interaction networks. BioSystems. 127:67–72. doi: 10.1016/j.biosystems.2014.11.005.

Wang J, Li M, Wang H, Pan Y (2012) Identification of essen-tial proteins based on edge clustering coefficient. IEEE/ACM Transact Comput Biol Bioinform. 9:1070–1080. doi: 10.1109/TCBB.2011.147.

Dash P, Nayak M, Das GP (2014) Principal Component Anal-ysis using singular value decomposition for image compression. Int J Comput Appl. 93:21–27.

Lee GS, Djauhari MA (2012) An overall centrality measure: The case of U.S. stock market. Int J Basic Appl Sci. 12:99–103.

Dosztányi Z, Chen J, Dunker, AK, Simon I, Tompa P (2006) Disorder and sequence repeats in hub proteins and their impli-cations for network evolution. J Proteome Res. 5:2985–2995. doi: 10.1021/pr060171o.

Blanco-Melo D, Nilsson-Payant BE, Liu WC, Møller R, Panis M, Sachs D, et al. (2020) SARS-CoV-2 launches a unique transcriptional signature from in vitro, ex vivo, and in vivo sys-tems. BioRxiv. 1-21. doi: 10.1101/2020.03.24.004655

Shaath H, Vishnubalaji R, Elkord E, Alajez NM (2020) Single-cell transcriptome analysis highlights a role for neutrophils and inflammatory macrophages in the pathogenesis of severe COVID-19. Cells. 9:1–19. doi: 10.3390/cells9112374.

Prasad K, Khatoon F, Rashid S, Ali N, AlAsmari AF, Ahmed MZ, et al. (2020) Targeting hub genes and pathways of innate immune response in COVID-19: A network biology perspec-tive. Int. J. Biol Macromol. 163:1–8. doi: 10.1016/j.ijbiomac.2020.06.228

Hachim MY, Al Heialy S, Hachim IY, Halwani R, Senok AC, Maghazachi AA, et al. (2020) Interferon-induced transmem-brane protein (IFITM3) is upregulated explicitly in SARS-CoV-2 infected lung epithelial cells. Front Immunol. 11:1–9. doi: 10.3389/fimmu.2020.01372

Zhang Q, Liu Z, Moncada-Velez M, Chen J, Ogishi M, Bigio B, et al. (2020). Inborn errors of type I IFN immunity in pa-tients with life-threatening COVID-19. Science. 370: eabd4570.

Hadjadj J, Yatim N, Barnabei L, Corneau A, Boussier J, Smith N, et al. (2020) Impaired type I interferon activity and in-flammatory responses in severe COVID-19 patients. Science. 369:718–724. doi: 10.1126/science.abc6027.

Zhang Z-L, Hou Y-L, Li D-T, Li F-Z (2020) Laboratory find-ings of COVID-19: a systematic review and meta-analysis. Scand J Clin Lab Invest. 80(6):441–447. doi: 10.1080/00365513.2020.1768587.

Aziz M, Fatima R, Assaly R (2020) Elevated interleukin?6 and severe COVID?19: A meta?analysis. J Med Virol. 92(11):2283–2285.

Herold T, Jurinovic V, Arnreich C, Lipworth BJ, Hellmuth JC, von Bergwelt-Baildon M, et al. (2020) Elevated levels of IL-6 and CRP predict the need for mechanical ventilation in COVID-19. J Allerg Clin Immunol. 146(1):128-136.e4. doi: 10.1016/j.jaci.2020.05.008

Gupta R (2020) The double-edged interferon riddle in COVID-19 pathogenesis. Crit Care. 24:1–2. doi: 10.1186/s13054-020-03337-z.

Sharma P, Pandey AK, Bhattacharyya DK (2021) Determining crucial genes associated with COVID-19 based on COPD Findings. Comput Biol Med. 128:104126. doi: 10.1016/j.compbiomed.2020.104126.

Matsuyama T, Kubli SP, Yoshinaga SK, Pfeffer K, Mak TW (2020) An aberrant STAT pathway is central to COVID-19. Cell Death Differ. 27:3209–3225. doi: 10.1038/s41418-020-00633-7

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

Copyright (c) 2021 Galaxy Sciences


Download data is not yet available.