Please use this identifier to cite or link to this item: http://localhost:8080/xmlui/handle/123456789/7631
Title: Cluster-based text mining for extracting drug candidates for the prevention of COVID-19 from the biomedical literature
Authors: Supianto, Ahmad Afif
Nurdiansyah, Rizky
Weng, Chia-Wei
Zilvan, Vicky
Yuwana, Raden Sandra
Arisal, Andria
Pardede, Hilman Ferdinandus
Lee, Min-Min
Huang, Chien-Hung
Ng, Ka-Lok
Keywords: Coronavirus
COVID-19
Drug docking
Phytochemicals
SARS-CoV-2
Text mining
Issue Date: 2023
Publisher: Journal of Taibah University Medical Sciences
Series/Report no.: Original Article;787-801
Abstract: Objective: The coronavirus disease 2019 (COVID-19) health crisis that began at the end of 2019 made researchers around the world quickly race to find effective solutions. Related literature exploded and it was inevitable that an automated approach was needed to find useful information, namely text mining, to overcome COVID-19, especially in terms of drug candidate discovery. While text mining methods for finding drug candidates mostly try to extract bioentity associations from PubMed, very few of them mine with a clustering approach. The purpose of this study was to demonstrate the effectiveness of our approach to identify drugs for the prevention of COVID- 19 through literature review, cluster analysis, drug docking calculations, and clinical trial data. Methods: This research was conducted in four main stages. First, the text mining stage was carried out by involving Bidirectional Encoder Representations from Transformers for Biomedical to obtain vector representation of each word in the sentence from texts. The next stage generated the disease-drug associations, which were obtained from the correlation between disease and drug. Next, the clustering stage grouped the rules through the similarity of diseases by utilizing Term Frequency-Inverse Document Frequency as its feature. Finally, the drug candidate extraction stage was processed through leveraging PubChem and DrugBank databases. We further used the drug docking package AUTODOCK VINA in PyRx software to verify the results. Results: Comparative analyses showed that the percentage of findings using mining with clustering outperformed mining without clustering in all experimental settings. In addition, we suggest that the top three drugs/phytochemicals by drug docking analysis may be effective in preventing COVID-19. Conclusions: The proposed method for text mining utilizing the clustering method is quite promising in the discovery of drug candidates for the prevention of COVID-19 through the biomedical literature.
URI: http://localhost:8080/xmlui/handle/123456789/7631
ISSN: 1658-3612
Appears in Collections:Vol 18 No 4 (2023)

Files in This Item:
File Description SizeFormat 
787-801.pdf787-8013.13 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.