PREDICTING DRUG TOXICITY ON TOX21 WITH A MULTI-TASK GNN-TRANSFORMER MODEL
Volume 4, Issue 1, Pp 13-18, 2026
DOI: https://doi.org/10.61784/wjit3077
Author(s)
YuHui Li
Affiliation(s)
School of pharmacy, Shenyang Pharmaceutical University, Benxi 117004, Liaoning, China.
Corresponding Author
YuHui Li
ABSTRACT
As research into medicinal chemical synthesis deepens, an increasing number of novel drug molecules with promising clinical therapeutic potential are emerging. During drug development, many compounds are discarded due to excessive toxic side effects, while traditional toxicity testing faces challenges of high cost and lengthy timelines. To enable rapid toxicity assessment by researchers, this paper proposes a drug toxicity prediction model (TGT) based on GNN-Transformer. The model was constructed and validated using the Tox21 dataset, with caffeine selected for practical generalisation testing. The Tox21 dataset encompasses toxicity test results for diverse compounds, providing high-quality data. The model architecture leverages graph neural networks (GNN) to process molecular graph-structured data, effectively capturing complex topological relationships and chemical information within molecules. This transforms molecular graph structures into meaningful node feature representations. The Transformer component, with its exceptional sequence modelling capabilities, further learns from GNN-extracted features, capturing long-range dependencies between molecular structures. Through training and optimisation, the model demonstrated commendable performance in toxicity prediction tasks, achieving an average AUC of 0.7488 on the validation set. Its high accuracy in predicting drug toxicity was further validated through practical application on caffeine molecules, establishing it as an efficient and precise predictive tool for early-stage drug safety assessment.The GNN-Transformer drug toxicity prediction model proposed in this study enhances prediction reliability by integrating multi-task learning with interpretability techniques. It serves as an auxiliary pre-screening tool for drug development, thereby helping to shorten the research and development cycle and reduce costs.
KEYWORDS
GNN-Transformer model; Drug toxicity prediction; Tox21 dataset; Molecular structure data
CITE THIS PAPER
YuHui Li. Predicting drug toxicity on Tox21 with a multi-task GNN-Transformer model. World Journal of Information Technology. 2026, 4(1): 13-18. DOI: https://doi.org/10.61784/wjit3077.
REFERENCES
[1] Li Jianqing, Wang Tianqin, Teng Yuefa, et al. Machine learning prediction model for emerging pollutants-induced activities of 12 nuclear receptors. Chromatography, 2025, 43(08): 959-970.
[2] Mori P M, Bortner D C, Touny E L, et al. Unusual and novel mitochondrial-driven mechanism of toxicity from Tox21 dataset. Free Radical Biology and Medicine, 2025, 240(S1): S29.
[3] Luo X, Zhang L, Sakamuru S, et al. Systematic evaluation of Tox21 compounds that target β-adrenergic receptors and their role in cardiotoxicity. Toxicology and applied pharmacology, 2025, 505117567.
[4] Thomas RS, Paules RS, Simeonov A, et al. The US Federal Tox21 Program: A strategic and operational plan for continued leadership. ALTEX, 2018, 35(2): 163-168.
[5] Gabriel I, Sundar T, Joseph L, et al. Structure–activity relationship-based chemical classification of highly imbalanced Tox21 datasets. Journal of Cheminformatics, 2020, 12(1): 66.
[6] Mayr A, EMayr A, EKlambauer G, et al. DeepTox: Toxicity Prediction using Deep Learning. Frontiers in Environmental Science, 2016, 3.
[7] Sun Y, Zhu D, Wang Y, et al. GTC: GNN-Transformer co-contrastive learning for self-supervised heterogeneous graph representation. Neural Networks, 2025, 181, 106645.
[8] Wei X H, Xu Z W, Wang X W, et al. Harnessing Geospatial and Temporal Information:GNN-Transformer Application to MJO Prediction. Journal of Jilin University (Natural Science Edition), 2025, 63(01): 67-75.

Download as PDF