Please wait a minute...
IMAGE/TABLE DETAILS
Predicting and interpreting the toxicity of ionic liquids using graph neural network
Haijun FENG, Bingxuan ZHANG, Jian ZHOU
CIESC Journal    2025, 76 (1): 93-106.   DOI: 10.11949/0438-1157.20240663
Abstract   (147 HTML12 PDF(pc) (4371KB)(222)  

Ionic liquids are potentially toxic to the environment, how to control the toxicity of ionic liquids is one of the key factors. To understand their toxicity mechanisms, three traditional machine learning methods (support vector machine, random forest, multilayer perceptron) and three graph neural network models (graph attention network, message passing neural network, graph convolutional model) were established to predict the toxicity of ionic liquids in four living organisms (leukemia rat cell line IPC-81, acetylcholinesterase, Escherichia coli, and Vibrio fischeri). The simplified molecular-input line-entry system (SMILES) of molecules and toxicity lgEC50 values work as the input and output respectively. In the three traditional machine learning methods, extended-connectivity fingerprints (ECFPs) were used to represent molecules. While in the three graph neural network models, molecular graphs were used to represent molecules. Benefiting from molecular structure information, the graph convolutional model (GCM) had lower RMSE and MAE, and higher R2 than other models in all four datasets. Therefore, the GCM model was superior in predicting the toxicity of ionic liquids. Meanwhile, based on the GCM model, an intepretability model was established to analyze the contribution of atomic groups to the toxicity of ionic liquids in a data-driven procedure. The aromatic ring of cations and long alkyl chain could produce toxicity. Atomic groups such as S+, P+, N+, and NH+ could significantly enhance the toxicity of ionic liquids, while atomic groups such as P-, F, B-, and C could effectively reduce the toxicity of ionic liquids. This discovery provides a theoretical basis for rapid screening and development of greener and low-toxicity ionic liquids.


数据集正贡献原子基团毒性贡献正权重负贡献原子基团毒性贡献负权重
大鼠白血病细胞(IPC-81)S+2.003P--0.960
N+0.975C-0.649
NH+0.937CH2-0.562
Cl-0.612B--0.545
NH20.595P+-0.432
对乙酰胆碱酯酶(AChE)P+1.985F-0.427
N+0.974P-0.366
CH0.659C-0.362
C0.604S-0.289
S0.589N-0.284
大肠杆菌(E. coliN+1.293B--0.722
S0.858P--0.644
CH30.823N+-0.399
P+0.819C-0.399
O-0.570S-0.385
费氏弧菌(Vibrio fischeriNH+1.604P--1.206
N-1.598N-0.930
NH1.306S-0.766
N+1.154C-0.759
P0.907CH2-0.594
Table 3 Top 5 atomic weights for both positive and negative contributions in 4 datasets
Other Images/Table from this Article