Please wait a minute...
IMAGE/TABLE DETAILS
Predicting and interpreting the toxicity of ionic liquids using graph neural network
Haijun FENG, Bingxuan ZHANG, Jian ZHOU
CIESC Journal    2025, 76 (1): 93-106.   DOI: 10.11949/0438-1157.20240663
Abstract   (147 HTML12 PDF(pc) (4371KB)(222)  

Ionic liquids are potentially toxic to the environment, how to control the toxicity of ionic liquids is one of the key factors. To understand their toxicity mechanisms, three traditional machine learning methods (support vector machine, random forest, multilayer perceptron) and three graph neural network models (graph attention network, message passing neural network, graph convolutional model) were established to predict the toxicity of ionic liquids in four living organisms (leukemia rat cell line IPC-81, acetylcholinesterase, Escherichia coli, and Vibrio fischeri). The simplified molecular-input line-entry system (SMILES) of molecules and toxicity lgEC50 values work as the input and output respectively. In the three traditional machine learning methods, extended-connectivity fingerprints (ECFPs) were used to represent molecules. While in the three graph neural network models, molecular graphs were used to represent molecules. Benefiting from molecular structure information, the graph convolutional model (GCM) had lower RMSE and MAE, and higher R2 than other models in all four datasets. Therefore, the GCM model was superior in predicting the toxicity of ionic liquids. Meanwhile, based on the GCM model, an intepretability model was established to analyze the contribution of atomic groups to the toxicity of ionic liquids in a data-driven procedure. The aromatic ring of cations and long alkyl chain could produce toxicity. Atomic groups such as S+, P+, N+, and NH+ could significantly enhance the toxicity of ionic liquids, while atomic groups such as P-, F, B-, and C could effectively reduce the toxicity of ionic liquids. This discovery provides a theoretical basis for rapid screening and development of greener and low-toxicity ionic liquids.


数据集评估指标SVMRFMLPGATMPNNGCM
大鼠白血病细胞(IPC-81)RMSE0.6870.6550.6400.5190.4860.457
MAE0.5100.4360.4670.3790.3530.325
R20.6830.7060.6880.7640.7870.867

对乙酰胆碱酯酶

(AChE)

RMSE0.2290.2860.2430.2950.2460.229
MAE0.1280.1700.1560.2100.1710.184
R20.8140.7460.8250.7160.8220.830

大肠杆菌

E. coli

RMSE0.5450.5470.6650.6480.6100.396
MAE0.3960.4070.4620.4710.4170.324
R20.8320.8110.7750.7600.7730.880

费氏弧菌

Vibrio fischeri

RMSE0.7390.7240.6580.7320.6960.591
MAE0.5240.4600.4760.5240.4920.441
R20.7380.7530.7790.7040.7420.825
Table 1 Performances of different models on the toxicity prediction of ILs from different datasets
Other Images/Table from this Article