CIESC Journal ›› 2024, Vol. 75 ›› Issue (3): 914-923.DOI: 10.11949/0438-1157.20231370

• Process system engineering • Previous Articles     Next Articles

Machine learning-assisted solvent molecule design for efficient absorption of ethanethiol

Yuxiang CHEN1(), Chuanlei LIU1, Zijun GONG2, Qiyue ZHAO1, Guanchu GUO1, Hao JIANG1, Hui SUN1,3,4(), Benxian SHEN1   

  1. 1.Petroleum Processing Research Center, East China University of Science and Technology, Shanghai 200237, China
    2.Institute of Process Engineering, Chinese Academy of Sciences, Beijing 100190, China
    3.International Joint Research Center of Green Energy Chemical Engineering, East China University of Science and Technology, Shanghai 200237, China
    4.Key Laboratory of Oil & Gas Fine Chemicals Ministry of Education & Xinjiang Uyghur Autonomous Region, Xinjiang University, Urumqi 830046, Xinjiang, China
  • Received:2023-12-25 Revised:2024-02-06 Online:2024-05-11 Published:2024-03-25
  • Contact: Hui SUN

机器学习辅助乙硫醇高效吸收溶剂分子设计

陈宇翔1(), 刘传磊1, 龚子君2, 赵起越1, 郭冠初1, 姜豪1, 孙辉1,3,4(), 沈本贤1   

  1. 1.华东理工大学石油加工研究所,上海 200237
    2.中国科学院过程工程研究所,北京 100190
    3.华东理工大学绿色能源化工国际联合研究中心,上海 200237
    4.新疆大学石油天然气精细化工教育部重点实验室,新疆 乌鲁木齐 830046
  • 通讯作者: 孙辉
  • 作者简介:陈宇翔(1996—),男,博士研究生,chenyuxiang0425@gmail.com
  • 基金资助:
    国家自然科学基金项目(21878097);上海市自然科学基金项目(21ZR1417700)

Abstract:

To solve the problems of low organic sulfur removal efficiency, long solvent development cycle and high cost in the traditional amine elution desulfurization process, the quantitative structure-activity relationship (QSPR) model for ethanethiol solubility was established by using seven machine learning algorithms. Besides, the absorption mechanism of ethanethiol was elucidated by using the SHapley Additive exPlanations (SHAP) method and the virtual screening for candidate molecules was conducted to identify efficient solvents for the absorption removal of ethanethiol. Molar solubilities of ethanethiol in 14732 solvents, which cover a wide range of chemical space, were calculated by using the conductor-like screening model for real solvents (COSMO-RS). XGBoost was identified as the optimal algorithm for predicting the molar solubility of ethanethiol, having Rtest2 of 0.66, RMSE of 1.22, and MAE of 0.84. The complexity of molecular structure, covalent bonding, and electron distribution in molecules were identified as the key factors for the molar solubility of ethanethiol. Four solvents, including 3-ethoxypropylamine, 3-diethylaminopropylamine, 1,4-dimethylpiperazine, and 3-butoxypropylamine were identified as potential solvents. The results of the equilibrium solubility determination experiments show that 3-butoxypropylamine has the best ethanethiol dissolution with Henry’s law constant of 37.34 kPa.

Key words: molecule design, machine learning, solubility, absorption

摘要:

针对传统胺洗脱硫工艺中有机硫脱除效率低,溶剂开发周期长、成本高等问题,利用7种机器学习算法建立了乙硫醇溶解度的定量构效关系模型,运用SHAP方法阐释了乙硫醇的吸收机理,对备选分子库进行了虚拟筛选,识别出高效吸收脱除乙硫醇的溶剂。基于COSMO-RS模型计算了14732种溶剂的乙硫醇溶解度,这些分子覆盖了广泛的化学空间;XGBoost算法在预测乙硫醇溶解度方面表现最佳,该算法的Rtest2为0.66,RMSE为1.22,MAE为0.84;分子结构的复杂程度、共价键分布、电荷分布是影响乙硫醇溶解能力的关键因素;确定了4种候选溶剂:3-乙氧基丙胺、3-二乙胺基丙胺、1,4-二甲基哌嗪和3-丁氧基丙胺;平衡溶解度测定实验的结果表明3-丁氧基丙胺的乙硫醇吸收性能最优,亨利常数为37.34 kPa。

关键词: 分子设计, 机器学习, 溶解度, 吸收

CLC Number: