化工学报 ›› 2022, Vol. 73 ›› Issue (3): 1270-1279.DOI: 10.11949/0438-1157.20211291

• 过程系统工程 • 上一篇    下一篇

基于二子空间协同训练算法的半监督软测量建模

罗顺桦1(),王振雷1(),王昕2   

  1. 1.华东理工大学能源化工过程智能制造教育部重点实验室,上海 200237
    2.上海交通大学电工与电子技术中心,上海 200240
  • 收稿日期:2021-09-05 修回日期:2021-11-09 出版日期:2022-03-15 发布日期:2022-03-14
  • 通讯作者: 王振雷
  • 作者简介:罗顺桦(1997—),男,硕士研究生,1361616694@qq.com
  • 基金资助:
    国家重点研发计划项目(2018YFB1701103);国家杰出青年科学基金项目(61725301);国家自然科学基金面上项目(61973124)

Semi-supervised soft sensor modeling based on two-subspace co-training algorithm

Shunhua LUO1(),Zhenlei WANG1(),Xin WANG2   

  1. 1.Key Laboratory of Smart Manufacturing in Energy Chemical Process,Ministry of Education, East China University of Science and Technology, Shanghai 200237, China
    2.Center of Electrical & Electronic Technology,Shanghai Jiao Tong University, Shanghai 200240, China
  • Received:2021-09-05 Revised:2021-11-09 Online:2022-03-15 Published:2022-03-14
  • Contact: Zhenlei WANG

摘要:

在工业过程中,存在着辅助变量与主导变量数据比例严重失衡的问题。协同训练算法是其中一种利用无标签数据中的潜在信息以提升学习性能的模型训练方法。然而目前在协同训练软测量建模过程中,学习器之间存在严重的训练特性交叉重叠的问题,这将导致对主导变量的预测性能衰减。针对这一问题,提出基于二子空间协同训练算法的半监督软测量模型two-subspace co-training KNN(TSCO-KNN)。该模型将二子空间分块算法与协同训练算法相结合,利用辅助变量与主成分子空间PCS和残差子空间RS两个特征子空间的相关性程度,将数据变量拆分为两个具有显著差异性的学习数据集,进而使用KNN回归器进行协同训练,共同用于对主导变量的预测。最后在乙烯精馏塔塔顶乙烷浓度和TE过程产品浓度软测量中进行仿真研究,验证本文所提算法的有效性。

关键词: 软测量, 半监督学习, 协同训练, 主成分分析, K近邻算法

Abstract:

In the industrial process, there is a serious imbalance between the auxiliary variable and the dominant variable. Co-training algorithm is one of the model training methods that uses potential information in unlabeled data to improve learning performance. However, there is a problem of overlapping training characteristics between learners in the current collaborative training using in soft sensor, which will lead to a decline in the prediction performance of dominant variables. To solve this problem, this paper proposes a TSCO-KNN semi-supervised soft sensor model based on two-subspace co-training algorithm. The model combines the two-subspace algorithm with the existing co-training algorithm. By analyzing the correlation between the auxiliary variables and the PCS and the RS, the variables are split into two different learning data sets, and then the KNN regressor is used for collaborative training, which is jointly used to predict the key quality variable. Finally, a simulation study was carried out in the soft measurement of the ethane concentration at the top of the ethylene distillation tower and the product concentration of the TE process to verify the effectiveness of the algorithm proposed in this paper.

Key words: soft sensing, semi-supervised, co-training, PCA, KNN

中图分类号: