物理特征融合的半监督小样本滚动轴承故障诊断

舒畅; 康建雄; 沈艳军; 卢家伟; 杨皓然; 剡昌锋

doi:10.13374/j.issn2095-9389.2025.11.17.001

物理特征融合的半监督小样本滚动轴承故障诊断

Semi-supervised few-shot fault diagnosis method for rolling bearings based on physical feature fusion

摘要

摘要: 轴承故障诊断能够有效预防设备故障、提高生产效率、降低维护成本. 传统的故障诊断方法依赖于大量标注数据，但实际数据稀缺问题限制了其实际应用，无监督学习方法虽然不依赖大量标注数据，但诊断效率较低. 针对这一问题，本文结合改进的分布对齐与数据增强半监督学习框架，提出了基于物理特征融合的半监督小样本轴承故障诊断方法. 利用希尔伯特变换等对轴承振动信号进行处理获得包络信号，从包络信号中提取多个物理特征，运用特征相关性与强度分析筛选最具预测效果的特征集. 采用多尺度残差模块进一步优化网络结构，并在分类阶段引入物理先验正则约束，对分类权重的学习过程进行引导，从而增强模型决策结果与物理机理的一致性. 基于此，设计了物理特征融合的半监督学习模型，对传统ReMixMatch框架中的弱增强与强增强模块进行改进. 结果表明，所提方法在小样本条件下具有优异的性能，平均诊断准确率达到95.4%，F1分数的平均值为95.4%，明显较无监督与其他半监督方法有更好的诊断精度和鲁棒性，对滚动轴承健康监测应用具有重要的意义.

Abstract: Rolling bearing fault diagnosis is a critical task in condition monitoring and intelligent maintenance of rotating machinery, as bearing failures may lead to unexpected downtime, safety risks, and significant economic losses. In recent years, deep learning–based diagnostic methods have shown strong capability in extracting discriminative features from complex vibration signals. However, most existing supervised approaches rely heavily on large-scale labeled datasets, which are difficult and costly to obtain in real industrial environments. Although unsupervised learning methods reduce the dependence on labeled data, their diagnostic accuracy and robustness are often insufficient for engineering applications. Therefore, developing an effective fault diagnosis method that can achieve high performance under limited labeled data conditions remains a challenging problem. To address this issue, in this study, a semi-supervised few-shot fault diagnosis method is proposed for rolling bearings based on physical feature fusion. The proposed method is developed upon an improved framework of semi-supervised learning with distribution matching and augmentation anchoring (ReMixMatch), integrates physical feature engineering with a physics-guided deep neural network architecture, forming a Semi-Supervised learning with physical feature fusion (1D–ViMix) model. The core objective is to fully exploit a small number of labeled samples together with a large amount of unlabeled data, while enhancing model interpretability through the incorporation of physical prior knowledge. Specifically, raw bearing vibration signals are initially processed via band-pass filtering and Hilbert transform to obtain envelope signals, which effectively suppress low-frequency background components and highlight fault-induced modulation characteristics. Based on the envelope signals and their envelope spectra, multiple physical features are extracted, including time-domain statistical indicators, envelope spectrum frequency-domain features, and segmented frequency-band energy features. To construct a compact and discriminative physical feature set, a systematic feature selection strategy is adopted. This strategy combines single-feature predictive performance evaluation using Macro–F1 scores, Pearson correlation analysis to identify redundant features, and a forward greedy search algorithm with correlation constraints, ensuring a balance between diagnostic performance, stability, and feature diversity. In terms of network design, a deep one-dimensional convolutional neural network is employed as the backbone to capture temporal characteristics of vibration signals. Multi-scale residual blocks are incorporated to enhance the representation of fault-related patterns across different time scales and frequency components. Furthermore, a physical prior regularization mechanism is introduced at the classification stage to guide the learning of classification weights. This regularization encourages the model to assign higher importance to strongly correlated physical features while suppressing weakly related ones, thereby improving consistency between model decisions and bearing fault mechanisms and enhancing interpretability. To improve the effectiveness of semi-supervised learning for vibration signals, the original ReMixMatch framework is further optimized. Pseudo-label generation is stabilized using exponential moving average smoothing, followed by distribution alignment and sharpening to mitigate class imbalance and improve label confidence. Additionally, signal-oriented weak and strong augmentation strategies, including noise injection, random masking, and amplitude scaling, are designed to effectively match the characteristics of mechanical vibration data. Consistency regularization and mixup are jointly applied to labeled and unlabeled samples, enabling more effective utilization of unlabeled data and improving model generalization under few-shot conditions. Experimental validation is conducted on the rolling bearing fault dataset from Paderborn University under few-shot learning scenarios, where only a limited number of labeled samples are available for each fault category. Extensive experiments, including multiple random trials, comparative studies with representative semi-supervised and small-sample learning methods, ablation tests, and interpretability analysis, are performed. The proposed 1D-ViMix model achieves an average diagnostic accuracy and F1-score of 95.4%, significantly outperforming the comparison methods. The results demonstrate that the integration of physical feature priors, multi-scale feature modeling, and improved semi-supervised learning strategies effectively enhances diagnostic accuracy, robustness, and interpretability. This study provides a practical and explainable solution for rolling bearing fault diagnosis under limited labeled data conditions in real industrial applications.

HTML全文

参考文献(38)

施引文献

资源附件(0)