基于显式视觉提示的煤岩CT图像裂隙分割模型及应用

靳子越; 李海涛; 杨冠宇; 陈宇龙; 张海宽

doi:10.13374/j.issn2095-9389.2025.01.28.001

基于显式视觉提示的煤岩CT图像裂隙分割模型及应用

Crack segmentation model for coal-rock CT images based on explicit visual prompting and its applications

摘要

摘要: 针对目前煤岩识别算法模型难以准确识别CT (Computed tomography)扫描图像中细小裂隙等问题，提出一种基于显式视觉提示的煤岩CT图像裂隙分割模型（EViP-CTCrack），并在自建的煤岩CT扫描图像数据集CTRock上进行了算法验证. EViP-CTCrack主要由残差混合连接卷积模块、交叉注意力上采样模块、多代表性向量分类器和显式视觉提示生成器等模块组成. 实验证明，EViP-CTCrack在CTRock数据集上的平均交并比和精确率分别达到了88.1%和94.4%，取得了良好的裂隙分割效果. 最后，将该模型应用于矿井钻孔岩心裂隙识别，建立了孔隙度–抗压强度方程，可以快速推算其单轴抗压强度.

Abstract: Various defect structures exist in coal and rock masses, including cracks and pores, which significantly affect the occurrence and migration of coalbed methane. Additionally, the instability and destruction of coal rock structures are closely related to the changes and expansion of coal rock pore and crack structures, leading to mining dynamic disasters. Therefore, identifying coal rock cracks is of great practical significance for studying the migration law of coal seam gas and the stability of the surrounding rock. The models proposed in recent studies for improving the identification accuracy of internal cracks in coal and rock masses still have the following shortcomings. (1) The impact of noise interference in CT (Computed tomography) images on model performance is often overlooked. (2) The potential loss of small-crack features during downsampling and the insufficient fusion of multilayer features during upsampling have not received sufficient attention. (3) The differences within the crack categories were not fully considered. To address the challenge of accurately identifying microcracks in CT scan images of coal and rock, this study proposes a crack segmentation model based on explicit visual prompting for coal and rock CT images (EViP-CTCrack) by reorganizing the convolution kernel, down-sampling and up-sampling processes, and classifier functions. This model effectively filters out noise interference and enhances its ability to capture the key details of cracks by introducing an explicit visual cue generator. The segmentation accuracy of the model was improved by improving the downsampling and upsampling strategies, preserving small-crack features, and enhancing multiscale feature fusion capabilities. Using a multi-representative vector classifier, the internal differences in the crack categories can be fully described to improve the generalization ability of the model. The main results show that (1) the crack segmentation model EViP-CTCrack mainly consists of a residual mixed-connection convolution module, a cross-attention upsampling module, a multi-representative vector classifier, and an explicit visual prompt generator module, which can effectively improve the shortcomings of existing models in terms of noise interference, feature loss, and crack category difference description, significantly enhancing the robustness and generalization ability of the model. (2) This study proposes a high-quality coal and rock CT image dataset, CTRock, and conducts a comparison and ablation study on CTRock to verify the effectiveness of the targeted design. The algorithm is validated using the proposed CTRock dataset. The average intersection-to-union ratio and accuracy of EViP-CTCrack on the CTRock dataset were 88.1% and 94.4%, respectively, indicating good crack segmentation results. (3) After CT scanning the rock samples collected from the Xinjie Coal Mine, EViP-CTCrack was used to identify cracks in all layers of the reconstructed data slices. A porosity compressive strength equation was established that can quickly calculate the uniaxial compressive strength and provide new ideas for studying the physical and mechanical properties of coal rock.

HTML全文

参考文献(29)

施引文献

资源附件(0)