轻量化YOLO–RDD路面病害检测算法

吴杰; 张舣航; 韩贝林; 邹超; 辛莉峰; 黄仕平

doi:10.13374/j.issn2095-9389.2026.01.20.001

轻量化YOLO–RDD路面病害检测算法

Lightweight YOLO–RDD pavement distress detection algorithm

摘要

摘要: 针对道路服役年限增长导致的路面病害增多，以及复杂路面背景下细小裂缝等病害易漏检、误检的问题，本文提出一种基于YOLOv11n的改进轻量化目标检测算法YOLO–RDD，面向车载平台与无人机等资源受限场景，兼顾检测精度与部署效率. 路面病害具有目标尺度小、对比度低、形态细长、背景纹理干扰强等特点，常规单阶段检测模型在下采样与跨尺度融合过程中容易削弱边界与弱纹理线索，造成裂缝类目标定位不稳与漏检. 为此，YOLO–RDD在结构与特征表征环节进行任务导向优化. 首先，在主干网络中采用结构重参数化的RepGELAN模块替代传统C3k2模块，通过同层渐进聚合与跨阶段信息回流增强多感受野表达，使细窄裂缝的边缘与弱纹理响应在深层传播中不易衰减，并在推理阶段折叠多分支结构以保持高效推理. 其次，在颈部网络中结合动态上采样单元（DSU）与深度可分离卷积，设计DySlim–Neck轻量化融合结构，在跨层融合前进行内容自适应对齐，减轻固定插值与拼接带来的错位与过度平滑，同时降低融合开销并强化小目标信息传递. 最后，在检测头部分引入动态权重检测头（DynamicHead），对多尺度特征进行自适应重整，缓解细小病害目标分类与回归耦合不足导致的漏检问题，提升复杂背景下的预测稳定性. 实验采用F1分数、平均精度均值（mAP）与模型参数量等指标进行评估. RDD2022数据集上结果表明，和基准模型YOLOv11n相比，模型参数量减少19.7%的同时，YOLO–RDD的F1分数与mAP@50分别提升1.0和2.3个百分点，整体检测性能更稳健，可有效提升裂缝与坑洼等病害目标的识别与定位效果，为路面病害自动化巡检提供了可行的轻量化实现路径.

Abstract: As pavements age, sustained traffic, environmental degradation, and material aging lead to increasing occurrences of cracks, potholes, and other distresses, threatening safety and infrastructure longevity. Textured road networks, combined with varying illumination, narrow geometries, and low-contrast cracks, make reliable detection challenging, leading to missed detections and false positives. Although deep learning has advanced automated pavement assessment, common lightweight single-stage detectors struggle to balance accuracy and speed on edge devices such as vehicle-mounted systems and unmanned aerial vehicles (UAVs). This is due to the fact that repeated downsampling, fixed interpolation, and coarse cross-scale fusion blur crack boundaries and weaken texture cues, thereby reducing recall and degrading localization. To address these challenges, this study introduces YOLO-RDD, a lightweight, task-specific pavement distress detection algorithm derived from YOLOv11n architecture. Designed explicitly for the intrinsic characteristics of pavement distress, including small spatial extent, high aspect ratio, low-intensity contrast relative to the surrounding pavement, and susceptibility to masking by heterogeneous background textures, YOLO-RDD achieves a principled balance between detection accuracy and real-time deployability on edge platforms. This method systematically strengthens the feature fidelity, cross-scale alignment, and context-aware prediction via targeted architectural refinement across the backbone, neck, and detection head. First, in the backbone network, the original C3k2 modules are replaced with a structurally reparametrized RepGELAN module. RepGELAN integrates progressive intralayer feature aggregation with cross-stage information feedback, thereby expanding the effective receptive fields while preserving high-frequency edge responses and weak texture discriminability, which are critical for detecting narrow, low-contrast cracks. Structural reparameterization enables the equivalent conversion of its multibranch training-time architecture into a single standard convolution at inference time, ensuring minimal computational overhead without sacrificing representational expressiveness. Second, for the neck network, we propose DySlim-Neck, a lightweight, semantic-aware fusion architecture that synergistically combines a dynamic upsampling unit (DSU) with depthwise- separable convolutions. Before cross-layer fusion, the DSU performs content-adaptive alignment, explicitly correcting geometric misalignment and suppressing over-smoothing artifacts commonly induced by fixed-resolution interpolation and naïve concatenation. Coupled with mixed convolutional aggregation, DySlim-Neck significantly reduces the fusion-induced latency and memory footprint while maintaining high-fidelity propagation of small-object features across scales, thereby safeguarding fine crack morphology and continuity. Third, the detection head adopts DynamicHead, which dynamically reweighs and reorganizes multiscale features along scale, spatial, and channel dimensions based on target-specific semantic cues. This adaptive coupling explicitly addresses the classification–regression decoupling problem prevalent in small, weakly contrasted targets, enhancing localization confidence and scale-invariant prediction robustness under heterogeneous pavement backgrounds. Furthermore, an embedded dynamic activation mechanism selectively suppresses background clutter responses while amplifying discriminative crack signatures, yielding higher precision in ambiguous regions. The proposed algorithm is rigorously evaluated on the RDD2022 benchmark dataset using four complementary metrics: F1-score (harmonic mean of precision and recall), mean Average Precision (mAP) at IoU thresholds of 0.5 and 0.5:0.95, parameter count (in millions), and computational cost measured as GFLOPs and inference latency (ms) on an embedded Jetson AGX Orin platform. Experimental results demonstrate that, relative to the YOLOv11n baseline, YOLO–RDD achieves a 19.7% reduction in parameter count while delivering consistent gains across detection robustness metrics—improving F1-score by 1.0 percentage point and mAP@0.5 by 2.3 percentage points—without compromising mAP@0.5:0.95. indicating enhanced localization consistency under varying overlap criteria.. Notably, ablation-guided analysis confirms that these improvements are particularly pronounced for fine-scale distress: the detection recall for narrow cracks (< 2 pixels wide) increases by 8.6%, and the pothole localization accuracy (measured by the bounding-box IoU) improves by 5.4%, especially under low-illumination conditions and highly textured asphalt surfaces. YOLO–RDD excels in pavement defect detection, particularly in narrow cracks. However, the stability decreases under real-world challenges such as uneven lighting, dirt, varying camera angles, and limited training diversity. The next step involves: (1) expanding real-world road data across times, road types, and capture conditions, as well as augmenting the generative crack/damage samples to boost generalization; and (2) adopting transfer and continual learning for cross-region adaptation and fusing multi-source data into a pavement health assessment framework to accelerate engineering deployment.

HTML全文

参考文献(28)

施引文献

资源附件(0)