Rotated object detection method based on adaptive angle classification and dynamic sample matching[J]. Chinese Journal of Engineering. DOI: 10.13374/j.issn2095-9389.2025.06.09.006
Citation: Rotated object detection method based on adaptive angle classification and dynamic sample matching[J]. Chinese Journal of Engineering. DOI: 10.13374/j.issn2095-9389.2025.06.09.006

Rotated object detection method based on adaptive angle classification and dynamic sample matching

  • Rotated object detection (ROD) is a critical subtask in computer vision, especially in real-world applications such as aerial remote sensing and industrial character detection, where objects frequently appear in arbitrary orientations with diverse aspect ratios. Unlike standard object detection that assumes axis-aligned bounding boxes, ROD requires precise estimation of object location and orientation. Conventional rotation regression methods suffer from angle periodicity and discontinuity issues, leading to unstable training and inaccurate predictions. Moreover, densely packed scenes with complex backgrounds make positive and negative sample assignment highly sensitive, often resulting in suboptimal convergence. To tackle these challenges, this work proposes a rotation-aware object detection approach based on YOLOv8, enhanced by two key components: a shape-aware adaptive angle classification strategy and a progressive dynamic matching mechanism.The angle classification strategy replaces traditional continuous angle regression with a discrete classification approach. To achieve this, angle annotations are converted into soft label vectors using a circular Gaussian window function, ensuring the periodic nature of angles is preserved. A novel aspect of this design is the integration of target shape information, where the smoothing parameter of the label distribution is adaptively adjusted based on the object’s aspect ratio. Specifically, for elongated targets such as ships or text lines, a narrow window is applied to enforce sharp classification around the true angle, enabling fine-grained orientation discrimination. Conversely, for square-like or low-aspect-ratio objects, a wider window is applied to tolerate angular ambiguity, stabilizing training across diverse target geometries. This shape-aware mechanism addresses discontinuities at angular boundaries and enhances classification accuracy in multi-oriented detection tasks.To complement the angle classification, a progressive dynamic sample matching mechanism is developed to improve the quality of positive sample selection during training. Instead of relying solely on rotated IoU (rIoU), which is unreliable at early stages when angle predictions are inaccurate, the method begins with horizontal IoU (hIoU) and gradually introduces rIoU via linear interpolation as training proceeds. The final matching score incorporates three components: classification confidence, IoU-based localization quality, and a cosine-based angle consistency term. This unified metric guides the selection of top-K positive samples per ground truth object, ensuring that high-quality matches are emphasized while low-quality or ambiguous matches are suppressed. This progressive transition improves training stability, accelerates convergence, and enhances rotation alignment between predictions and ground truth.Extensive experiments are conducted on two datasets. On the DOTA dataset, which includes multiple object classes with diverse orientations and aspect ratios, the proposed method achieves a mean Average Precision (mAP) of 0.786. This result demonstrates improvements across several categories, particularly those with high aspect ratios such as ships, vehicles, and containers. On a custom industrial character dataset, which consists of densely arranged, multi-oriented alphanumeric components captured under complex conditions, the method achieves a mAP of 0.924. The results validate its ability to generalize to scene text-like tasks. Ablation studies are conducted to isolate the impact of each component. The shape-aware classification contributes a 4.3% improvement in angle-sensitive categories, while the dynamic matching strategy results in smoother loss curves and more concentrated attention heatmaps.The method preserves the anchor-free structure and real-time inference capability of YOLOv8, while significantly enhancing its performance in rotation-sensitive contexts. All modifications are lightweight and easily integrable into existing pipelines without structural changes to the backbone.
  • loading

Catalog

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return