3D Point Cloud Semantic Segmentation: State of the Art and Challenges
-
摘要:
随着获取点云数据成本下降以及GPU算力的提高,众多三维视觉场景如自动驾驶、工业控制、MR/XR对三维语义分割的需求日益旺盛,这进一步推动了深度学习模型在三维点云语义分割任务中的发展。近期,深度学习模型在网络架构上持续创新,如RandLA-Net 和Point Transformer,并突破性地以更低的计算成本提高了分割准确率,但已有的三维点云语义分割综述介绍的研究工作包含大量早期以及被舍弃的方法,没有系统地整理这些新型高效的方法,不能很好地体现研究现状。此外,这部分综述以输入网络的不同数据类型分类各点云语义分割方法,不能有效地体现各方法的演进关系,也不利于对比不同方法的分割性能。针对以上问题,本文面向近3年的研究成果和最新的研究进展,重点归纳了三维点云语义分割中基于不同网络架构的方法、面临的挑战及潜在研究方向,并从3个层面对三维点云语义分割进行了系统地综述。通过本文,读者可以较系统地了解三维点云语义分割的数据获取方式、常见数据集及模型的评价指标,对比基于不同网络架构的三维点云语义分割方法的发展过程、分割性能和优缺点,并进一步认识三维点云语义分割现存的挑战和潜在的研究方向。
Abstract:With the reduction in the cost of acquiring 3D point cloud data and the improvement of GPU computing power, the demand for 3D point cloud semantic segmentation in numerous 3D visual scenarios, such as autonomous driving, industrial control, and MR/XR is growing, which further advances the development of deep learning methods in 3D point cloud semantic segmentation. Recently, many novel deep learning network architectures, such as RandLA-Net and Point Transformer, have been proposed. These new architectures have made breakthroughs in upgrading semantic segmentation accuracy with a lower computational load. However, the existing researches reviewing 3D point cloud semantic segmentation methods reported a lots of relatively early works, whose methods were gradually abandoned these years and cannot reflect the current research status without systematically organizing these new and efficient methods. In addition, the methods introduced in these researches were divided into categories depending on their different input data types, which cannot provide a progressive view of the relationship between methods using different network architectures and is not conducive to comparing the segmentation performance of different methods. Therefore, this paper is given with a focus on the mainstream of the last 3 years 3D semantic segmentation methods using different deep learning network architectures, and is conducted at 3 levels: First, two main 3D point cloud data acquisition methods with their usual datasets and the metrics to evaluate model performance are introduced. Second, a systematic review of 3D semantic segmentation methods based on different network architectures is organized, followed by a statistical analysis based on the performance of models, which are commonly used in different network architectures, on two 3D segmentation datasets S3DIS and ScanNet to these architectures, including their structure relevance, superiors and inferiors. Finally, an insightful discussion of the remaining challenges on both methodological and applications' viewpoints, and its leading potential research directions are given. This paper provides a comprehensive review of recent 3-year research progress in 3D point cloud semantic segmentation. It summarizes different network architecture pipelines, illustrates their basic operations, compares the model performance for different architectures and discusses their highlights and shortcomings, more importantly, concludes the current challenges to provide promising research directions for the future. Thus, researchers can easily find the relevance and research hotspots between different 3D point cloud semantic segmentation methods based on these analyses. Through this paper, we aim to update the reviews on 3D point cloud semantic segmentation methods with a better viewpoint, highlight key properties and contributions of proposed methods so as to offer research directions from main challenges.
-
Key words:
- 3D vision /
- point cloud /
- semantic segmentation /
- deep learning /
- network framework
-

计量
- 文章访问数: 128
- HTML全文浏览量: 7
- PDF下载量: 23
- 被引次数: 0