ZTE Communications ›› 2023, Vol. 21 ›› Issue (4): 38-46.DOI: 10.12142/ZTECOM.202304005

• Special Topic • Previous Articles     Next Articles

Point Cloud Processing Methods for 3D Point Cloud Detection Tasks

WANG Chongchong1, LI Yao2, WANG Beibei3, CAO Hong3, ZHANG Yanyong2()   

  1. 1.Anhui University, Hefei 230601, China
    2.University of Science and Technology of China, Hefei 230026, China
    3.Institute of Artificial Intelligence, Hefei Comprehensive National Science Center, Hefei 230026, China
  • Received:2023-10-07 Online:2023-12-25 Published:2023-12-07
  • About author:WANG Chongchong received his BS degree in computer science and technology from Huazhong Agricultural University, China in 2022. He is currently pursuing a master’s degree in computer science and technology at Anhui University, China.
    LI Yao received his BS degree in electronic information engineering from Jilin University, China in 2019. He is currently pursuing a PhD degree in computer science and technology at the University of Science and Technology of China.His research interests include computer vision and intelligent transportation perception system.
    WANG Beibei graduated from University of Science and Technology of China with BS in physics in 2014. He furthered his studies at the University of Southern California, USA, where he obtained PhD in physics in 2020 and MS in computer science in parallel. His research interests currently focus on computer vision and multimodal perception methods for autonomous systems.
    CAO Hong received his PhD degree from Zhejiang University, China in 2014. He is currently a associate research fellow with the Institute of Artificial Intelligence, Hefei Comprehensive National Science Center (Anhui Artificial Intelligence Laboratory), China. His research interests include autonomous driving, roadside perception and robotics.
    ZHANG Yanyong (yanyongz@ustc.edu.cn) received her BS from the University of Science and Technology of China (USTC) in 1997, and PhD from Penn State University in 2002. From 2002 and 2018, she was on the faculty of the Electrical and Computer Engineering Department at Rutgers University, USA. She was also a member of the Wireless Information Networks Laboratory (Winlab). Since July 2018, she joined the school of Computer Science and Technology at USTC. She has 21 years of research experience in the areas of sensor networks, ubiquitous computing, and high-performance computing, and has published more than 140 technical papers in these fields. She received the NSF CAREER award in 2006, and was elevated to IEEE Fellow in 2017. She has served/currently serves as the Associate Editor for several journals, including IEEE/ACM Transactions on Networking, IEEE Transactions on Mobile Computing, IEEE Transactions on Service Computing, IEEE Transactions on Dependable and Secure Computing, and Elsevier Smart Health. She has served on various conference TPCs including DSN, Sensys, Infocom, etc. She is the TPC co-chair of IPSN’22.


Light detection and ranging (LiDAR) sensors play a vital role in acquiring 3D point cloud data and extracting valuable information about objects for tasks such as autonomous driving, robotics, and virtual reality (VR). However, the sparse and disordered nature of the 3D point cloud poses significant challenges to feature extraction. Overcoming limitations is critical for 3D point cloud processing. 3D point cloud object detection is a very challenging and crucial task, in which point cloud processing and feature extraction methods play a crucial role and have a significant impact on subsequent object detection performance. In this overview of outstanding work in object detection from the 3D point cloud, we specifically focus on summarizing methods employed in 3D point cloud processing. We introduce the way point clouds are processed in classical 3D object detection algorithms, and their improvements to solve the problems existing in point cloud processing. Different voxelization methods and point cloud sampling strategies will influence the extracted features, thereby impacting the final detection performance.

Key words: point cloud processing, 3D object detection, point cloud voxelization, bird's eye view, deep learning