ZTE Communications ›› 2026, Vol. 24 ›› Issue (1): 81-87.DOI: 10.12142/ZTECOM.202601011

• Special Topic • Previous Articles     Next Articles

Steel Surface Anomaly Detection Using 3D Depth and 2D RGB Features

Zheng Wangguandong1, Lu Ping2, Deng Fangwei2, Huang Shijun2, Xia Siyu1()   

  1. 1.Southeast University, Nanjing 210096, China
    2.ZTE Corporation, Shenzhen 518057, China
  • Received:2024-09-27 Online:2026-03-25 Published:2026-03-17
  • About author:Zheng Wangguandong is pursuing his master’s degree at the School of Automation, Southeast University, China. His research interests focus on artificial intelligence and computer vision, with a specific specialization in image and video generation. He has published four CCF-A conference papers and possesses extensive research experience in image segmentation and object detection.
    Lu Ping is the executive deputy director of the National Key Laboratory of Mobile Networks and Mobile Multimedia Technology, China. His research areas encompass cloud computing, big data, augmented reality, and multimedia serviceization. He leads and participates in major national science and technology projects and national science and technology support programs. He has published numerous academic papers and is the author of the books Internet of Things Capability Development and Application and Big Data Technology and Application in Cloud Computing.
    Deng Fangwei is a senior strategic planner at ZTE Corporation, specializing in industry-specific digital infrastructure, mobile robots, and supporting products for industrial digital transformation.
    Huang Shijun is a senior strategic planner at ZTE Corporation, with research interests encompassing machine vision, artificial intelligence, computer vision, and deep learning.
    Xia Siyu (xsy@seu.edu.cn) received his BE and MS degrees in automation engineering from Nanjing University of Aeronautics and Astronautics, China in 2000 and 2003, respectively, and the PhD degree in pattern recognition and intelligence systems from Southeast University, China in 2006. He is currently an associate professor with the School of Automation, Southeast University. His research interests include object detection, applied machine learning, social media analysis, and intelligent vision systems. He was a recipient of the Science Research Famous Achievement Award from the Higher Institution of China in 2015. He has served as a reviewer for many journals including IEEE T-PAMI, T-IP, T-SMCB, T-IFS, T-MM, and Neurocomputing. He received the Outstanding Reviewer Award for Neurocomputing in 2016. He has also served on the PC/SPC for conferences including CVPR, AAAI, ACM MM, and IJCAI. He is a member of the ACM and IEEE.
  • Supported by:
    ZTE Industry?University?Institute Cooperation Funds(HC?CN?20221107001)

Abstract:

The detection of steel surface anomalies has become an industrial challenge due to variations in production equipment, processes, and steel characteristics. To alleviate the problem, this paper proposes a detection and localization method combining 3D depth and 2D RGB features. The framework comprises three stages: defect classification, defect location, and warpage judgment. The first stage uses a data-efficient image Transformer model, the second stage utilizes reverse knowledge distillation, and the third stage performs feature fusion using 3D depth and 2D RGB features. Experimental results show that the proposed algorithm achieves relatively high accuracy and feasibility, and can be effectively used in industrial scenarios.

Key words: anomaly detection, anomaly localization, feature fusion, reverse distillation