[an error occurred while processing this directive]

ZTE Communications ›› 2024, Vol. 22 ›› Issue (3): 91-98.DOI: 10.12142/ZTECOM.202403011

• Review • 上一篇    下一篇

  

  • 收稿日期:2023-09-14 出版日期:2024-09-25 发布日期:2024-09-29

Multi-View Image-Based 3D Reconstruction in Indoor Scenes: A Survey

LU Ping1,2(), SHI Wenzhe1,2(), QIAO Xiuquan3   

  1. 1.State Key Laboratory of Mobile Network and Mobile Multimedia Technology, Shenzhen 518055, China
    2.ZTE Corporation, Shenzhen 518057, China
    3.State Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing 100876, China
  • Received:2023-09-14 Online:2024-09-25 Published:2024-09-29
  • About author:LU Ping (lu.ping@zte.com.cn) is the deputy president of ZTE Corporation, where he is also the general manager of the Industrial Digitalization Solution Dept., and the executive deputy director of State Key Laboratory of Mobile Network and Mobile Multimedia Technology. His research interests include cloud computing, big data, augmented reality, and multimedia service-based technologies. He has supported and participated in multiple major national science and technology projects and national science and technology support projects. He has published multiple papers and authored two books.
    SHI Wenzhe (shi.wenzhe@zte.com.cn) is a strategy planner of ZTE Corporation where he is also an engineer for XRExplore Platform Product Planning and a member of the National Key Laboratory for Mobile Network and Mobile Multimedia Technology. His research interests include indoor visual AR navigation, SFM 3D reconstruction, visual SLAM, real-time cloud rendering, VR, and spatial perception.
    QIAO Xiuquan is currently a full professor with Beijing University of Posts and Telecommunications, China, where he is also the deputy director of the Network Service Foundation Research Center, State Key Laboratory of Networking and Switching Technology. He has authored or co-authored over 60 technical papers in international journals and at conferences, including the IEEE Communications Magazine, Proceedings of IEEE, Computer Networks, IEEE Internet Computing, IEEE Transactions on Automation Science and Engineering, and ACM SIGCOMM Computer Communication Review. His current research interests include the future Internet, services computing, computer vision, distributed deep learning, augmented reality, virtual reality, and 5G networks. Dr. QIAO was a recipient of the Beijing Nova Program in 2008 and the First Prize of the 13th Beijing Youth Outstanding Science and Technology Paper Award in 2016. He served as the associate editor for Computing (Springer) and the editor board of China Communications.
  • Supported by:
    ZTE Industry?University?Institute Cooperation Funds(HC?CN?20221102002)

Abstract:

Three-dimensional reconstruction technology plays an important role in indoor scenes by converting objects and structures in indoor environments into accurate 3D models using multi-view RGB images. It offers a wide range of applications in fields such as virtual reality, augmented reality, indoor navigation, and game development. Existing methods based on multi-view RGB images have made significant progress in 3D reconstruction. These image-based reconstruction methods not only possess good expressive power and generalization performance, but also handle complex geometric shapes and textures effectively. Despite facing challenges such as lighting variations, occlusion, and texture loss in indoor scenes, these challenges can be effectively addressed through deep neural networks, neural implicit surface representations, and other techniques. The technology of indoor 3D reconstruction based on multi-view RGB images has a promising future. It not only provides immersive and interactive virtual experiences but also brings convenience and innovation to indoor navigation, interior design, and virtual tours. As the technology evolves, these image-based reconstruction methods will be further improved to provide higher quality and more accurate solutions to indoor scene reconstruction.

Key words: 3D reconstruction, MVS, NeRF, neural implicit surface