ZTE Communications ›› 2020, Vol. 18 ›› Issue (2): 31-39.DOI: 10.12142/ZTECOM.202002005

• Special Topic • Previous Articles     Next Articles

Communication-Efficient Edge AI Inference over Wireless Networks

YANG Kai, ZHOU Yong(), YANG Zhanpeng, SHI Yuanming   

  1. School of Information Science and Technology, ShanghaiTech University, Shanghai 201210, China
  • Received:2020-02-10 Online:2020-06-25 Published:2020-08-07
  • About author:YANG Kai received the B.S. degree in electronic engineering from Dalian University of Technology, China in 2015. He is currently working toward the Ph.D. degree with the School of Information Science and Technology, ShanghaiTech University, China, also with the Shanghai Institute of Microsystem and Information Technology, Chinese Academy of Sciences, China, and also with the University of Chinese Academy of Sciences, Beijing, China. His research interests include data processing and optimization for mobile edge artificial intelligence.|ZHOU Yong (zhouyong@shanghaitech.edu.cn) received the B.S. and M.Eng. degrees from Shandong University, China in 2008 and 2011, respectively, and the Ph.D. degree from The University of Waterloo, Canada in 2015. From November 2015 to January 2018, he worked as a postdoctoral research fellow in the Department of Electrical and Computer Engineering, The University of British Columbia, Canada. Since March 2018, he has been with the School of Information Science and Technology, ShanghaiTech University, where he is currently an assistant professor. His research interests include 5G and beyond, IoT, and edge AI.|YANG Zhanpeng will receive his B.S. degree from Xidian University, China on July 2020. He will join the School of Information Science and Technology, ShanghaiTech University, in Fall 2020. He mainly focuses on developed reconfigurable intelligence surface based 6G wireless technologies for mobile edge AI systems.|SHI Yuanming received the B.S. degree in electronic engineering from Tsinghua University, China in 2011. He received the Ph.D. degree in electronic and computer engineering from The Hong Kong University of Science and Technology (HKUST), China in 2015. Since September 2015, he has been with the School of Information Science and Technology, ShanghaiTech University, China, where he is currently a tenured associate professor. He visited University of California, Berkeley, USA from October 2016 to February 2017. Dr. SHI is a recipient of the 2016 IEEE Marconi Prize Paper Award in Wireless Communications and the 2016 Young Author Best Paper Award by the IEEE Signal Processing Society. He is an editor of IEEE Transactions on Wireless Communications. His research areas include optimization, statistics, machine learning, signal processing, and their applications to 6G, IoT, AI and FinTech.

Abstract:

Given the fast growth of intelligent devices, it is expected that a large number of high-stakes artificial intelligence (AI) applications, e.g., drones, autonomous cars, and tactile robots, will be deployed at the edge of wireless networks in the near future. Therefore, the intelligent communication networks will be designed to leverage advanced wireless techniques and edge computing technologies to support AI-enabled applications at various end devices with limited communication, computation, hardware and energy resources. In this article, we present the principles of efficient deployment of model inference at network edge to provide low-latency and energy-efficient AI services. This includes the wireless distributed computing framework for low-latency device distributed model inference as well as the wireless cooperative transmission strategy for energy-efficient edge cooperative model inference. The communication efficiency of edge inference systems is further improved by building up a smart radio propagation environment via intelligent reflecting surface.

Key words: communication efficiency, cooperative transmission, distributed computing, edge AI, edge inference