ZTE Communications ›› 2023, Vol. 21 ›› Issue (3): 70-76.DOI: 10.12142/ZTECOM.202303010

• Research Papers • Previous Articles     Next Articles

Log Anomaly Detection Through GPT-2 for Large Scale Systems

JI Yuhe1, HAN Jing2(), ZHAO Yongxin1, ZHANG Shenglin1, GONG Zican2   

  1. 1.Nankai University, Tianjin 300071, China
    2.ZTE Corporation, Shenzhen 518057, China
  • Received:2022-12-08 Online:2023-09-21 Published:2023-03-22
  • About author:JI Yuhe received his bachelor’s degree in software engineering from the College of Software, Nankai University, China in 2022. He is now pursuing his master’s degree at the School of Software, Nankai University. His research interests include anomaly detection and natural language processing.|HAN Jing (han.jing28@zte.com.cn) received her master’s degree from Nanjing University of Aeronautics and Astronautics, China. She has been with ZTE Corporation since 2000. She had been engaged in 3G/4G key technologies, from 2000 to 2016, and has become a technical director responsible for intelligent operation of cloud platforms and wireless networks since 2016. Her research interests include machine learning, data mining, and signal processing.|ZHAO Yongxin received her bachelor’s degree in software engineering from Nankai University, China in 2021. She is currently pursuing her master’s degree at the School of Software, Nankai University. Her research interests include anomaly detection and failure diagnosis.|ZHANG Shenglin received his BS degree in network engineering from the School of Computer Science and Technology, Xidian University, China in 2012 and PhD degree in computer science from Tsinghua University, China in 2017. He is currently an associate professor with the College of Software, Nankai University, China. His current research interests include failure detection, diagnosis and prediction for service management. He is an IEEE Member.|GONG Zican received his master’s degree in professional computing and artificial intelligence from the Australian National University in 2019. He has been a machine learning engineer in ZTE Corporation since 2020. His research interests include machine learning, professional computing and system architecture.

Abstract:

As the scale of software systems expands, maintaining their stable operation has become an extraordinary challenge. System logs are semi-structured text generated by the recording function in the source code and have important research significance in software service anomaly detection. Existing log anomaly detection methods mainly focus on the statistical characteristics of logs, making it difficult to distinguish the semantic differences between normal and abnormal logs, and performing poorly on real-world industrial log data. In this paper, we propose an unsupervised framework for log anomaly detection based on generative pre-training-2 (GPT-2). We apply our approach to two industrial systems. The experimental results on two datasets show that our approach outperforms state-of-the-art approaches for log anomaly detection.

Key words: hybrid beamforming, hybrid architecture, weighted mean square error, manifold optimization, dynamic subarrays