ZTE Communications ›› 2022, Vol. 20 ›› Issue (4): 15-21.DOI: 10.12142/ZTECOM.202204003

• Special Topic • Previous Articles     Next Articles

Utility-Improved Key-Value Data Collection with Local Differential Privacy for Mobile Devices

TONG Ze, DENG Bowen, ZHENG Lele, ZHANG Tao()   

  1. School of Computer Science and Technology, Xidian University, Xi'an 710071, China
  • Received:2022-09-09 Online:2022-12-31 Published:2022-12-30
  • About author:TONG Ze received his BS degree from Chang’an University, China in 2019, where he is currently pursuing the MS degree with the School of Computer Science and Technology, Xidian University, China. His research interests include differential privacy and network security.|DENG Bowen received his BS degree from Xidian University, China in 2020, where he is currently pursuing the MS degree with the School of Computer Science and Technology, Xidian University. His research interests include differential privacy and social networks.|ZHENG Lele received his BS degree from Xidian University, China in 2018, where he is currently pursuing the PhD degree with the School of Computer Science and Technology, Xidian University. His research interests include differential privacy and the IoT data security.|ZHANG Tao (taozhang@xidian.edu.cn) received his MS and PhD degrees in computer science from Xidian University, China in 2011 and 2015, respectively. He is currently an associate professor with the School of Computer Science and Technology, Xidian University. His research interests include network security and privacy protection.

Abstract:

The structure of key-value data is a typical data structure generated by mobile devices. The collection and analysis of the data from mobile devices are critical for service providers to improve service quality. Nevertheless, collecting raw data, which may contain various personal information, would lead to serious personal privacy leaks. Local differential privacy (LDP) has been proposed to protect privacy on the device side so that the server cannot obtain the raw data. However, existing mechanisms assume that all keys are equally sensitive, which cannot produce high-precision statistical results. A utility-improved data collection framework with LDP for key-value formed mobile data is proposed to solve this issue. More specifically, we divide the key-value data into sensitive and non-sensitive parts and only provide an LDP-equivalent privacy guarantee for sensitive keys and all values. We instantiate our framework by using a utility-improved key value-unary encoding (UKV-UE) mechanism based on unary encoding, with which our framework can work effectively for a large key domain. We then validate our mechanism which provides better utility and is suitable for mobile devices by evaluating it in two real datasets. Finally, some possible future research directions are envisioned.

Key words: key-value data, local differential privacy, mobile devices, privacy-preserving data collection