ZTE Communications ›› 2014, Vol. 12 ›› Issue (4): 16-22.DOI: DOI:10.3969/j.issn.1673-5188.2014.04.003
Zhenjiang Dong1, Lixia Liu1, Bin Wu2, and Yang Liu2
Zhenjiang Dong1, Lixia Liu1, Bin Wu2, and Yang Liu2
摘要: This paper proposes an analytical mining tool for big graph data based on MapReduce and bulk synchronous parallel (BSP) computing model. The tool is named Mapreduce and BSP based Graph-mining tool (MBGM). The core of this mining system are four sets of parallel graph-mining algorithms programmed in the BSP parallel model and one set of data extraction-transformation-loading (ETL) algorithms implemented in MapReduce. To invoke these algorithm sets, we designed a workflow engine which optimized for cloud computing. Finally, a well-designed data management function enables users to view, delete and input data in the Hadoop distributed file system (HDFS). Experiments on artificial data show that the components of graph-mining algorithm in MBGM are efficient.