Deep Neural Network-Based Chinese Semantic Role Labeling

doi:10.3969/j.issn.1673-5188.2017.S2.010

Abstract

Abstract:

A recent trend in machine learning is to use deep architectures to discover multiple levels of features from data, which has achieved impressive results on various natural language processing (NLP) tasks. We propose a deep neural network-based solution to Chinese semantic role labeling (SRL) with its application on message analysis. The solution adopts a six-step strategy: text normalization, named entity recognition (NER), Chinese word segmentation and part-of-speech (POS) tagging, theme classification, SRL, and slot filling. For each step, a novel deep neural network-based model is designed and optimized, particularly for smart phone applications. Experiment results on all the NLP sub-tasks of the solution show that the proposed neural networks achieve state-of-the-art performance with the minimal computational cost. The speed advantage of deep neural networks makes them more competitive for large-scale applications or applications requiring real-time response, highlighting the potential of the proposed solution for practical NLP systems.

Key words: deep learning, sequence labeling, natural language understanding, convolutional neural network, recurrent neural network

ZHENG Xiaoqing, CHEN Jun, SHANG Guoqiang. Deep Neural Network-Based Chinese Semantic Role Labeling[J]. ZTE Communications, 2017, 15(S2): 58-64.

Figures/Tables 8

References 25

[1]	N. W. Xue , “Labeling Chinese predicates with semantic roles,” Computational Linguistics, vol. 34, no. 2, pp. 225-255, 2008. doi: 10.1162/coli.2008.34.2.225.
[2]	M. Surdeanu, S. Harabagiu, J. Williams, P. Aarseth , “Using predicate-argument structures for information extraction,” in Proc. The Annual Meeting of the Association for Computational Linguistics, Sapporo, Japan, Jul. 2003, pp. 8-15. doi: 10.3115/1075096.1075098.
[3]	G. Melli, Y. Wang, Y. Liu , et al., “Descrition of SQUASH, the SFU question and summery handler for the DUC-2005 summarization task,” in Document Understanding Conference, 2005.
[4]	H. C. Boas , “Bilingual framenet dictionaries for machine translation,” in Proc. International Conference on Language Resources and Evaluation, Jan. 2002, pp. 1364-1371.
[5]	S. Narayanan, S. Harabagiu , “Question answering based on semantic structures,” in Proc. 20th International Conference on Computational Linguistics, Geneva, Switzerland, Aug. 2004. doi: 10.3115/1220355.1220455.
[6]	S. Pradhan, W. Ward, K. Hacioglu, J. H. Martin , “Semantic role labeling using different syntactic views,” in Proc. International Conference on Computational Linguistics, USA, Michigan, Jun. 2005, pp. 581-588. doi: 10.3115/1219840.1219912.
[7]	J. L. Packard , The Morphology of Chinese: a Linguistic and Cognitive Approach, Cambridge, United Kingdom: Cambridge University Press, 2004.
[8]	N. Xue and M. Palmer , “Annotating the propositions in the Penn Chinese Treebank,” in Proc. The 2nd SIGHAN Workshop on Chinese Language Processing, Sapporo, Japan, Jul. 2003, pp. 47-54. doi: 10.3115/1119250.1119257.
[9]	Y. Bengio , “Learning deep architectures for AI,” Foundations and Trends in Machine Learning, vol. 2, no. 1, pp. 1-127, 2009. doi: 10.1561/2200000006.
[10]	G. F. Luger , Artificial Intelligence: Structures and Strategies for Complex Problem Solving 5th edition. New Jersey, USA: Addison Wesley, 2004.
[11]	H. T. Ng, J. K. Lou , “Chinese part-of-speech tagging: one-at-a-time or all-at-once? word-based or character-based?” in Proc. Conference on Empirical Methods in Natural Language Processing, Jul. 2004, pp. 277-284.
[12]	R. Collobert, J. Weston, L. Bottou , et al., “Natural language processing (almost) from scratch,” Journal of Machine Learning Research, vol. 12, no. 1, pp. 2493-2537, 2011.
[13]	X. Zheng, H. Chen, T. Xu , “Deep learning for Chinese word segmentation and POS tagging,” in Proc. Conference on Empirical Methods in Natural Language Processing, Oct. 2013, pp. 647-657.
[14]	N. Kalchbrenner, E. Grefenstette, P. Blunsom , “A convolutional neural network for modelling sentences,” in Proc. Annual Meeting of the Association for Computational Linguistics, Apr. 2014, pp. 655-665.
[15]	S. Hochreiter, J. Schmidhuber , “Long short-term memory,” Neural Computation, vol. 9, no. 8, pp. 1735-1780, 1997. doi: 10.1162/neco.1997.9.8.1735.
[16]	Z. Huang, W. Xu, K. Yu. (2015, Aug.). Bidirectional LSTM-CRF models for sequence tagging [Online]. Available: http://arxiv.org/abs/1508.01991v1
[17]	A. Graves, A. Mohamed, G. Hinton. (2013, Mar.). Speech recognition with deep recurrent neural networks [Online]. Available: http://arxiv.org/abs/1303.5778
[18]	M. Collins , “Discriminative training methods for hidden Markov models: theory and experiments with perceptron algorithms,” in Proc. Conference on Empirical Methods in Natural Language Processing, 2002, pp. 1-8. doi: 10.3115/1118693.1118694.
[19]	R. Socher, C. C-Y. Lin, A. Y. Ng, and D. Christopher , “Parsing natural scenes and natural language with recursive neural networks,” in Proc. International Conference on Machine Learning, Washington, USA, Jun. 2011, pp. 129-136.
[20]	W. Pei, T. Ge, B. Chang , “Max-margin tensor neural network for Chinese word segmentation,” in Proc. The Annual Meeting of the Association for Computational Linguistics, 2014, doi: 10.3115/v1/P14-1028.
[21]	D. Erhan, Y. Bengio, A. Courville, P. Manzagol , et al., “Why does unsupervised pre-training help deep learning,” Journal of Machine Learning Research, vol.11, pp. 625-660, 2010.
[22]	T. Mikolov, K. Chen, G. Corrado, J. Dean. (2013, Jan.). “Efficient estimation of word representations in vector spaces [Online]. Available: http://arxiv.org/abs/1301.3781
[23]	G. A. Levow , “The third international Chinese language processing bakeoff: word segmentation and named entity recognition,” in Proc. SIGHAN Workshop on Chinese Language Processing, 2006, pp. 108-117.
[24]	R. T. Tsai, H. C. Hung, C. Sung, H. Dai , et al., “On closed task of Chinese word segmentation: an improved CRF model coupled with character clustering and automatically generated template matching,” in Proc. The Fifth SIGHAN Workshop on Chinese Language Processing, 2006, pp. 108-117.
[25]	H. Zhao, C. N. Huang, M. Li , “An improved Chinese word segmentation system with conditional random field,” in Proc. The Fifth SIGHAN Workshop on Chinese Language Processing, 2006, pp. 162-165.

Words	Labels
明天 “tomorrow”	DATE
上午九点 “9 am”	TIME
在 “at”	O
第一会议室 “meeting room No. 1”	LOCATION
我们 “we”	O
与 “with”	O
技术部 “technology section”	PARTICIPANT
开会 “have a meeting”	O
讨论 “discuss”	O
项目进展 “the progress of the project”	TOPIC
。 “.”	O

Words	Labels
明天 “tomorrow”	DATE
上午九点 “9 am”	TIME
在 “at”	O
第一会议室 “meeting room No. 1”	LOCATION
我们 “we”	O
与 “with”	O
技术部 “technology section”	PARTICIPANT
开会 “have a meeting”	O
讨论 “discuss”	O
项目进展 “the progress of the project”	TOPIC
。 “.”	O

Task	Goal	Model
Word segmentation (F1)	~ 85	≥ 90
POS tagging (F1)	~ 80	≥ 88
Named entity recognition (F1)	~ 75	≥ 84
SRL (F1)	~ 70	≥ 80

Task	Goal	Model
Word segmentation (F1)	~ 85	≥ 90
POS tagging (F1)	~ 80	≥ 88
Named entity recognition (F1)	~ 75	≥ 84
SRL (F1)	~ 70	≥ 80

System	Parameters	Time(ms)
Tsai et al. [24]	3027k	602
Zhao et al. [25]	3711k	859
Neural network	459k	49