Recognition Method Based on Deep Learning for Chinese Textual Entailment Chunks and Labels

doi:10.13328/j.cnki.jos.005885


Journal of Software, Volume 31, Issue 12, 2020, pp. 3772-3786

Author:
YU Dong, JIN Tian-Hua, XIE Wan-Ying, ZHANG Yi, XUN En-Dong

Affiliation:
College of Information Science, Beijing Language and Culture University, Beijing 100083, China

Fund Project: National Key Research and Development Program of China (2018YFB1005105)

Abstract:

Recognizing textual entailment (RTE) is the task of determining whether two sentences stand in an entailment relationship. In recent years, RTE for English has made great progress. However, current research mainly focuses on judging the relation type and pays little attention to locating the language chunks that give rise to the entailment relationship, which leaves RTE models with low interpretability. This study selects 12 000 Chinese entailment sentence pairs from the Chinese Natural Language Inference (CNLI) data and labels the chunks that lead to their entailment relationship. Seven entailment types are then summarized, taking Chinese linguistic features into account. On this basis, two tasks are proposed: one is to classify each entailment sentence pair into one of the seven entailment types, and the other is to recognize the boundaries of the entailment chunks in the pair. The proposed deep learning based method reaches accuracies of 69.19% and 62.09% on the two tasks, respectively. The experimental results show that the proposed approaches can effectively identify different types of entailment in Chinese and find the boundaries of the entailment chunks, which demonstrates that the proposed model provides a reliable benchmark for further research.
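
To make the second task more concrete, chunk boundary recognition can be viewed as token-level sequence labeling. The following is only a minimal sketch under stated assumptions: it assumes a BIO tagging scheme (the abstract does not say which scheme the authors use), and `bio_to_spans` is a hypothetical helper name introduced here for illustration, not part of the paper's method.

```python
from typing import List, Tuple


def bio_to_spans(tags: List[str]) -> List[Tuple[int, int]]:
    """Convert a BIO tag sequence into half-open (start, end) chunk spans.

    Assumption: "B" opens a chunk, "I" continues it, and any other tag
    (e.g. "O") lies outside a chunk.
    """
    spans: List[Tuple[int, int]] = []
    start = None  # index where the currently open chunk began, if any
    for i, tag in enumerate(tags):
        if tag == "B":
            # A new chunk begins; close the previous one first.
            if start is not None:
                spans.append((start, i))
            start = i
        elif tag == "I":
            # Tolerate an "I" with no preceding "B" by opening a chunk here.
            if start is None:
                start = i
        else:
            # Outside any chunk: close the open chunk, if there is one.
            if start is not None:
                spans.append((start, i))
                start = None
    # Close a chunk that runs to the end of the sequence.
    if start is not None:
        spans.append((start, len(tags)))
    return spans


# Hypothetical tag sequence for a six-token sentence.
example_tags = ["O", "B", "I", "O", "B", "I"]
example_spans = bio_to_spans(example_tags)
```

Half-open spans are used so that `tokens[start:end]` recovers the chunk directly; a predicted boundary can then be compared with an annotated one by span equality.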

Key words: recognizing textual entailment; chunk labeling; deep learning


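Citation:

YU Dong, JIN Tian-Hua, XIE Wan-Ying, ZHANG Yi, XUN En-Dong. Recognition Method Based on Deep Learning for Chinese Textual Entailment Chunks and Labels. Journal of Software, 2020, 31(12): 3772-3786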


History

Received: April 02, 2019
Revised: June 05, 2019
Online: December 03, 2020
Published: December 06, 2020
