国家自然科学基金(61836007, 61772354, 61773276); 江苏高校优势学科建设工程项目
篇章结构分析旨在理解文章的整体结构及其各部分之间的语义联系. 作为自然语言处理的研究热点, 近年来篇章结构分析研究发展迅速. 首先总结英语和汉语中篇章结构分析理论, 然后介绍相关篇章语料库及其计算模型的研究. 在此基础上, 梳理了当前英语、汉语中篇章结构分析的相关工作脉络, 构建了篇章结构分析研究框架, 归纳总结出当前研究的趋势和热点. 然后, 简要介绍篇章结构在下游任务中的应用. 最后, 指出当前汉语篇章结构分析存在的问题与挑战, 为今后的研究提供指导和帮助.
Discourse structure analysis aims to understand the overall structure of an article and the semantic relationships between its various parts. As a research hotspot of natural language processing, it has developed rapidly in recent years. This study first summarizes the mainstream discourse structure analysis theories in English and Chinese and then introduces the research on the popular English and Chinese discourse corpora as well as their calculation models. On this basis, this study surveys the current work context of discourse structure analysis in Chinese and English and constructs its research framework. Moreover, the current research trends and focuses are summarized, and the application of discourse structure in downstream tasks is introduced briefly. Finally, this study points out the issues and challenges in the current Chinese discourse structure analysis to provide guidance and help for future research.