基于多任务预训练的AMR文本生成研究

doi:10.13328/j.cnki.jos.006207

微信服务号

微信订阅号

首页 > 过刊浏览>2021年第32卷第10期 >3036-3050. DOI:10.13328/j.cnki.jos.006207

PDF HTML阅读 XML下载导出引用引用提醒

基于多任务预训练的AMR文本生成研究
DOI:
                        10.13328/j.cnki.jos.006207
                    
作者:
                        
                        
                    
作者单位:
作者简介:徐东钦(1995-),男,硕士,主要研究领域为自然语言处理.
李军辉(1983-),男,博士,副教授,CCF专业会员,主要研究领域为自然语言处理.
朱慕华(1981-),男,博士,主要研究领域为自然语言处理.
周国栋(1967-),男,博士,教授,博士生导师,CCF杰出会员,主要研究领域为自然语言处理.
通讯作者:李军辉,E-mail:jhli@suda.edu.cn
中图分类号:TP18
基金项目:国家重点研发计划（2017YFB1002101）；国家自然科学基金（61876120）

Improving AMR-to-text Generation with Multi-task Pre-training

Author:

Affiliation:

Fund Project:

National Key Research and Development Project of China (2017YFB1002101); National Natural Science Foundation of China (61876120)

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

抽象语义表示（abstract meaning representation，简称AMR）文本生成的任务是给定AMR图，生成与其语义一致的文本.相关工作表明，人工标注语料的规模大小直接影响了AMR文本生成的性能.为了降低对人工标注语料的依赖，提出了基于多任务预训练的AMR文本生成方法.特别地，基于大规模自动标注AMR语料，提出与AMR文本生成任务相关的3个预训练任务，分别是AMR降噪自编码、句子降噪自编码以及AMR文本生成任务本身.此外，基于预训练模型，在朴素微调方法的基础上，进一步提出了基于多任务训练的微调方法，使得最终模型不仅适用于AMR文本生成，同时还适用于预训练任务.基于两个AMR标准数据集的实验结果表明：使用0.39M自动标注数据，提出的预训练方法能够大幅度提高AMR文本生成的性能，在AMR2.0和AMR3.0上分别提高了12.27和7.57个BLEU值，性能分别达到40.30和38.97.其中，在AMR2.0上的性能为目前报告的最优值，在AMR3.0上的性能为目前为止首次报告的性能.

Abstract:

Given an AMR (abstract meaning representation) graph, AMR-to-text generation aims to generate text with the same meaning. Related studies show that the performance of AMR-to-text severely suffers from the size of the manually annotated dataset. To alleviate the dependence on manually annotated dataset, this study proposes a novel multi-task pre-training for AMR-to-text generation. In particular, based on a large-scale automatic AMR dataset, three relevant pre-training tasks are defined, i.e., AMR denoising auto-encoder, sentence denoising auto-encoder, and AMR-to-text generation itself. In addition, to fine-tune the pre-training models, the vanilla fine-tuning method is further extended to multi-task learning fine-tuning, which enables the final model to maintain performance on both AMR-to-text and pre-training tasks. With the automatic dataset of 0.39M sentences, detailed experimentation on two AMR benchmarks shows that the proposed pre-training approach significantly improves the performance of AMR-to-text generation, with the improvement of 12.27 BLEU on AMR2.0 and 7.57 on AMR3.0, respectively. This greatly advances the state-of-the-art performance with 40.30 BLEU on AMR2.0 and 38.97 on AMR 3.0, respectively. To the best knowledge, this is the best result achieved so far on AMR 2.0 while AMR-to-text generation performance on AMR 3.0 is firstly reported.

参考文献

相似文献

引证文献

引用本文

徐东钦,李军辉,朱慕华,周国栋.基于多任务预训练的AMR文本生成研究.软件学报,2021,32(10):3036-3050

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2020-07-30
最后修改日期:2020-10-19
录用日期:
在线发布日期: 2020-12-02
出版日期: 2021-10-06

微信服务号

微信订阅号

引用本文

分享

文章指标

历史

文章二维码