Abstract: Machine translation is the task of converting text in one natural language into another. In recent years, neural machine translation models based on the sequence-to-sequence architecture have outperformed traditional statistical machine translation models on many language pairs and have been adopted by numerous translation service providers. Although commercial deployments show that neural machine translation brings substantial improvements, systematically evaluating its translation quality remains challenging. On the one hand, evaluating translation quality against reference texts is expensive, because high-quality references are costly to obtain. On the other hand, neural machine translation models exhibit more pronounced robustness problems than statistical machine translation models, yet studies on the robustness of neural machine translation are still lacking. This study proposes MGMT, a multi-granularity testing framework based on metamorphic testing, which can evaluate the robustness of neural machine translation systems without reference translations. The framework first replaces parts of the source sentence at sentence, phrase, and word granularity; it then compares the translations of the original and replaced sentences based on their constituency parse trees; finally, it judges whether the results satisfy the metamorphic relations. Experiments are conducted on multi-domain Chinese-English translation datasets, evaluating six industrial neural machine translation systems, and the results are compared with metamorphic testing methods of the same type as well as with reference-based evaluation methods. The experimental results show that MGMT exceeds similar methods by 80% in Pearson's correlation coefficient and by 20% in Spearman's correlation coefficient. This indicates that the reference-free evaluation method proposed in this study correlates more strongly with reference-based evaluation, verifying that MGMT's evaluation accuracy is significantly better than that of other methods of the same type.