Fact Verification with Chinese Tabular Data Based on Capsule Heterogeneous Graph Attention Network

doi:10.13328/j.cnki.jos.006951

微信服务号

微信订阅号

2025-5-16- 2

Home > Archive>Volume 35, Issue 9, 2024 >4324-4345. DOI:10.13328/j.cnki.jos.006951

PDF HTML XML Export Cite reminder

Fact Verification with Chinese Tabular Data Based on Capsule Heterogeneous Graph Attention Network
DOI:
                        10.13328/j.cnki.jos.006951
                    
Author:
                        YANG PengYANG Peng
School of Computer Science and Engineering, Southeast University, Nanjing 211189, China;Key Laboratory of Computer Network and Information Integration, Ministry of Education (Southeast University), Nanjing 211189, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
ZHA Xian-YuZHA Xian-Yu
School of Computer Science and Engineering, Southeast University, Nanjing 211189, China;Key Laboratory of Computer Network and Information Integration, Ministry of Education (Southeast University), Nanjing 211189, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
ZHAO Guang-ZhenZHAO Guang-Zhen
School of Computer Science and Engineering, Southeast University, Nanjing 211189, China;Key Laboratory of Computer Network and Information Integration, Ministry of Education (Southeast University), Nanjing 211189, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
LIN XiLIN Xi
College of Computer and Data Science, Fuzhou University, Fuzhou 350108, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site

                    
Affiliation:
Clc Number:TP18
Fund Project:

Article

Figures

Metrics

Reference

Cited by

Materials

Comments

Abstract:

Fact verification is intended to check whether a textual statement is supported by a given piece of evidence. Due to the structural dependence and implicit content of tables, the task of fact verification with tables as the evidence still faces many challenges. Existing literature has either used logical expressions to parse statements based on tabular evidence or designed table-aware neural networks to encode statement-table pairs and thereby accomplish table-based fact verification tasks. However, these approaches fail to fully utilize the implicit tabular information behind the statements, which leads to the degraded inference performance of the model. Moreover, Chinese statements based on tabular evidence have more complex syntax and semantics, which also adds to the difficulties in model inference. For this reason, the study proposes a method of fact verification with Chinese tabular data based on the capsule heterogeneous graph attention network (CapsHAN). This method can fully understand the structure and semantics of statements. On this basis, the tabular information implied by the statements is mined and utilized to effectively improve the accuracy of table-based fact verification tasks. Specifically, a heterogeneous graph is constructed by performing syntactic dependency parsing and named entity recognition of statements. Subsequently, the graph is learned and understood by the heterogeneous graph attention network and the capsule graph neural network, and the obtained textual representation of the statements is sliced together with the textual representation of the encoded tables. Finally, the result is predicted. Further, this study also attempts to address the problem that the datasets of fact verification based on Chinese tables are scarce and thus unable to support the performance evaluation of table-based fact verification methods. For this purpose, the study transforms the mainstream English table-based fact verification datasets TABFACT and INFOTABS into Chinese and constructs a dataset that is based on the uniform content label (UCL) national standard and specifically tailored to the characteristics of Chinese tabular data. This dataset, namely, UCLDS, takes Wikipedia infoboxes as evidence of manually annotated natural language statements and labels them into three classes: entailed, contradictory, and neutral. UCLDS outperforms the traditional datasets TABFACT and INFOTABS in supporting both single-table and multi-table inference. The experimental results on the above three Chinese benchmark datasets show that the proposed model outperforms the baseline model invariably, demonstrating its superiority for Chinese table-based fact verification tasks.

Key words:table-based fact verification;heterogeneous graph attention network (HAN);capsule graph neural network (CapsGNN);dependency parsing;named entity recognition

Get Citation

杨鹏,查显宇,赵广振,林茜.基于胶囊异构图注意力网络的中文表格型数据事实验证.软件学报,2024,35(9):4324-4345

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:October 24,2022
Revised:March 06,2023
Adopted:
Online: August 23,2023
Published: September 06,2024

You are the firstVisitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-4
Address：4# South Fourth Street, Zhong Guan Cun, Beijing 100190,Postal Code：100190
Phone：010-62562563 Fax：010-62562533 Email：jos@iscas.ac.cn
Technical Support：Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063

微信服务号

微信订阅号

Get Citation

Share

微信扫一扫：分享

Article Metrics

History