Complex Entity Recognition Based on Prior Semantic Knowledge and Type Embedding
Author: Jiang Xiaobo, He Kun, Yan Guangyu
CLC Number: TP18
    Abstract:

    Entity recognition is a key task in information extraction. As information extraction technology has developed, researchers have shifted their focus from recognizing simple entities to recognizing complex ones. Complex entities usually lack explicit features and are more complicated in syntactic construction and part of speech, which makes their recognition a great challenge. In addition, existing models widely use span-based methods to identify nested entities; as a result, they often suffer from ambiguity in detecting entity boundaries, which degrades recognition performance. To address these problems, this study proposes GIA-2DPE, an entity recognition model based on prior semantic knowledge and type embedding. The model uses keyword sequences of entity categories as prior semantic knowledge to improve the cognition of entities and uses type embedding to capture the latent features of different entity types; the two are then combined through a gated interactive attention mechanism to assist in the recognition of complex entities. Moreover, the model uses 2D probability encoding to predict entity boundaries, combining boundary features with contextual features to sharpen boundary detection and thereby improve nested entity recognition. Extensive experiments on seven English datasets and two Chinese datasets show that GIA-2DPE outperforms state-of-the-art models, achieving a 10.4% F1 boost over the baseline on the ScienceIE dataset.
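    The two core mechanisms named in the abstract, gated interactive fusion of prior knowledge with type features and 2D probability encoding of span boundaries, can be sketched in NumPy. Everything below (the shapes, the gate parameterisation `Wg`, the additive span scorer) is an illustrative assumption, not the paper's actual architecture.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

# Toy dimensions: a 6-token sentence with 8-dimensional features.
seq_len, d = 6, 8
H = rng.normal(size=(seq_len, d))  # contextual token representations
K = rng.normal(size=(seq_len, d))  # prior-knowledge (category keyword) features
T = rng.normal(size=(seq_len, d))  # entity-type embedding features

# Gated interactive fusion: a learned gate decides, per dimension, how much
# prior knowledge and type information flows into each token representation
# (Wg stands in for learned parameters; the mixing rule is hypothetical).
Wg = rng.normal(size=(3 * d, d))
g = sigmoid(np.concatenate([H, K, T], axis=-1) @ Wg)
fused = g * H + (1.0 - g) * 0.5 * (K + T)

# 2D probability encoding: score every candidate span (i, j) with i <= j;
# cell (i, j) holds the probability that tokens i..j form an entity, so
# nested spans occupy independent cells of the grid.
w_start = rng.normal(size=d)
w_end = rng.normal(size=d)
grid = sigmoid((fused @ w_start)[:, None] + (fused @ w_end)[None, :])
grid *= np.triu(np.ones((seq_len, seq_len)))  # zero out spans with i > j
```

    Because each (start, end) pair gets its own probability cell, overlapping and nested spans can both exceed a decision threshold independently, which is how a 2D grid avoids the boundary ambiguity of flat span enumeration.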

Get Citation

Jiang XB, He K, Yan GY. Complex entity recognition based on prior semantic knowledge and type embedding. Journal of Software, 2023, 34(12): 5649–5669 (in Chinese).

History
  • Received: December 02, 2021
  • Revised: February 25, 2022
  • Online: February 15, 2023
  • Published: December 06, 2023
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-4