融合文本概念化与网络表示的观点检索

doi:10.13328/j.cnki.jos.005548

微信服务号

微信订阅号

2025年4月10日 22:26 星期四

首页 > 过刊浏览>2018年第29卷第10期 >2899-2914. DOI:10.13328/j.cnki.jos.005548

PDF HTML阅读 XML下载导出引用引用提醒

融合文本概念化与网络表示的观点检索
DOI:
                        10.13328/j.cnki.jos.005548
                    
CSTR:
                        
                    
作者:
                        廖祥文廖祥文
福州大学 数学与计算机科学学院, 福建 福州 350116;福建省网络计算与智能信息处理重点实验室(福州大学), 福建 福州 350116
在期刊界中查找
在百度中查找
在本站中查找
刘德元刘德元
福州大学 数学与计算机科学学院, 福建 福州 350116;福建省网络计算与智能信息处理重点实验室(福州大学), 福建 福州 350116
在期刊界中查找
在百度中查找
在本站中查找
桂林桂林
福州大学 数学与计算机科学学院, 福建 福州 350116;福建省网络计算与智能信息处理重点实验室(福州大学), 福建 福州 350116
在期刊界中查找
在百度中查找
在本站中查找
程学旗程学旗
网络数据科学与技术重点实验室(中国科学院), 北京 100190
在期刊界中查找
在百度中查找
在本站中查找
陈国龙陈国龙
福州大学 数学与计算机科学学院, 福建 福州 350116;福建省网络计算与智能信息处理重点实验室(福州大学), 福建 福州 350116
在期刊界中查找
在百度中查找
在本站中查找

                    
作者单位:
作者简介:廖祥文(1980-),男,福建安溪人,博士,副教授,CCF高级会员,主要研究领域为文本倾向性检索与挖掘;刘德元(1992-),男,硕士生,主要研究领域为知识图谱,观点检索;桂林(1987-),男,博士,主要研究领域为自然语言处理;程学旗(1971-),男,博士,教授,博士生导师,CCF会士,主要研究领域为网络科学与社会计算,互联网搜索与挖掘;陈国龙(1965-),男,博士,教授,博士生导师,CCF高级会员,主要研究领域为计算智能,计算机网络.
通讯作者:桂林,guilin.nlp@gmail.com
中图分类号:
基金项目:国家自然科学基金（61772135，U1605251）；中国科学院网络数据科学与技术重点实验室开放基金（CASNDST 201708，CASNDST201606）；可信分布式计算与服务教育部重点实验室主任基金（2017KF01）；福建省自然科学基金（2017J01755）；赛尔网络下一代互联网技术创新项目（NGⅡ20160501）

Opinion Retrieval Method Combining Text Conceptualization and Network Embedding

Author:

LIAO Xiang-Wen
LIAO Xiang-Wen
College of Mathematics and Computer Science, Fuzhou University, Fuzhou 350116, China;Fujian Provincial Key Laboratory of Networking Computing and Intelligent Information Processing(Fuzhou University), Fuzhou 350116, China
在期刊界中查找
在百度中查找
在本站中查找
LIU De-Yuan
LIU De-Yuan
College of Mathematics and Computer Science, Fuzhou University, Fuzhou 350116, China;Fujian Provincial Key Laboratory of Networking Computing and Intelligent Information Processing(Fuzhou University), Fuzhou 350116, China
在期刊界中查找
在百度中查找
在本站中查找
GUI Lin
GUI Lin
College of Mathematics and Computer Science, Fuzhou University, Fuzhou 350116, China;Fujian Provincial Key Laboratory of Networking Computing and Intelligent Information Processing(Fuzhou University), Fuzhou 350116, China
在期刊界中查找
在百度中查找
在本站中查找
CHENG Xue-Qi
CHENG Xue-Qi
Key Laboratory of Network Data Science and Technology(The Chinese Academy of Sciences), Beijing 100190, China
在期刊界中查找
在百度中查找
在本站中查找
CHEN Guo-Long
CHEN Guo-Long
College of Mathematics and Computer Science, Fuzhou University, Fuzhou 350116, China;Fujian Provincial Key Laboratory of Networking Computing and Intelligent Information Processing(Fuzhou University), Fuzhou 350116, China
在期刊界中查找
在百度中查找
在本站中查找

Affiliation:

Fund Project:

National Natural Science Foundation of China (61772135, U1605251); Open Project of Key Laboratory of Network Data Science & Technology of the Chinese Academy of Sciences (CASNDST201708, CASNDST201606); Director's Project Fund of Key Laboratory of Trustworthy Distributed Computing and Service (BUPT), Ministry of Education (2017KF01); Natural Science Foundation of Fujian Province of China (2017J01755); CERNET Innovation Project (NGⅡ20160501)

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

观点检索是自然语言处理领域中的一个热点研究课题.现有的观点检索模型在检索过程中往往无法根据上下文将词汇进行知识、概念层面的抽象，在语义层面忽略词汇之间的语义联系，观点层面缺乏观点泛化能力.因此，提出一种融合文本概念化与网络表示的观点检索方法.该方法首先利用知识图谱分别将用户查询和文本概念化到正确的概念空间，并利用网络表示将知识图谱中的词汇节点表示成低维向量，然后根据词向量推出查询和文本的向量，并用余弦公式计算用户查询与文本的相关度，接着引入基于统计机器学习的分类方法挖掘文本的观点.最后，利用概念空间、网络表示空间以及观点分析结果构建特征，并服务于观点检索模型.相关实验结果表明，所提出的检索模型可以有效提高多种检索模型的观点检索性能.其中，基于统一相关模型的观点检索方法在两个实验数据集上相比于基准方法，在MAP评价指标上分别提升了6.1%和9.3%，基于排序学习的观点检索方法在两个实验数据集上相比于基准方法，在MAP评价指标上分别提升了2.3%和14.6%.

关键词:信息检索;观点检索;知识图谱;文本概念化;网络表示

Abstract:

Opinion retrieval is a hot topic in the research of natural language processing. Most existing approaches in text opinion retrieval can not extract knowledge and concept from context. They also lack opinion generalization ability and overlook the semantic relations between words. This paper proposes an opinion retrieval method based on knowledge graph conceptualization and network embedding. First, conceptual knowledge graph is used to conceptualize the queries and texts into the correct conceptual space while the nodes in the knowledge graph are embedded into low dimensional vectors space by network embedding technology. Then, the similarity between queries and texts is calculated based on embedding vectors. According to the similarity score, the opinion scores of texts can be captured based on statistical machine learning methods. Finally, the concept space, knowledge representation space, and opinion mining result serve opinion retrieval models. The experiment shows that the retrieval model proposed in this paper can effectively improve the retrieval performance of multiple retrieval models. Compared with referenced method based on unified opinion, the proposed approach improves the MAP scores by 6.1% and 9.3%, respectively. Compared with referenced method based on learning to rank, proposed approach improves the MAP scores by 2.3% and 14.6%, respectively.

Key words:information retrieval;opinion retrieval;knowledge graph;text conceptualization;network embedding

引用本文

廖祥文,刘德元,桂林,程学旗,陈国龙.融合文本概念化与网络表示的观点检索.软件学报,2018,29(10):2899-2914

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2017-07-20
最后修改日期:2017-11-08
录用日期:
在线发布日期: 2018-02-08
出版日期:

微信服务号

微信订阅号

引用本文

分享

文章指标

历史

文章二维码

微信服务号

微信订阅号

引用本文

分享

微信扫一扫：分享

文章指标

历史

文章二维码