CHANG Yao-Cheng
School of Computer Science and Technology, Civil Aviation University of China, Tianjin 300300, ChinaZHANG Yu-Xiang
School of Computer Science and Technology, Civil Aviation University of China, Tianjin 300300, ChinaWANG Hong
School of Computer Science and Technology, Civil Aviation University of China, Tianjin 300300, ChinaWAN Huai-Yu
School of Computer and Information Technology, Beijing Jiaotong University, Beijing 100044, ChinaXIAO Chun-Jing
School of Computer Science and Technology, Civil Aviation University of China, Tianjin 300300, ChinaNational Natural Science Foundation of China (U1533104, U1633110, 61603028); Fundamental Research Funds for the Central Universities (ZXH2012P009)
Keyphrases that efficiently represent the main topics discussed in a document are widely used in various document processing tasks, and automatic keyphrase extraction has been one of fundamental problems and hot research issues in the field of natural language processing (NLP). Although automatic keyphrase extraction has received a lot of attention and the extraction technologies have developed quickly, the state-of-the-art performance on this task is far from satisfactory. In order to help to solve the keyphrase extraction problem, this paper presents a survey of the latest development in keyphrase extraction, mainly including candidate keyphrase generation, feature engineering and keyphrase extraction models. In addition, some published datasets are listed, the evaluation approaches are analyzed, and the challenges and trends of automatic keyword extraction techniques are also discussed. Different from the existing surveys that mainly focus on the models of keyphrase extraction, this paper provides a features oriented survey of automatic keyphrase extraction. This perspective may help to utilize the existing features and propose the new effective extraction approaches.
常耀成,张宇翔,王红,万怀宇,肖春景.特征驱动的关键词提取算法综述.软件学报,2018,29(7):2046-2070
Copy