FENG Yao-Gong
School of Computer and Information Technology, Beijing Jiaotong University, Beijing 100044, China;Institute of artificial intelligence, Beijing Jiaotong University, Beijing 100044, ChinaYU Jian
School of Computer and Information Technology, Beijing Jiaotong University, Beijing 100044, China;Institute of artificial intelligence, Beijing Jiaotong University, Beijing 100044, ChinaSANG Ji-Tao
School of Computer and Information Technology, Beijing Jiaotong University, Beijing 100044, China;Institute of artificial intelligence, Beijing Jiaotong University, Beijing 100044, ChinaYANG Peng-Bo
School of Computer and Information Technology, Beijing Jiaotong University, Beijing 100044, China;Institute of artificial intelligence, Beijing Jiaotong University, Beijing 100044, ChinaNational Key Research and Development Program of China (2017YFC1703506); National Natural Science Foundation of China (61632004, 61832002, 61672518); Fundamental Research Funds for the Central Universities (2020YJS030, 2018JBZ006, 2019JBZ110)
Zero-shot learning aims to recognize the unseen classes by using the knowledge of the seen classes that has been learned. In recent years, ‘knowledge+data driven’ has become a new trend but lacking of unified definition of ‘knowledge’ in the current zero-shot tasks of computer vision. This study tries to define the ‘knowledge’ in this field and divided it into three categories, which are primary knowledge, abstract knowledge, and external knowledge. In addition, based on the definition and classification of knowledge, the current works on zero-shot learning (mainly in image classification task) are sorted out, they are divided into zero-shot models based on primary knowledge, zero-shot models based on abstract knowledge, and zero-shot models based on external knowledge. This study also introduces the problems which are domain shift and hubness in this field, and further summarizes existing works based on the problems. Finally, the paper summarizes the datasets and knowledge bases that commonly used in image classification tasks, the evaluation criteria of image classification experiment and the experimental results of representative models. The future works are also summarized and prospected.
冯耀功,于剑,桑基韬,杨朋波.基于知识的零样本视觉识别综述.软件学报,2021,32(2):370-405
Copy