[关键词]
[摘要]
近年来,各种各样的推荐算法层出不穷,特别是深度学习的发展,极大地推动了推荐系统的研究.然而,各个推荐算法在实现细节、评价方式、数据集处理等方面存在众多差异,越来越多的研究者开始对推荐领域的可复现性产生担忧.为了帮助缓解上述问题,基于PyTorch实现了一个综合、高效、易扩展的轻量级推荐算法框架ReChorus,意为构建一个推荐算法的“合唱团”.ReChorus框架中实现了多种不同类型的推荐算法,类别涵盖常规推荐、序列推荐、引入知识图谱的推荐、引入时间动态性的推荐等;同时,对于一些常见的数据集也提供统一的预处理范式.相比其他推荐系统库,ReChorus在保证综合高效的基础上尽可能做到了轻量实用,同时具有较高的可扩展性,尤其以方便学术研究为导向,非常容易上手实现新的模型.不同的推荐算法在ReChorus框架中能够在相同的实验设定下进行训练和评测,从而实现推荐算法间的有效对比.该项目目前已在GitHub发布:https://github.com/THUwangcy/ReChorus.
[Key word]
[Abstract]
In recent years, many recommendation algorithms have been proposed, and the research of recommender system has been greatly boosted with the development of deep learning. However, concerns about the reproducibility in this field have increasingly arisen in the research community, owing to the slight but influential differences between recommendation algorithms, such as implementation details, evaluation protocols, dataset splitting, etc. To address this issue, ReChorus is presented, of which it is a comprehensive, efficient, flexible, and lightweight framework for recommendation algorithms based on PyTorch, with aims to form a “Chorus” of recommendation algorithms. In this framework, a wide range of recommendation algorithms of different categories is implemented, covering general recommendation, sequential recommendation, knowledge-aware recommendation, and time-aware recommendation. ReChorus also provides the paradigm of dataset preprocessing for some common datasets. Compared to other recommendation algorithm libraries, ReChorus is featured for that it strives to keep lightweight while unifies as many as different algorithms at the same time. ReChorus is also flexible, efficient, and easy to use, especially for research purposes. Researchers will find it effortless to implement new algorithms with ReChorus. Such a framework can help to train and evaluate different recommendation models under the same experimental setting, so as to avoid the impacts resulting from implementation details and assure an effective comparison among recommendation algorithms. The project has been released on GitHub: https://github.com/THUwangcy/ReChorus.
[中图分类号]
[基金项目]
国家重点研发计划(2018YFC0831900);国家自然科学基金(61672311,61532011,62002191);清华大学国强研究院资助