Survey on Sim-to-real Transfer Reinforcement Learning in Robot Systems

doi:10.13328/j.cnki.jos.007006

微信服务号

微信订阅号

Home > Archive>Volume 35, Issue 2, 2024 >711-738. DOI:10.13328/j.cnki.jos.007006

PDF HTML XML Export Cite reminder

Survey on Sim-to-real Transfer Reinforcement Learning in Robot Systems
DOI:
                        10.13328/j.cnki.jos.007006
                    
Author:
                        
                        
                    
Affiliation:
Clc Number:
Fund Project:

Article

Figures

Metrics

Reference

Cited by

Materials

Comments

Abstract:

In recent years, reinforcement learning methods based on environmental interactions have achieved great success in robotic applications, providing a practical and feasible solution for optimizing the behavior control strategies of robots. However, collecting interactive samples in the real world can lead to problems such as high cost and low efficiency. Therefore, the simulation environment is widely used in the training process of robot reinforcement learning. By obtaining a large number of training samples at a low cost in the virtual simulation environment for strategy training and transferring learning strategies to the real world, the security, reliability, and real-time problems in the real robot training process can be alleviated. However, due to the difference between the simulation environment and the real environment, it is often difficult to obtain ideal performance when directly transferring the strategy trained in the simulation environment to the real robot. To solve this problem, sim-to-real transfer reinforcement learning methods are proposed to reduce the environmental gap, so as to achieve effective strategy transfer. According to the direction of information flow in the process of transfer reinforcement learning and the different objects targeted by intelligent methods, this survey first proposes a sim-to-real transfer reinforcement learning framework, based on which the existing related work is then divided into three categories: the model optimization methods focusing on the real environment, the knowledge transfer methods focusing on the simulation environment, and the iterative policy promotion methods focusing on both simulation and real environments. Then, the representative technologies and related work in each category are described. Finally, the opportunities and challenges in this field are briefly discussed.

Reference

Cited by

Get Citation

林谦,余超,伍夏威,董银昭,徐昕,张强,郭宪.面向机器人系统的虚实迁移强化学习综述.软件学报,2024,35(2):711-738

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:January 13,2023
Revised:June 22,2023
Adopted:
Online: November 08,2023
Published:

You are the firstVisitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-4
Address：4# South Fourth Street, Zhong Guan Cun, Beijing 100190,Postal Code：100190
Phone：010-62562563 Fax：010-62562533 Email：jos@iscas.ac.cn
Technical Support：Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063

微信服务号

微信订阅号

Get Citation

Share

Article Metrics

History