Title |
Policy-based Deep Reinforcement Learning for Sparse Reward Environment |
Authors |
김명섭(MyeongSeop Kim) ; 김정수(Jung-Su Kim) |
DOI |
https://doi.org/10.5370/KIEE.2021.70.3.506 |
Keywords |
Reinforcement Learning; Sparse Reward Problem |
Abstract |
Sparse reward environment is the main problems encountered by reinforcement learning. When there are many specific tasks that the agent must go through to reach the final goal, the reward signal becomes very sparse in the environment. And this situation makes reinforcement learning less effective. To overcome this, we give the agent an intrinsic reward to induce the agent to explore more. With this reward setting, the agent can continue to search for reward signal and learn another action that is better than the best action which is currently known. In this paper, we describe the implementation of the proposed method and estimate its performance. For the learning algorithm, we use Proximal Policy Optimization(PPO) and train the agent in a distributed environment. The agent is trained to solve the game of Tetris that is a representative sparse reward problem. |