변분 추론의 증거 하한을 사용한 강화학습
SCOPUS
KCI
- Title
- 변분 추론의 증거 하한을 사용한 강화학습
- Authors
- 백종찬; 한수희
- Date Issued
- 2022-11
- Publisher
- 제어·로봇·시스템학회
- Abstract
- This paper presents a practical method of designing reinforcement learning (RL) algorithms with the evidence lower bound (ELBO) of variational inference (VI). The proposed approach provides opportunities to easily employ the existing results of supervised and unsupervised learning for reinforcement learning. By linking the likelihood functions with the state-action-value functions reasonably, we design the machine learning algorithms in a unified frame. As a special application of ELBO-based RL algorithms, network sparsification is introduced, which is achieved by employing a sparsity-induced regularization term. To help the overall understanding and gain physical insights, a schematic view is provided. A quadrotor is then employed to validate the proposed method.
- URI
- https://oasis.postech.ac.kr/handle/2014.oak/117936
- DOI
- 10.5302/J.ICROS.2022.22.0184
- ISSN
- 1976-5622
- Article Type
- Article
- Citation
- 제어.로봇.시스템학회 논문지, vol. 28, no. 11, page. 981 - 985, 2022-11
- Files in This Item:
- There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.