Open Access System for Information Sharing

Login Library

 

Article
Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

변분 추론의 증거 하한을 사용한 강화학습 SCOPUS KCI

Title
변분 추론의 증거 하한을 사용한 강화학습
Authors
백종찬한수희
Date Issued
2022-11
Publisher
제어·로봇·시스템학회
Abstract
This paper presents a practical method of designing reinforcement learning (RL) algorithms with the evidence lower bound (ELBO) of variational inference (VI). The proposed approach provides opportunities to easily employ the existing results of supervised and unsupervised learning for reinforcement learning. By linking the likelihood functions with the state-action-value functions reasonably, we design the machine learning algorithms in a unified frame. As a special application of ELBO-based RL algorithms, network sparsification is introduced, which is achieved by employing a sparsity-induced regularization term. To help the overall understanding and gain physical insights, a schematic view is provided. A quadrotor is then employed to validate the proposed method.
URI
https://oasis.postech.ac.kr/handle/2014.oak/117936
DOI
10.5302/J.ICROS.2022.22.0184
ISSN
1976-5622
Article Type
Article
Citation
제어.로봇.시스템학회 논문지, vol. 28, no. 11, page. 981 - 985, 2022-11
Files in This Item:
There are no files associated with this item.

qr_code

  • mendeley

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Views & Downloads

Browse