NORMA eResearch @NCI Library

Optimization of Actor Critic Policy in Continuous Action Space

Sarkar, Simanta (2020) Optimization of Actor Critic Policy in Continuous Action Space. Masters thesis, Dublin, National College of Ireland.

[img]
Preview
PDF (Master of Science)
Download (831kB) | Preview
[img]
Preview
PDF (Configuration manual)
Download (520kB) | Preview

Abstract

The implementation of Reinforcement learning algorithms has made a huge impact on various problems where no existing methodologies has succeeded in control task and make decision. In this paper we are implementing a hybrid algorithm to virtual selfdriving car through collating the Actor-Critic and Proximal Policy Optimization (PPO) methods to introduce a continuous control tasks for locomotion of cars. Successful locomotion of a self-driving car can be achieved through angular movements of the steering by understanding the changes in environment where the actions like to take turns smoothly or throttle maps to continuous action space. The policy which maps input received from the sensors which causes change of action in cars is upgraded to achieve rewards. Due to these upgraded techniques the general policy-based methods have been improvised by the Actor-Critic method. The primary purpose of the research is to study the performance of the modified policy optimization techniques which enhances the interaction of the agent with the environment resulting in improved rewards in comparison with other policy-based methods. The testbeds used for the implementation of the modified algorithm are Cartpole and MountainCarContinuous. The modified actor-critic algorithm has yielded consistent policy update reducing the risk of learning a sudden irreversible bad policy.
Keywords: Reinforcement learning, Machine learning, Policy Gradient, Actor-Critic, PPO

Item Type: Thesis (Masters)
Subjects: Q Science > QA Mathematics > Electronic computers. Computer science
T Technology > T Technology (General) > Information Technology > Electronic computers. Computer science

Q Science > QA Mathematics > Computer software
T Technology > T Technology (General) > Information Technology > Computer software
Divisions: School of Computing > Master of Science in Data Analytics
Depositing User: Dan English
Date Deposited: 21 Jan 2021 10:24
Last Modified: 21 Jan 2021 10:24
URI: http://norma.ncirl.ie/id/eprint/4415

Actions (login required)

View Item View Item