Skip to content

Implementation of several RL algorithms on the CartPole-v1 environment.

License

Notifications You must be signed in to change notification settings

Cezarrr9/CartPole-RL

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

84 Commits
 
 
 
 
 
 
 
 

Repository files navigation

CartPole-RL

Overview

Hello everyone! This repository contains implementations of various reinforcement learning algorithms to solve the classic CartPole-v1 problem, a popular benchmark task in Reinforcement Learning. The CartPole-v1 problem involves balancing a pole on a cart that moves along a frictionless track. The goal is to prevent the pole from falling over by moving the cart to the left or right.

Environment

All the information about the environment can be found here: https://gymnasium.farama.org/environments/classic_control/cart_pole/.

Algorithms Implemented

The following reinforcement learning algorithms have been implemented and applied to solve the CartPole-v1 problem:

  • Q-Learning

  • SARSA

  • REINFORCE

  • DQN (Deep Q-Network)

  • DDQN (Double Deep Q-Network)

Results

The performance of each algorithm is depicted in the data folder. The performance is measured in how long the CartPole managed to stay up without falling. After a maximum of 500 steps, the episode terminates by itself for CartPole-v1 from the gymnasium library.

There are two types of figures. The figures named name_of_the_algorithm.png depict the algorithm's performance when seed = 1 is used. They show the raw performance, the average of the last 100 steps and the average of the previous steps at each point. The other figures show the algorithm's average performance across 5 seeds (1, 2, 3, 5 and 8) using a dark blue line. In these figures, the 95% confidence interval for the mean is also represented by the light blue shades.

About

Implementation of several RL algorithms on the CartPole-v1 environment.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages