Bipedal Walker using DQN
-
Updated
Jun 26, 2024 - Python
Bipedal Walker using DQN
Use Gymnasium to develop a reinforcement learning agent. First Semester of the Third Year of the Bachelor's Degree in Artificial Intelligence and Data Science.
Automated LLM-Guided Reinforcement Learning Testbed. This project leverages the modern BipedalWalker-v3 environment from Gymnasium to orchestrate a continuous cycle of agent training and intelligent reward shaping. By combining Stable Baselines3's PPO algorithm with the reasoning capabilities of Large Language Models (LLMs)
Reinforcement Learning and Deeep reinforcement Learning
Teaching an bipedal bot how to walk using a TD3 algorithm (variant of Reinforcement Learning - Actor & Critic method)
This project implements agent training using the Proximal Policy Optimization (PPO) algorithm in the BipedalWalker-v3 environment at two difficulty levels: normal and hardcore. The model's performance is evaluated based on rewards collected during the training process.
Add a description, image, and links to the bipedalwalker-v3 topic page so that developers can more easily learn about it.
To associate your repository with the bipedalwalker-v3 topic, visit your repo's landing page and select "manage topics."