ppo-algorithm

Here are 27 public repositories matching this topic...

VachanVY / Reinforcement-Learning

PyTorch implementations of algorithms from "Reinforcement Learning: An Introduction" by Sutton and Barto, along with various RL research papers.

reinforcement-learning deep-reinforcement-learning pytorch artificial-intelligence dqn policy-gradient deep-deterministic-policy-gradient ddpg-algorithm proximal-policy-optimization actor-critic-algorithm dqn-pytorch rl-book sutton-barto-book policy-gradient-with-baseline actor-critic-pytorch soft-actor-critic-continuous ppo-algorithm reinforcement-learning-an-introduction

Updated Aug 14, 2025
Python

Vitao2 / Hollow-Knight-Neural-Network

A C# Unity mod connected to Python through a named pipe, used to train a reinforcement learning agent to fight Hornet Protector in Hollow Knight.

reinforcement-learning hollow-knight hollow-knight-mod ppo-algorithm

Updated Apr 9, 2026
Python

amin-sharifi-github / quant-rl-trading-agent

End-to-end RL trading framework with PPO agent, self-attention neural network, custom Gym environment, and advanced backtesting.

reinforcement-learning ai algotrading reinforcement-learning-algorithms trading-algorithms quantitative-finance attention-mechanism quantitative-trading backtesting trading-systems gym-environment reinforcement-learning-agent financial-machine-learning quantitative-research market-simulation stable-baselines3 ppo-algorithm

Updated Aug 6, 2025
Python

negarhonarvar / DeepReinforcementLearning

A collection of well-known deep RL algorithms implemented in Gymnasium's most popular environments.

dqn boltzmann-exploration sarsa lunar-lander cartpole-v1 d3qn swimmer softmax-exploration drl-algorithms ppo-algorithm gymnasium-environment

Updated Apr 13, 2025
Python

Ruchit-Gaurh / AI-Traffic-Management-System

🚦 Next-generation AI Traffic Management System with real-time computer vision, reinforcement learning optimization, emergency vehicle detection, and immersive 3D visualization

Updated Oct 14, 2025
Python

mturan33 / isaaclab-anymal-locomotion

A legged locomotion project

ppo anymal isaacsim isaac-sim locomation legged-locomotion ppo-algorithm isaac-lab isaaclab anymal-c

Updated Nov 29, 2025
Python

RongzheZhao2R2-lab / Implementing-Core-LLM-Algorithms-from-Scratch

This repository is dedicated to implementing algorithms "From Scratch". It goes beyond simple API calls, diving deep into the underlying logic of everything from basic training to cutting-edge techniques like DeepSeek-R1.

moe knowledge-distillation multimodal-learning alignment-algorithm rag mixture-of-experts rlhf ppo-algorithm grpo

Updated Nov 26, 2025
Python

Abhinav0710rajput / rl_llm_multiturn_cmdp

Constrained Clarification: Training LLM Agents to Ask Better Questions Under Budget Constraints

reinforcement-learning constrained-optimization cmdp ppo-algorithm llm-post-training

Updated May 1, 2026
Python

zxy-tech / ppo-for-S-P-500-trading-strategy

A project applying PPO to S&P 500 trading.

time-series-forecasting stockprediction stocktrader ppo-algorithm

Updated Mar 10, 2025
Python

green-hat-001 / NASA-Space-Apps-Commercialising-LEO-by-OptimAI

2D orbital rocket sim with PPO in PyTorch. Models thrust, drag, gravity, fuel; agent learns efficient ascent. Includes telemetry & visualization

ai python3 rocketry ppo-algorithm

Updated Dec 23, 2025
Python

omerjakoby / MARIO-RL-PPO

This repository implements a Proximal Policy Optimization (PPO) agent that learns to play Super Mario Bros using TensorFlow/Keras and OpenAI Gym. Features CNNs for vision, Actor-Critic architecture, and parallel environments. Train your own Mario master or run a pre-trained one!

machine-learning tensorflow keras openai-gym cnn actor-critic mario-game proximal-policy-optimization ppo reinforcement-learning-agent ppo-algorithm

Updated Dec 12, 2025
Python

unaizaahmedk / Balancing-Inverted-Pendulum-using-RL

Reinforcement learning–based controller for balancing an inverted pendulum using Proximal Policy Optimization (PPO). Supports configurable mass, length, and gravity settings (Earth, lunar, microgravity) with automated training logs, reward visualization, and performance analysis.

reinforcement-learning openai-gym reinforcement-learning-algorithms inverted-pendulum ppo-algorithm

Updated Mar 3, 2026
Python

MarGo-20 / isaaclab-anymal-locomotion

🐾 Implements Proximal Policy Optimization (PPO) for quadruped locomotion, achieving 96% of RSL-RL's performance with a custom solution for enhanced robot control.

ppo anymal isaacsim isaac-sim locomation legged-locomotion ppo-algorithm isaac-lab isaaclab

Updated May 22, 2026
Python

mafaldaaires / Reinforcement-Learning

Stable Baselines3

gymnasium a2c-algorithm car-racing-environment ppo-algorithm

Updated Dec 26, 2023
Python

sanatren / Legal-Document-Analyzer

This Legal Document Analyzer is a proof-of-concept NLP project demonstrating the potential of transformers for legal document summarization.

deep-learning transformer bart reinforcement-learning-algorithms byte-pair-encoding huggingface ppo-algorithm finetuning-transformers

Updated Jun 8, 2025
Python

AmudhanManimaran / AutoHeal_Autonomous-Server-Remediation-via-PPO-Reinforcement-Learning

python reinforcement-learning pytorch tensorboard self-healing anomaly-detection stable-baselines3 ppo-algorithm gymnasium-environment

Updated Apr 26, 2026
Python

Lekssz / rl_trading_agent

Multi-modal RL trading agent (CNN + PPO) integrating market prices, macroeconomic indicators, and news signals. MSc dissertation artefact.

python machine-learning reinforcement-learning deep-learning cnn pytorch supervised-learning policy-gradient quantitative-finance feature-engineering algorithmic-trading macroeconomics news-analysis financial-time-series ppo-algorithm

Updated Jan 31, 2026
Python

Anca-Mt / TrackmaniaRL-AI

AI agents for Trackmania using the TMRL package. Implements DDPG, PPO, and two SAC variants (with one or two critics) to train cars to navigate custom-built tracks.

python ai reinforcement-learning-algorithms game-ai ddpg-algorithm ppo-algorithm sac-algorithm tmrl tmrl-package modern-game-ai

Updated Aug 20, 2024
Python

Degik / ClimatePredictor

ClimatePredictor, implemented using Proximal Policy Optimization (PPO) with the Ray framework in a federated learning approach.

distributed-systems reinforcement-learning raylib federated-learning noaa-data ppo-algorithm

Updated Feb 9, 2025
Python

praani193 / Reinforcement-Learning

A work-in-progress variant of PPO, implemented in a predefined JVRC robot simulation with SLM integration.

reinforcement-learning python3 slm ppo-algorithm

Updated Dec 16, 2025
Python
