Tabular methods for reinforcement learning
Updated Jul 3, 2020 - Python
This repository contains the code for automatically generating piano fingerings using a reinforcement learning agent that uses Q-Learning.
Open-zero is a research project that aims to reimplement various projects from DeepMind.
The objective is to teach a robot to find and reach a target object in the minimum number of steps, taking the shortest path and avoiding obstacles such as humans and walls, using reinforcement learning algorithms.
Turn-based strategy game with AI
Implementation of Q-learning to solve GridWorld
A reinforcement learning agent with reflection capabilities for dynamic maze navigation. Implements dual memory system, real-time adaptation, and environment change detection. Open source with research papers and documentation.
Mastering complex card games like GuanDan (掼蛋) and DouDiZhu (斗地主) using self-play RL and sequence modeling with zero domain knowledge
The implementation for the paper Continuous Deep Q-Learning in Optimal Control Problems: Normalized Advantage Functions Analysis // NeurIPS 2022
Two intelligent agents (cat and mouse) compete with each other to achieve their goal. Agents are trained through reinforcement learning (Q-learning).
Demonstration of the Q-Learning and SARSA algorithms using Python and OpenAI Gym
A Python-based platformer infused with Q-Learning and dynamic level creation from simple JSON files.
A docking robot in a grid environment, trained with Q-learning
Q-Learning applied to Gymnasium's Toy Text environments: FrozenLake, CliffWalking, Blackjack, and Taxi.
Code for the AISTATS 2023 paper, A Statistical Analysis of Polyak-Ruppert Averaged Q-learning.
SUTD 50.021 Artificial Intelligence Project - Wordle Solver using Reinforcement Learning
Made with the Gym package from the Farama Foundation, this project is a highly detailed walkthrough of Q-Learning on the Frozen Lake game.
The objective of this repository is RL-based (Q-learning) navigation of a TurtleBot3, in which goals are sent as sequences.
Python projects for Introduction to Artificial Intelligence course at Warsaw University of Technology.