Codes and Notes

“Success is not final, failure is not fatal: it is the courage to continue that counts.”

Algorithms Implementation for Reinforcement Learning

Implementation with codes (updating)

Basic:

/ REINFORCE-1 / REINFORCE-2 / Actor-Critic (AC) - 1 / AC - 2

Advanced:

DQN / Double DQN / Dueling DQN / h-DQN / DDPG

/ A3C / TRPO / PPO / TD3 / Soft AC

Multi-Agent RL:

MADDPG / COMA / QMIX / MAPPO

Environments:

Softwares:

PyTorch (Best!) / Keras (suitable for beginners in Machine Learning) / TensorFlow2 (for hard-core TensorFlow1 users and PyTorch non-likers)

Reinforcement Learning Notes

"He who refuses to do arithmetic is doomed to talk nonsense."

--John McCarthy--

--Yoav Shoham, Rob Powers & Trond Grenager--