1 min read
There are a lot of packages out there for deep Reinforcement Learning (RL), and so it became obvious to me that there needed to be yet another implementation! I’m calling it yarlp
, Yet Another Reinforcement Learning Package! I’m developing it mostly for educational purposes.
Here are several great implementations that I’ve learned and borrowed from:
yarlp
heavily uses tensorflow==1.3
and gym
, and is meant to be simple to run, yet very configurable. Here is an example running TRPO:
from yarlp.agent.trpo_agent import TRPOAgent
from yarlp.utils.env_utils import NormalizedGymEnv
from yarlp.model.networks import mlp
env = NormalizedGymEnv(
'MountainCarContinuous-v0',
normalize_obs=True)
agent = TRPOAgent(
env, discount_factor=0.99,
policy_network=mlp)
agent.train(100, 0, n_steps=1024)
I’m planning to benchmark TRPO on several environments and then move over to value-based methods.
Here are tons of other RL packages:
- tensorforce
- universe-starter-agent
- imitation
- modular_rl
- keras-rl
- reinforcement-learning
- python-rl
- rlpark
- AgentNet
- openai_lab
- deep-rl-tensorflow
- drop a comment to add more to the list!