A simple stochastic OpenAI environment for training RL agents
Updated Feb 8, 2023 (Python)
OpenAI's PPO baseline applied to the classic game of Snake
Force any OpenAI-compatible or Anthropic API-compatible tool (Claude Code, Qwen Code, Aider, Fabric, Interpreter) to use Gemini, Groq, Cerebras, or Ollama. Pure Bash. Zero dependencies.
OpenAI environment for CTO domain
Custom Environments for training RL Agents