An archive of legacy RL Swarm environments, including Reasoning Gym, which has been replaced by CodeZero.
Overview
Before CodeZero, RL Swarm used an early research environment called Reasoning Gym.
Reasoning Gym focused on math and logic tasks verified by symbolic correctness checks. It provided a foundation for distributed reinforcement learning research and demonstrated the viability of peer-to-peer RL training.
Deprecation
This environment is now deprecated and archived.
This environment has been deprecated and replaced by CodeZero.
All current nodes now run CodeZero automatically, and no manual migration is required beyond a simple git pull command to update. Existing nodes retain their same network, identity, and connection structure during the transition.