Legacy Environments
An archive of legacy RL Swarm environments, including Reasoning Gym, which has been replaced by CodeZero.
Overview
Before CodeZero, RL Swarm used an early research environment called Reasoning Gym.
Reasoning Gym focused on math and logic tasks verified by symbolic correctness checks. It provided a foundation for distributed reinforcement learning research and demonstrated the viability of peer-to-peer RL training.
Deprecation
Reasoning Gym is now deprecated and archived. It has been replaced by CodeZero.
All current nodes now run CodeZero automatically. No manual migration is required beyond running git pull to update. Existing nodes keep the same network, identity, and connection structure during the transition.
Archived Resources
Reasoning Gym Repository: Original implementation and research codebase
Migration Notes
If you have documentation or scripts that reference Reasoning Gym, note the following:
Node setup: No changes required. CodeZero uses the same installation process.
Identity files: swarm.pem files remain compatible.
Network connection: Same peer-to-peer protocol and gossip mechanism.
The transition from Reasoning Gym to CodeZero is transparent to end users, as both environments run on the same RL Swarm infrastructure.
RL Swarm's architecture is designed to support multiple environments.
As new environments are developed, they will be documented here alongside CodeZero.