Legacy Environments

An archive of legacy RL Swarm environments, including Reasoning Gym, which has been replaced by CodeZero.

Overview

Before CodeZero, RL Swarm used an early research environment called Reasoning Gym.

Reasoning Gym focused on math and logic tasks verified by symbolic correctness checks. It provided a foundation for distributed reinforcement learning research and demonstrated the viability of peer-to-peer RL training.

Deprecation

This environment has been deprecated and replaced by CodeZero.

All current nodes now run CodeZero automatically, and no manual migration is required beyond a simple git pull command to update. Existing nodes retain their same network, identity, and connection structure during the transition.

Archived Resources

Migration Notes

If you have documentation or scripts referencing Reasoning Gym:

  • Node setup: No changes required. CodeZero uses the same installation process.

  • Identity files: swarm.pem files remain compatible.

  • Network connection: Same peer-to-peer protocol and gossip mechanism.

The transition from Reasoning Gym to CodeZero is transparent to end users, as both environments run on the same RL Swarm infrastructure.

Last updated