Legacy Environments

An archive of legacy RL Swarm environments, including Reasoning Gym, which has been replaced by CodeZero.

Overview

Before CodeZero, RL Swarm used an early research environment called Reasoning Gym.

Reasoning Gym focused on math and logic tasks verified by symbolic correctness checks. It provided a foundation for distributed reinforcement learning research and demonstrated the viability of peer-to-peer RL training.

Deprecation

This environment is now deprecated and archived.

This environment has been deprecated and replaced by CodeZero.

All current nodes now run CodeZero automatically, and no manual migration is required beyond a simple git pull command to update. Existing nodes retain their same network, identity, and connection structure during the transition.

Archived Resources

Reasoning Gym Repository: Original implementation and research codebase

Migration Notes

If you have documentation or scripts referencing Reasoning Gym:

Node setup: No changes required. CodeZero uses the same installation process.
Identity files: swarm.pem files remain compatible.
Network connection: Same peer-to-peer protocol and gossip mechanism.

The transition from Reasoning Gym to CodeZero is transparent to end users, as both environments run on the same RL Swarm infrastructure.

RL Swarm's architecture is designed to support multiple environments.

As new environments are developed, they will be documented here alongside CodeZero.

PreviousCodeZero NextGetting Started

Last updated 4 months ago

hashtagOverview

hashtagDeprecation

hashtagArchived Resources

hashtagMigration Notes

Overview

Deprecation

Archived Resources

Migration Notes