Monitoring your training stats
Last updated
Last updated
You can now upload your RL Swarm logs to and easily monitor your system stats (such as GPU Utilization
, GPU Temperature
), and training stats (such as loss
, learning_rate
, and rewards
).
First, make sure you're running the of rl-swarm
.
Once you stop the rl_swarm.sh
process in your console (e.g., by pressing Ctrl+C), you will see a message similar to this:
To upload your training statistics:
Make sure you have created an account on .
Copy the wandb sync command provided in your terminal (the part that looks like wandb sync logs/wandb/offline-run-xxxxxxxx_xxxxxx-xxxxxxxxxx
).
Run that command in your terminal.
When prompted, enter your API key that can be found in .
This will upload your local training run data to the Weights & Biases cloud, allowing you to visualize and track your experiments. For more details on this command, you can refer to the .