Context

I think we all recognize brax environments are useful for very fast RL training. How do we create custom envs which are useful?

Setup

  • Prerequisites
  • Repo or package links
  • Environment details

Core steps

  1. Define environment state and action space.
  2. Implement reset and step transitions.
  3. Register the environment.
  4. Connect it to your training loop.

Common issues

  • Symptom: …
  • Cause: …
  • Fix: …

References

  • Link 1
  • Link 2

Next actions

  • Follow-up task 1
  • Follow-up task 2