CrackedAI
Problems Tracks Learn JAX Roadmap Articles
Log in Sign up
Problems Tracks Learn JAX Articles Roadmap
Log in Sign up
Radio

We can't find the internet

Attempting to reconnect

Something went wrong!

Attempting to reconnect

← All tracks

Optax

Gradient transforms, optimizer chains, schedules, weight decay, EMA, masking. The production optimizer library for JAX.

0 / 25 solved Continue →
  1. 1. ○ Optax SGD Step
  2. 2. ○ Optax SGD with Momentum
  3. 3. ○ Optax Adam Step
  4. 4. ○ AdamW with Decoupled Weight Decay
  5. 5. ○ Optax RMSprop Step
  6. 6. ○ Constant Schedule
  7. 7. ○ Linear Schedule
  8. 8. ○ Warmup + Cosine Decay
  9. 9. ○ Piecewise Constant Schedule
  10. 10. ○ Exponential Decay Schedule
  11. 11. ○ Chain: Clip + SGD
  12. 12. ○ Adam + Weight Decay (Chain)
  13. 13. ○ Global-Norm Gradient Clipping
  14. 14. ○ Lookahead Optimizer Wrapper
  15. 15. ○ multi_transform per-Param Group
  16. 16. ○ Optax EMA on Params
  17. 17. ○ Gradient Accumulation via MultiSteps
  18. 18. ○ masked: Apply WD Only To Certain Params
  19. 19. ○ inject_hyperparams for Runtime LR
  20. 20. ○ zero_nans for NaN-Safe Training
  21. 21. ○ Full Training Step (Loss + Grad + Update)
  22. 22. ○ Train Step with Frozen Params (Mask)
  23. 23. ○ Train Step with Global-Norm Clipping
  24. 24. ○ Train Step with Warmup Schedule
  25. 25. ○ 4-Step Training Loop with Scan + Loss Curve