stable-baselines3
K-Dense-AI/claude-scientific-skills
Stable Baselines3 (SB3) is a PyTorch-based library of reliable implementations of standard reinforcement learning algorithms, including PPO, SAC, and DQN. It trains single-agent RL models, accepts custom Gymnasium environments, and runs several environment copies in parallel through vectorized environments. It is well suited to rapid prototyping and to controlled RL experiments.