stable-baselines3
K-Dense-AI/scientific-agent-skills
Stable Baselines3 is a PyTorch-based library providing production-ready implementations of core reinforcement learning algorithms, including PPO, SAC, DQN, and TD3. It offers a scikit-learn-like API for easily training RL agents, developing custom Gymnasium environments, and managing complex training workflows. Ideal for standard RL research, rapid prototyping, and advanced model optimization.