huggingface-accelerate
Orchestra-Research/AI-Research-SKILLs
HuggingFace Accelerate adds distributed support to any PyTorch script with just four lines, providing a unified API for DDP, DeepSpeed, FSDP, and Megatron, plus automatic device placement, mixed precision, and simple launch/config commands for fast prototyping.