experiment-queue
wanshuiyin/Auto-claude-code-research-in-sleep
A job queue for orchestrating multi-seed, multi-configuration machine learning experiments on remote GPU clusters. It handles wave transitions, dependency chaining (such as teacher-student runs), out-of-memory (OOM) retries, and cleanup of stale screen sessions, so that long experiment batches can run to completion without manual intervention.
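
To make the terms above concrete, here is a minimal in-process sketch of wave scheduling, dependency chaining, and OOM retries. It is not the skill's actual implementation: `Job`, `OOMError`, `run_queue`, and the halve-the-batch-size retry policy are all hypothetical names and assumptions introduced for illustration, and nothing here touches a remote cluster or screen sessions.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


class OOMError(RuntimeError):
    """Hypothetical stand-in for a GPU out-of-memory failure."""


@dataclass
class Job:
    name: str
    run: Callable[[int], str]  # takes a batch size, returns a result label
    batch_size: int = 64
    depends_on: List[str] = field(default_factory=list)


def run_queue(jobs: List[Job], max_retries: int = 2) -> Dict[str, str]:
    """Run jobs in waves; a job joins a wave once all its dependencies finished."""
    results: Dict[str, str] = {}
    pending = list(jobs)
    while pending:
        # A wave is every pending job whose dependencies are already done.
        wave = [j for j in pending if all(d in results for d in j.depends_on)]
        if not wave:
            raise ValueError("unsatisfiable or circular dependencies")
        for job in wave:
            batch = job.batch_size
            for attempt in range(max_retries + 1):
                try:
                    results[job.name] = job.run(batch)
                    break
                except OOMError:
                    if attempt == max_retries:
                        raise
                    # Assumed retry policy: halve the batch size and try again.
                    batch = max(1, batch // 2)
            pending.remove(job)
    return results


def teacher(batch: int) -> str:
    # Simulate an OOM failure for any batch size above 32.
    if batch > 32:
        raise OOMError("simulated OOM")
    return f"teacher@{batch}"


def student(batch: int) -> str:
    return f"student@{batch}"


results = run_queue([
    Job("student", student, depends_on=["teacher"]),  # listed first, runs second
    Job("teacher", teacher),
])
print(results)  # {'teacher': 'teacher@32', 'student': 'student@64'}
```

In this sketch the teacher job fails once at batch size 64, succeeds at 32, and only then does the student job become eligible for the next wave. A real queue would dispatch each wave to remote GPUs in parallel instead of running jobs one after another.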