Login
Download
Skill UI
Browse and discover
9688+
curated skills
All
Development
Artificial Intelligence
Design & Creative
Product & Business
Data Science
Marketing
Soft Skills
Productivity
Engineering
Languages
Search
Multi-node Clusters
, found
3
results
Default
Newest
Most Downloaded
Ray Data Processing
ray-data
Orchestra-Research/AI-Research-SKILLs
76
Ray Data offers scalable distributed data processing for ML workloads, streaming across CPU/GPU clusters and integrating with PyTorch, TensorFlow, and Ray Train. Use it for batch inference, ETL, and multi-modal preprocessing from a laptop up to hundreds of nodes.
View Details
Vast.ai Distributed Training and Cost Management
vastai-core-workflow-b
jeremylongshore/claude-code-plugins-plus-skills
439
This advanced workflow orchestrates complex, multi-node GPU clusters on Vast.ai. It is designed for large-scale distributed machine learning training, automatically handling spot instance preemption and resuming jobs from checkpoints. Furthermore, it provides comprehensive cost analysis by tracking billing history, allowing users to optimize GPU spending and manage cluster teardown efficiently.
View Details
CoreWeave Distributed GPU Training Workflow
coreweave-core-workflow-b
jeremylongshore/claude-code-plugins-plus-skills
142
This guide details how to set up and execute large-scale, distributed GPU training jobs on the CoreWeave platform. It covers both single-node multi-GPU setups and multi-node training utilizing PyTorch DDP and Kubernetes resources (A100/H100). This workflow is ideal for fine-tuning large language models (LLMs) or running complex deep learning models that require high-performance, shared-storage compute clusters.
View Details
1
Language
简体中文
English