Download

Skill UI

Browse and discover 9688+ curated skills

All Development Artificial Intelligence Design & Creative Product & Business Data Science Marketing Soft Skills Productivity Engineering Languages

Search FP8 , found 3 results

Default Newest Most Downloaded

Enterprise RL Model Training

miles-rl-training

Orchestra-Research/AI-Research-SKILLs

Guides production-grade miles RL training for massive MoE models, covering FP8/INT4 low precision, train-inference alignment, and speculative rollout workflows to maximize throughput in enterprise deployments.

Flash Attention Optimizer

optimizing-attention-flash

Orchestra-Research/AI-Research-SKILLs

Flash Attention optimizer delivers 2-4x speedup and 10-20x memory savings for transformer attention, ideal when training or running long-context models that hit GPU memory or latency limits; supports PyTorch native scaled_dot_product_attention, the flash-attn library, H100 FP8 precision, and sliding-window attention.

TensorRT LLM Optimizer

Orchestra-Research/AI-Research-SKILLs

Optimizes large language model inference on NVIDIA GPUs with TensorRT, delivering 10-100× faster throughput, quantized precision (FP8/INT4), multi-GPU scaling, and serving-ready tooling for production deployments.

1

Language