Skill UI
Browse and discover 9688+ curated skills
Search: FP8, found 3 results
Enterprise RL Model Training
miles-rl-training
Orchestra-Research/AI-Research-SKILLs
211
Guides production-grade miles RL training for massive MoE models, covering FP8/INT4 low precision, train-inference alignment, and speculative rollout workflows to maximize throughput in enterprise deployments.
Flash Attention Optimizer
optimizing-attention-flash
Orchestra-Research/AI-Research-SKILLs
55
Flash Attention Optimizer delivers a 2-4x speedup and 10-20x memory savings for transformer attention. It is suited to training or running long-context models that hit GPU memory or latency limits, and it supports PyTorch's native scaled_dot_product_attention, the flash-attn library, FP8 precision on H100, and sliding-window attention.
TensorRT LLM Optimizer
tensorrt-llm
Orchestra-Research/AI-Research-SKILLs
334
Optimizes large language model inference on NVIDIA GPUs with TensorRT, delivering 10-100x higher throughput, quantized precision (FP8/INT4), multi-GPU scaling, and serving-ready tooling for production deployments.