Download

Skill UI

Browse and discover 6581+ curated skills

All Development Artificial Intelligence Design & Creative Product & Business Data Science Marketing Soft Skills Productivity Engineering Languages

Search TRL , found 3 results

Default Newest Most Downloaded

TRL Reinforcement Fine-Tuning

fine-tuning-with-trl

Orchestra-Research/AI-Research-SKILLs

Provides TRL-based RLHF fine-tuning flows covering SFT, reward-model training, PPO, DPO, and GRPO so teams can align HuggingFace models with preferences using both pipeline scripts and CLI helpers.

GRPO RL Training

grpo-rl-training

Orchestra-Research/AI-Research-SKILLs

Provides expert guidance for implementing GRPO with TRL to fine-tune reasoning models, enforce structured outputs, and optimize custom reward functions for verifiable and multi-objective tasks.

TRL Hugging Face Trainer

hugging-face-model-trainer

sickn33/antigravity-awesome-skills

Orchestrates TRL-based fine-tuning or reward-model training on Hugging Face Jobs, submitting scripts via hf_jobs with Trackio monitoring and supporting SFT, DPO, GRPO plus GGUF export for deployment.

1

Language