Login
Download
Skill UI
Browse and discover
6581+
curated skills
All
Development
Artificial Intelligence
Design & Creative
Product & Business
Data Science
Marketing
Soft Skills
Productivity
Engineering
Languages
Search
TRL
, found
3
results
Default
Newest
Most Downloaded
TRL Reinforcement Fine-Tuning
fine-tuning-with-trl
Orchestra-Research/AI-Research-SKILLs
481
Provides TRL-based RLHF fine-tuning flows covering SFT, reward-model training, PPO, DPO, and GRPO so teams can align HuggingFace models with preferences using both pipeline scripts and CLI helpers.
View Details
GRPO RL Training
grpo-rl-training
Orchestra-Research/AI-Research-SKILLs
490
Provides expert guidance for implementing GRPO with TRL to fine-tune reasoning models, enforce structured outputs, and optimize custom reward functions for verifiable and multi-objective tasks.
View Details
TRL Hugging Face Trainer
hugging-face-model-trainer
sickn33/antigravity-awesome-skills
325
Orchestrates TRL-based fine-tuning or reward-model training on Hugging Face Jobs, submitting scripts via hf_jobs with Trackio monitoring and supporting SFT, DPO, GRPO plus GGUF export for deployment.
View Details
1
Language
简体中文
English