Login
Download
Skill UI
Browse and discover
5019+
curated skills
All
Development
Artificial Intelligence
Design & Creative
Product & Business
Data Science
Marketing
Soft Skills
Productivity
Engineering
Languages
Search
RLHF
, found
1
results
Default
Newest
Most Downloaded
TRL Reinforcement Fine-Tuning
fine-tuning-with-trl
Orchestra-Research/AI-Research-SKILLs
240
Provides TRL-based RLHF fine-tuning flows covering SFT, reward-model training, PPO, DPO, and GRPO so teams can align HuggingFace models with preferences using both pipeline scripts and CLI helpers.
View Details
1
Language
简体中文
English