hugging-face-model-trainer

Category

AI & Machine Learning

Developer

huggingface

Updated

Jan 2026

Tags

2 total

Description

Use this skill when users want to train or fine-tune language models with TRL (Transformer Reinforcement Learning) on Hugging Face Jobs infrastructure. It covers the SFT, DPO, GRPO, and reward modeling training methods, plus GGUF conversion for local deployment. It includes guidance on the TRL Jobs package, UV scripts in PEP 723 format, dataset preparation and validation, hardware selection, cost estimation, Trackio monitoring, Hub authentication, and model persistence. Invoke it for tasks involving cloud GPU training or GGUF conversion, or when users mention training on Hugging Face Jobs without a local GPU setup.

Skill File

SKILL.md
This skill should be used when users want to train or fine-tune language models using TRL (Transformer Reinforcement Learning) on Hugging Face Jobs infrastructure. Covers SFT, DPO, GRPO and reward modeling training methods, plus GGUF conversion for local deployment. Includes guidance on the TRL Jobs package, UV scripts with PEP 723 format, dataset preparation and validation, hardware selection, cost estimation, Trackio monitoring, Hub authentication, and model persistence. Should be invoked for tasks involving cloud GPU training, GGUF conversion, or when users mention training on Hugging Face Jobs without local GPU setup.

Tags

Ai, Ui

Information

Developer: huggingface
Category: AI & Machine Learning
Created: Jan 14, 2026
Updated: Jan 14, 2026
