model-pruning

Reduce LLM size and accelerate inference using pruning techniques like Wanda and SparseGPT

Category

AI & Machine Learning

Developer

davila7

Updated

Jan 2026

Tags

1
Total

Description

Reduce LLM size and accelerate inference using pruning techniques like Wanda and SparseGPT. Use when compressing models without retraining, achieving 50% sparsity with minimal accuracy loss, or enabling faster inference on hardware accelerators. Covers unstructured pruning, structured pruning, N:M sparsity, magnitude pruning, and one-shot methods.
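
As a rough illustration of the N:M sparsity and magnitude pruning mentioned above, the sketch below keeps the 2 largest-magnitude weights in every group of 4 (a 2:4 pattern), which yields 50% sparsity. It is a minimal pure-Python example, not the skill's own code: the function names `prune_n_of_m` and `sparsity` and the sample weights are hypothetical. It ranks weights by magnitude only, whereas Wanda and SparseGPT use more informed importance scores.

```python
def prune_n_of_m(row, n=2, m=4):
    """Keep the n largest-magnitude weights in each group of m; zero the rest.

    Hypothetical helper for illustration; ranks by |w| only.
    """
    if len(row) % m != 0:
        raise ValueError("row length must be a multiple of m")
    pruned = []
    for start in range(0, len(row), m):
        group = row[start:start + m]
        # Indices of the n entries with the largest absolute value in this group.
        ranked = sorted(range(m), key=lambda i: abs(group[i]), reverse=True)
        keep = set(ranked[:n])
        pruned.extend(w if i in keep else 0.0 for i, w in enumerate(group))
    return pruned


def sparsity(row):
    """Fraction of entries that are exactly zero."""
    return sum(1 for w in row if w == 0.0) / len(row)


# Made-up weights for one row of a layer (two groups of four).
weights = [0.9, -0.1, 0.05, -1.2, 0.3, 0.31, -0.02, 0.0]
pruned = prune_n_of_m(weights)
print(pruned)            # [0.9, 0.0, 0.0, -1.2, 0.3, 0.31, 0.0, 0.0]
print(sparsity(pruned))  # 0.5
```

The fixed 2:4 layout is the structured pattern that some hardware accelerators can exploit. Unstructured pruning would instead rank weights across the whole row or matrix without the per-group constraint.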

Skill File

SKILL.md
Reduce LLM size and accelerate inference using pruning techniques like Wanda and SparseGPT. Use when compressing models without retraining, achieving 50% sparsity with minimal accuracy loss, or enabling faster inference on hardware accelerators. Covers unstructured pruning, structured pruning, N:M sparsity, magnitude pruning, and one-shot methods.

Tags

Ai

Information

Developer: davila7
Category: AI & Machine Learning
Created: Jan 15, 2026
Updated: Jan 15, 2026
