AI & Machine Learning
L
Install Command
claude skill add wshobson/agentsDescription
Implement comprehensive evaluation strategies for LLM applications using automated metrics, human feedback, and benchmarking. Use when testing LLM performance, measuring AI application quality, or establishing evaluation frameworks.
Tags
EvaluationLLMMetricsBenchmarking
Information
Developerwshobson
CategoryAI & Machine Learning
CreatedJan 15, 2026
UpdatedJan 15, 2026
You Might Also Like
B
art-master
自动生成艺术风格提示词,支持水墨画、油画、超现实、插画等多种艺术风格
B
openscad
Create and render OpenSCAD 3D models.
B
creating-financial-models
Advanced financial modeling suite with DCF analysis and more.
B
cli-e2e-testing
Guide for writing Aspire CLI end-to-end tests using Hex1b terminal automation.
B
blucli
BluOS CLI (blu) for discovery, playback, grouping, and volume.
B
async-repl-protocol
Agent Skill by parcadei