llm-evaluation

Name: llm-evaluation
Author: wshobson

Implement comprehensive evaluation strategies for LLM applications.

Install Command

claude skill add wshobson/agents

Description

Implement comprehensive evaluation strategies for LLM applications using automated metrics, human feedback, and benchmarking. Use when testing LLM performance, measuring AI application quality, or establishing evaluation frameworks.
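As an illustration of the "automated metrics" part of this description, here is a minimal sketch in Python. It is not taken from the skill itself: the function names (normalize, exact_match, token_f1, evaluate) and the whitespace-based normalization are assumptions chosen for the example. It scores model outputs against reference answers with exact match and token-level F1.

```python
from collections import Counter


def normalize(text: str) -> list[str]:
    # Assumed normalization: lowercase and split on whitespace.
    return text.lower().split()


def exact_match(prediction: str, reference: str) -> float:
    # 1.0 when the normalized token sequences are identical, else 0.0.
    return 1.0 if normalize(prediction) == normalize(reference) else 0.0


def token_f1(prediction: str, reference: str) -> float:
    # Harmonic mean of token precision and recall (bag-of-words overlap).
    pred, ref = normalize(prediction), normalize(reference)
    if not pred or not ref:
        # Both empty counts as a match; one empty counts as a miss.
        return float(pred == ref)
    overlap = sum((Counter(pred) & Counter(ref)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred)
    recall = overlap / len(ref)
    return 2 * precision * recall / (precision + recall)


def evaluate(pairs: list[tuple[str, str]]) -> dict[str, float]:
    # Average each metric over (prediction, reference) pairs.
    if not pairs:
        raise ValueError("need at least one (prediction, reference) pair")
    n = len(pairs)
    return {
        "exact_match": sum(exact_match(p, r) for p, r in pairs) / n,
        "token_f1": sum(token_f1(p, r) for p, r in pairs) / n,
    }


if __name__ == "__main__":
    # Hypothetical model outputs paired with reference answers.
    samples = [("Paris", "paris"), ("the cat sat", "the cat")]
    print(evaluate(samples))
```

Metrics like these only capture surface overlap, so they would normally be paired with the human feedback and benchmarking the description also mentions.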

Information

Developer: wshobson

Category: AI & Machine Learning

Created: Jan 15, 2026

Updated: Jan 15, 2026

You Might Also Like

art-master

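Automatically generates art-style prompts, supporting ink wash painting, oil painting, surrealism, illustration, and other art styles.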

openscad

Create and render OpenSCAD 3D models.

creating-financial-models

Advanced financial modeling suite with DCF analysis and more.

cli-e2e-testing

Guide for writing Aspire CLI end-to-end tests using Hex1b terminal automation.

blucli

BluOS CLI (blu) for discovery, playback, grouping, and volume.

async-repl-protocol

Agent Skill by parcadei
