Download

Skill UI

Browse and discover 6283+ curated skills

All Development Artificial Intelligence Design & Creative Product & Business Data Science Marketing Soft Skills Productivity Engineering Languages

Search Prompt Evaluation , found 5 results

Default Newest Most Downloaded

Langfuse Evaluation Workflow

langfuse-core-workflow-b

jeremylongshore/claude-code-plugins-plus-skills

Use Langfuse Core Workflow B to collect manual feedback, run automated evaluation functions, operate LLM-as-judge scoring, and compare prompts for AI quality scoring.

LLM Evaluation Toolkit

sickn33/antigravity-awesome-skills

Guides comprehensive evaluation of LLM applications with automated metrics, human reviews, and LLM-as-judge patterns to benchmark prompts, detect regressions, and validate deployments.

LLM Prompt Engineering

prompt-engineer

Jeffallan/claude-skills

Designs, refines, and evaluates prompts for LLM workflows, covering zero/few-shot, chain-of-thought, system personas, structured outputs, and testable evaluation suites to boost accuracy and efficiency.

Prompt Engineering Toolkit

senior-prompt-engineer

alirezarezvani/claude-skills

Senior Prompt Engineer centralizes prompt optimization, RAG evaluation, and agent orchestration so teams can analyze token usage, build few-shot examples, and validate structured workflows for LLM projects.

AI Engineering Toolkit

ai-engineering-toolkit

sickn33/antigravity-awesome-skills

Provides six structured AI engineering workflows—from prompt evaluation and context budgeting to RAG architecture, security audits, evaluation harnesses, and product sense coaching—ensuring repeatable, CI/CD-friendly decisions before deployment.

1

Language