Login
Download
Skill UI
Browse and discover
6283+
curated skills
All
Development
Artificial Intelligence
Design & Creative
Product & Business
Data Science
Marketing
Soft Skills
Productivity
Engineering
Languages
Search
Prompt Evaluation
, found
5
results
Default
Newest
Most Downloaded
Langfuse Evaluation Workflow
langfuse-core-workflow-b
jeremylongshore/claude-code-plugins-plus-skills
400
Use Langfuse Core Workflow B to collect manual feedback, run automated evaluation functions, operate LLM-as-judge scoring, and compare prompts for AI quality scoring.
View Details
LLM Evaluation Toolkit
llm-evaluation
sickn33/antigravity-awesome-skills
205
Guides comprehensive evaluation of LLM applications with automated metrics, human reviews, and LLM-as-judge patterns to benchmark prompts, detect regressions, and validate deployments.
View Details
LLM Prompt Engineering
prompt-engineer
Jeffallan/claude-skills
248
Designs, refines, and evaluates prompts for LLM workflows, covering zero/few-shot, chain-of-thought, system personas, structured outputs, and testable evaluation suites to boost accuracy and efficiency.
View Details
Prompt Engineering Toolkit
senior-prompt-engineer
alirezarezvani/claude-skills
124
Senior Prompt Engineer centralizes prompt optimization, RAG evaluation, and agent orchestration so teams can analyze token usage, build few-shot examples, and validate structured workflows for LLM projects.
View Details
AI Engineering Toolkit
ai-engineering-toolkit
sickn33/antigravity-awesome-skills
171
Provides six structured AI engineering workflows—from prompt evaluation and context budgeting to RAG architecture, security audits, evaluation harnesses, and product sense coaching—ensuring repeatable, CI/CD-friendly decisions before deployment.
View Details
1
Language
简体中文
English