Login
Download
Skill UI
Browse and discover
6069+
curated skills
All
Development
Artificial Intelligence
Design & Creative
Product & Business
Data Science
Marketing
Soft Skills
Productivity
Engineering
Languages
Search
Bench
, found
13
results
Default
Newest
Most Downloaded
Agent Evaluation CLI
agent-eval
affaan-m/everything-claude-code
207
Lightweight CLI lets teams benchmark coding agents (Claude Code, Aider, Codex, etc.) on reproducible YAML tasks, isolating runs via git worktrees while recording pass rate, cost, time, and consistency.
View Details
Prev
1
2
Language
简体中文
English