agentic-eval
github/awesome-copilot
Provides patterns and techniques for building robust evaluation loops for AI agents. This skill guides the implementation of self-critique, evaluator-optimizer pipelines, and test-driven refinement workflows. Use it to improve the accuracy and quality of complex outputs such as code, reports, and analyses through structured, iterative improvement cycles (Generate → Evaluate → Critique → Refine).
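The Generate → Evaluate → Critique → Refine cycle can be illustrated with a minimal Python sketch. This is not the skill's own implementation: `refine_loop`, `Evaluation`, and the `toy_*` functions are hypothetical names introduced here, and the toy functions stand in for what would normally be model calls or test runs.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class Evaluation:
    """Hypothetical result of one evaluation pass."""
    score: float   # 0.0 (worst) to 1.0 (best)
    critique: str  # feedback handed to the refine step


def refine_loop(
    task: str,
    generate: Callable[[str], str],
    evaluate: Callable[[str, str], Evaluation],
    refine: Callable[[str, str, str], str],
    threshold: float = 0.8,
    max_iterations: int = 3,
) -> Tuple[str, List[Evaluation]]:
    """Generate once, then evaluate and refine until the score reaches
    the threshold or the iteration budget is spent."""
    output = generate(task)
    history: List[Evaluation] = []
    for attempt in range(max_iterations):
        evaluation = evaluate(task, output)
        history.append(evaluation)
        # Stop on success, or on the last attempt so the returned
        # output is always the one that was most recently evaluated.
        if evaluation.score >= threshold or attempt == max_iterations - 1:
            break
        output = refine(task, output, evaluation.critique)
    return output, history


# Toy stand-ins (assumptions for illustration only): a "report" is
# considered complete when it mentions every required section.
REQUIRED_SECTIONS = ["summary", "risks", "next steps"]


def toy_generate(task: str) -> str:
    return "summary: " + task


def toy_evaluate(task: str, output: str) -> Evaluation:
    missing = [s for s in REQUIRED_SECTIONS if s not in output]
    score = 1 - len(missing) / len(REQUIRED_SECTIONS)
    critique = "missing: " + ", ".join(missing) if missing else "ok"
    return Evaluation(score=score, critique=critique)


def toy_refine(task: str, output: str, critique: str) -> str:
    # Address only the first missing section named in the critique.
    first_missing = critique.split(": ", 1)[1].split(", ")[0]
    return output + "\n" + first_missing + ": TODO"


report, history = refine_loop(
    "quarterly report", toy_generate, toy_evaluate, toy_refine,
    threshold=1.0, max_iterations=5,
)
```

Passing the three steps in as callables keeps the loop independent of any particular model or test harness. The `max_iterations` cap is one way to bound cost when the evaluator never reaches the threshold.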