arize-evaluator
github/awesome-copilot
Defines and runs LLM-as-judge evaluators on Arize via ax commands. Covers evaluator creation, continuous monitoring, column mapping, trigger runs, and evaluation of spans or experiments, with configurable templates, models, and query filters.