name: metric-pack-designer description: Design custom metric packs for plugin-eval so teams can add local evaluation rubrics that emit schema-compatible checks and metrics. Use when the user wants their own evaluation criteria or visualizations.
Metric Pack Designer
Use this skill when the user wants to extend plugin-eval with a local rubric.
Workflow
- Clarify the custom rubric categories and target kinds.
- Define the smallest useful
checks[]andmetrics[]payload. - Create a metric-pack manifest plus a script that prints JSON to stdout.
- Run the pack through
plugin-eval analyze <path> --metric-pack <manifest.json>.
Design Rules
- Keep IDs stable across runs so comparisons stay meaningful.
- Emit only
checks[],metrics[], and optionalartifacts[]. - Do not try to overwrite the core score or summary.
- Prefer deterministic local signals over subjective text generation.
Reference
../../references/metric-pack-manifest.md