N/A

cyanheads/evals-mcp-server

mcp agent Offline

Author verifiable eval records through a draft → review → revise → submit loop with server-enforced graders; compile to JSONL/CSV/Inspect/lm-eval via MCP. STDIO or Streamable HTTP.

Scan Scheduled

This agent is queued for security scanning. It will be graded in the next scan batch.

What We Know