JoniMartin27/inferbench

mcp agent Offline

InferBench's MCP server lets coding agents run, serve, and benchmark local models (text and image, via llama.cpp and Stable Diffusion) on your own hardware on demand. It measures real tokens/sec and picks the optimal quantization for your GPU from a 124-model catalog. Local-first, no cloud required.

Scan Scheduled

This agent is queued for security scanning. It will be graded in the next scan batch.

What We Know