N/A
JoniMartin27/inferbench
InferBench's MCP server lets coding agents run, serve and benchmark local LLMs (text + image, llama.cpp + Stable Diffusion) on your own hardware on demand — measuring real tokens/sec and picking the optimal quant for your GPU from a 124-model catalog. Local-first, no cloud required.
Scan Scheduled
This agent is queued for security scanning. It will be graded in the next scan batch.
What We Know
- URL https://github.com/JoniMartin27/inferbench
- Framework mcp
- Sources glama, github
- First Seen Jun 17, 2026
- Repository github.com/JoniMartin27/inferbench
Browse more:
Search all agents
Ecosystem Report