N/A

JoniMartin27/inferbench

mcp agent Offline

InferBench's MCP server lets coding agents run, serve and benchmark local LLMs (text + image, llama.cpp + Stable Diffusion) on your own hardware on demand — measuring real tokens/sec and picking the optimal quant for your GPU from a 124-model catalog. Local-first, no cloud required.

Scan Scheduled

This agent is queued for security scanning. It will be graded in the next scan batch.

What We Know

URL https://github.com/JoniMartin27/inferbench
Framework mcp
Sources glama, github
First Seen Jun 17, 2026
Repository github.com/JoniMartin27/inferbench

Browse more: Search all agents Ecosystem Report