by Yoosu-L • Uncategorized
An MCP server for benchmarking Large Language Model APIs, measuring throughput and latency.
Measure and analyze the performance of LLM API endpoints.
Benchmark LLM APIs with customizable concurrency and token limits.
Detailed throughput and latency metrics for LLM model evaluation.
This MCP server enables comprehensive benchmarking of LLM APIs by measuring generation throughput, prompt throughput, and Time To First Token (TTFT). It supports flexible deployment, including connecting to a remote SSE server or running locally over stdio or SSE transports. The server returns detailed JSON output for in-depth performance analysis and is designed for easy integration with MCP clients.
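As a rough illustration of how such a server might be wired into an MCP client, the sketch below shows a typical client configuration for a local stdio transport. The server key, command name, and flags here are assumptions for illustration only; the actual invocation depends on how this server is packaged and documented.

```json
{
  "mcpServers": {
    "llm-benchmark": {
      "command": "llm-benchmark-mcp-server",
      "args": ["--transport", "stdio"]
    }
  }
}
```

For a remote deployment, an MCP client would instead point at the server's SSE endpoint URL rather than launching a local process; consult the project's own documentation for the exact command and endpoint.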