Loading connector details…
Loading connector details…
Choose a unique username to continue using AgentHotspot
by grctest • Uncategorized
A FastAPI-based MCP server to manage and interact with llama.cpp BitNet model instances via REST API.
Programmatically control and interact with BitNet model chat sessions.
Benchmark and evaluate GGUF models automatically.
Integration with VS Code Copilot Chat for enhanced developer workflows.
This MCP server provides a robust REST API for starting, stopping, and managing multiple BitNet model sessions using llama-cli and llama-server. It supports batch operations, interactive chat, model benchmarking, and resource estimation, all containerized with Docker for easy deployment. Additionally, it integrates with VS Code Copilot Chat via the Model Context Protocol for seamless developer experience.