by NightTrek • Testing & QA
An MCP server that exposes the Moondream vision model for image captioning, object detection, and visual question answering to Claude/Cline and other agents.
Generate natural-language captions or alt text for images to improve accessibility and content summaries.
Detect or verify the presence and location of specific objects within images (e.g., "detect: car").
Answer targeted visual questions or extract specific information from images through visual question answering prompts.
This repository provides an MCP-compatible server that runs a quantized Moondream vision model and exposes it through both an HTTP API and MCP tool interfaces. On startup it sets up the Python environment, downloads the model, and launches a model server, while the MCP layer handles protocol communication and tool dispatch. The primary tool, analyze_image, supports caption generation, targeted object detection, and visual question answering using simple prompt patterns. Because inference runs on a quantized model, the server is geared toward fast image analysis with a modest resource footprint.
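To illustrate how a single tool can cover three tasks through prompt patterns, here is a minimal sketch of a prompt classifier. It is not the repository's code: the function `parse_prompt`, the `ParsedPrompt` type, and the caption trigger phrases are assumptions made for illustration. Only the `detect: car` form appears in the description above.

```python
from typing import NamedTuple


class ParsedPrompt(NamedTuple):
    mode: str      # "caption", "detect", or "query"
    argument: str  # object name for detect, question text for query, "" for caption


def parse_prompt(prompt: str) -> ParsedPrompt:
    """Classify a prompt into one of three analysis modes (hypothetical helper)."""
    text = prompt.strip()
    lowered = text.lower()

    # "detect: <object>" is the pattern shown in the listing, e.g. "detect: car".
    if lowered.startswith("detect:"):
        target = text[len("detect:"):].strip()
        if not target:
            raise ValueError("detect prompt needs an object name, e.g. 'detect: car'")
        return ParsedPrompt("detect", target)

    # Assumed caption triggers; the real server may use different wording.
    if lowered in {"caption", "generate caption"}:
        return ParsedPrompt("caption", "")

    # Anything else is treated as a visual question about the image.
    return ParsedPrompt("query", text)
```

A caller would then route on `mode`, for example sending `detect` prompts to an object detection call and everything else to captioning or question answering. Treating unmatched prompts as questions keeps the interface forgiving, at the cost of never rejecting a malformed prompt.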