Loading connector details…
Loading connector details…
Choose a unique username to continue using AgentHotspot
by Loveacup • Uncategorized
An MCP server providing multimodal vision capabilities powered by OpenAI-compatible vision models.
Analyze images with natural language prompts.
Extract text from images using OCR.
Compare multiple images or analyze video content.
Vision MCP Server enables AI agents to analyze images and videos, perform OCR, and compare images using any OpenAI-compatible vision model. It supports local files and URLs, configurable via environment variables or config files, and integrates easily with MCP clients like Claude Code. The server supports multiple tools including image analysis, OCR text extraction, image comparison, and video content analysis, making it a versatile solution for adding visual understanding to AI agents.