Loading connector details…
Loading connector details…
Choose a unique username to continue using AgentHotspot
by jkawamoto • Analytics & Monitoring
An MCP server that processes images and PDFs with Microsoft Florence-2 to perform OCR text extraction and generate descriptive captions.
Extract searchable text from scanned documents, images, or PDFs hosted locally or on the web.
Generate concise descriptive captions or summaries of image content to aid downstream reasoning or to present visual context.
A drop-in MCP server to add image understanding capabilities to Claude Desktop, Goose CLI/Desktop, or LM Studio workflows.
This project provides an MCP (Model Context Protocol) server that uses Microsoft Florence-2 (large) to extract text from images/PDFs via OCR and to generate descriptive captions summarizing image content. It can process files located locally or on the web and is designed for easy integration into MCP-compatible clients like Claude Desktop, Goose (CLI/Desktop), and LM Studio. The server is open-source (MIT) and includes usage tooling for two primary operations: ocr and caption.
Scores are informational only and provided “as is” without warranty. AgentHotspot assumes no liability for actions taken based on these ratings.