by RayenMalouche • Uncategorized
An MCP-compliant server that extracts content and metadata from various file formats using Apache Tika.
Extract styled HTML content from PDFs and other document formats.
Retrieve detailed metadata and plain text from local files securely.
List and manage files available for extraction in a local directory.
The Tika MCP Extractor Server processes locally stored files to extract text, HTML with embedded CSS, and metadata using Apache Tika and PDFBox. It exposes four MCP tools and REST endpoints for file listing, content extraction, and metadata retrieval, and it runs without internet access, which suits secure, local document-processing workflows integrated with MCP-compliant clients.
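As a rough illustration of how an MCP-compliant client might talk to a server like this one, the sketch below builds the JSON-RPC 2.0 messages that MCP uses for tool discovery (`tools/list`) and tool invocation (`tools/call`). The tool name `extract_metadata` and the argument `fileName` are hypothetical placeholders, since the listing does not name the four tools or their parameters; a real client would read the actual names from the `tools/list` response.

```python
import json
from itertools import count

# Monotonic request ids, as JSON-RPC 2.0 expects each request to carry an id.
_ids = count(1)


def build_list_tools() -> dict:
    """Build an MCP `tools/list` request to discover the server's tools."""
    return {"jsonrpc": "2.0", "id": next(_ids), "method": "tools/list"}


def build_tool_call(tool_name: str, arguments: dict) -> dict:
    """Build an MCP `tools/call` request for one tool invocation."""
    return {
        "jsonrpc": "2.0",
        "id": next(_ids),
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    }


# Step 1: ask the server which tools it offers.
list_request = build_list_tools()

# Step 2: call a tool. The name and argument below are assumptions for
# illustration only; substitute whatever `tools/list` actually reports.
call_request = build_tool_call("extract_metadata", {"fileName": "report.pdf"})

print(json.dumps(list_request))
print(json.dumps(call_request, indent=2))
```

How these messages reach the server (stdio, HTTP, or another transport) depends on the server's configuration, which the listing does not specify.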