Loading connector details…
Loading connector details…
Choose a unique username to continue using AgentHotspot
by elchika-inc • Uncategorized
An MCP server for web crawling and content extraction with multiple output formats.
Extract clean text or structured content from web pages.
Respect robots.txt rules and rate limits during crawling.
Multiple output formats including markdown, XML, and JSON.
Open Crawler MCP Server enables extraction of web page content in text, markdown, XML, or JSON formats with support for CSS selectors. It ensures compliance with robots.txt, enforces rate limiting and page size limits, and provides structured content extraction including headings, links, images, and lists. The server offers comprehensive error handling for various failure scenarios, making it reliable for automated web crawling tasks.