Loading connector details…
Loading connector details…
Choose a unique username to continue using AgentHotspot
by JustAzul • Uncategorized
A Python-based MCP server for headless web scraping that extracts main text content from web pages and outputs Markdown, text, or HTML.
Extract clean, readable content from web pages for AI processing or automation.
A headless, robust web scraping tool integrated via MCP stdio/JSON-RPC.
Convert web content into Markdown, text, or HTML formats for seamless integration with IDEs and AI tools.
This MCP server uses a headless browser with Playwright, BeautifulSoup, and Markdownify to scrape web pages robustly, handling errors like timeouts and Cloudflare challenges. It outputs content in Markdown, text, or HTML formats, making it easy to integrate with AI tools and IDEs via the Model Context Protocol. The service is dockerized, configurable via environment variables, and supports per-domain rate limiting for responsible scraping.