Loading connector details…
Loading connector details…
Choose a unique username to continue using AgentHotspot
by fvanevski • Uncategorized
An MCP server providing a tool-based interface to extract main content and metadata from web pages using Trafilatura.
Extract main textual content from web pages programmatically.
Retrieve metadata such as title, author, and publication date from URLs.
That require configurable web scraping with options to include or exclude comments and tables.
This MCP server leverages the Trafilatura library to perform web scraping and metadata extraction from URLs. It offers configurable options to include or exclude comments and tables, and exposes a simple asynchronous tool called 'fetch_and_extract' for easy integration. Designed for compatibility with MCP clients, it facilitates programmatic access to web content extraction without requiring API keys or complex setup.