by gautierdag • Uncategorized
An MCP server that extracts survey content and bibliographies from arXiv papers using their LaTeX source.
Extract related work and survey sections from arXiv papers.
Normalize and merge bibliographic citations from multiple papers.
Build large corpora of background content by recursively extracting related work sections.
Bibextract is a Python package with a Rust backend that extracts survey, background, and related work sections directly from the LaTeX source of arXiv papers. It reconstructs and normalizes bibliography entries from BBL files and merges citations across multiple papers into a single unified output. By automating both section extraction and citation merging, it makes it easier to build large corpora of related work content for LLM agents.
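The two steps described above, pulling survey-style sections out of LaTeX source and merging bibliography entries across papers, can be illustrated with a minimal pure-Python sketch. This is not Bibextract's API or its Rust implementation: the function names (`extract_survey_sections`, `parse_bbl`, `merge_bibliographies`), the list of section titles, and the deduplication rule are all assumptions made for illustration, and the regexes only handle simple, well-formed input.

```python
import re
from typing import Dict, List, Tuple

# Assumed list of titles that count as survey-style sections (hypothetical).
SURVEY_TITLES = ("related work", "background", "survey", "literature review")

# Matches \section{Title} and \section*{Title}; nested braces are not handled.
SECTION_RE = re.compile(r"\\section\*?\{([^}]*)\}")

# Matches \bibitem{key} or \bibitem[label]{key} and the text up to the next entry.
BIBITEM_RE = re.compile(
    r"\\bibitem(?:\[[^\]]*\])?\{([^}]+)\}(.*?)"
    r"(?=\\bibitem|\\end\{thebibliography\}|\Z)",
    re.DOTALL,
)


def extract_survey_sections(latex: str) -> Dict[str, str]:
    """Return {title: body} for top-level sections with a survey-like title."""
    matches = list(SECTION_RE.finditer(latex))
    sections = {}
    for i, match in enumerate(matches):
        title = match.group(1).strip()
        if not any(name in title.lower() for name in SURVEY_TITLES):
            continue
        # A section body runs until the next \section or the end of the file.
        end = matches[i + 1].start() if i + 1 < len(matches) else len(latex)
        sections[title] = latex[match.end():end].strip()
    return sections


def parse_bbl(bbl: str) -> Dict[str, str]:
    """Map each bibitem key to its entry text with whitespace collapsed."""
    return {
        key.strip(): " ".join(body.split())
        for key, body in BIBITEM_RE.findall(bbl)
    }


def merge_bibliographies(bbls: List[str]) -> List[Tuple[str, str]]:
    """Merge entries from several BBL files, keeping the first copy of each.

    Two entries are treated as duplicates when their text is identical after
    lowercasing and dropping everything except letters and digits.
    """
    merged: Dict[str, Tuple[str, str]] = {}
    for bbl in bbls:
        for key, text in parse_bbl(bbl).items():
            fingerprint = re.sub(r"[^a-z0-9]", "", text.lower())
            merged.setdefault(fingerprint, (key, text))
    return list(merged.values())
```

Deduplicating on the normalized entry text, not on the citation key, reflects the fact that different papers often cite the same work under different keys. A real tool would likely need more robust matching (for example on title and year) and a proper LaTeX parser.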