Loading connector details…
Loading connector details…
Choose a unique username to continue using AgentHotspot
by AI-Riksarkivet • Uncategorized
An MCP server and CLI for searching and browsing AI-transcribed historical documents from the Swedish National Archives.
Perform full-text search on historical documents from the Swedish National Archives.
Browse and view AI-transcribed pages and handwritten text recognition results.
Interactive archival research guides integrated with LLM clients.
RA-MCP provides full-text search across millions of AI-transcribed pages, complete page transcriptions, handwritten text recognition, interactive document viewing, and archival research guides. It is accessible as MCP tools compatible with any LLM client, enabling streamlined archival research. The server supports streamable HTTP for easy integration with clients like ChatGPT and Claude, and offers a CLI for direct terminal interaction.
Search AI-transcribed text in digitised historical documents from the Swedish National Archives. IMPORTANT: Transcriptions are AI-generated (HTR/OCR) and contain recognition errors — always use fuzzy search (~) to compensate for misread characters and increase hits. Supports Solr syntax: wildcards (troll*), fuzzy (stockholm~1), Boolean ((A AND B)), proximity ("term1 term2"~10). Always group Boolean queries with outer parentheses. Use fuzzy (~) for OCR/HTR errors and old Swedish variants (präst/prest, silver/silfver). Paginate with offset (0, 50, 100...). Session dedup: re-calling returns stubs for already-seen documents.
Search document metadata (titles, names, places, descriptions) across the Swedish National Archives catalog. Covers 2M+ records when only_digitised=False, including non-digitised materials. Use the dedicated name parameter for person searches and place parameter for place searches — these can be combined with keyword. Does NOT search transcribed page text — use search_transcribed for that. Same Solr syntax as search_transcribed. Session dedup: re-calling returns stubs for already-seen documents. Important: name and place filter a dedicated metadata field that is sparsely populated. Most person/place matches are NOT digitised, so set only_digitised=False when using name or place to avoid empty results.
View full page transcriptions of a document by reference code. Use reference codes from search results. Returns original text (usually Swedish), links to bildvisaren (image viewer), and ALTO XML. Blank pages are normal (digitised but no text). Non-digitised materials return metadata only. Session dedup: re-browsing same pages returns stubs. Set dedup=False to force full text. TOKEN COST: ~300 tokens overhead per response + ~200-1500 tokens per page depending on content density. Dense court protocol pages average ~1000 tokens each; title/cover pages ~300. Request only the pages you need — start with 3-5 pages and paginate.
Transcribe handwritten documents and return results as file URLs. Sends images to the HTRflow Gradio Space for AI-powered handwritten text recognition. Returns URLs to an interactive viewer, per-page JSON transcriptions, and an archival export file.
Display document pages with zoomable images and text layer overlays. Takes a reference code and page specification (same as browse_document). Use after search to visually inspect document pages with transcription overlay. Use highlight_term to pre-populate the search bar and highlight matching text lines.
Get the table of contents (Innehållsförteckning) for the Riksarkivet historical guide.
>
File listing for archival-guide
Scores are informational only and provided “as is” without warranty. AgentHotspot assumes no liability for actions taken based on these ratings.