Loading connector details…
Loading connector details…
Choose a unique username to continue using AgentHotspot
by radial-hks • Uncategorized
A Python server providing asynchronous speech transcription, audio validation, and voice activity detection using Alibaba's FunASR library.
Transcribe long audio files asynchronously with detailed timestamp information.
Validate audio file integrity and properties before processing.
Detect speech segments within audio files using voice activity detection.
This MCP server leverages Alibaba's FunASR models to offer robust speech processing capabilities including audio file validation, asynchronous speech-to-text transcription with detailed timestamping, and voice activity detection. It supports dynamic loading and switching of ASR and VAD models, allowing flexible configuration per request or globally. The server is designed for extensibility and integrates with the FastMCP framework for easy interaction via MCP clients or HTTP.