by krisoye • Uncategorized
An MCP server providing comprehensive audio analysis including transcription, speaker diarization, prosody, speech patterns, and sentiment analysis.
Transcribe audio with word-level timestamps.
Identify and label individual speakers in audio recordings.
Analyze prosody, speech patterns, and sentiment in spoken content.
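Word-level timestamps and speaker labels come from separate stages, so a client or pipeline typically has to line them up. The sketch below shows one common way to do that: assign each word to the speaker turn it overlaps most. The `Word` and `SpeakerTurn` types, the `label_words` function, and the `SPEAKER_00`-style labels are hypothetical names chosen for illustration; they are not taken from this server's API or output format.

```python
from dataclasses import dataclass


@dataclass
class Word:
    """A transcribed word with start/end times in seconds (hypothetical shape)."""
    text: str
    start: float
    end: float


@dataclass
class SpeakerTurn:
    """A diarization segment attributed to one speaker (hypothetical shape)."""
    speaker: str
    start: float
    end: float


def overlap(a_start: float, a_end: float, b_start: float, b_end: float) -> float:
    """Length in seconds of the intersection of two intervals, or 0.0 if disjoint."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def label_words(words: list[Word], turns: list[SpeakerTurn]) -> list[tuple[str, str]]:
    """Pair each word with the speaker whose turn overlaps it the most.

    Words that overlap no turn at all are labeled "UNKNOWN".
    """
    labeled = []
    for word in words:
        best = max(
            turns,
            key=lambda t: overlap(word.start, word.end, t.start, t.end),
            default=None,
        )
        if best is None or overlap(word.start, word.end, best.start, best.end) == 0.0:
            labeled.append((word.text, "UNKNOWN"))
        else:
            labeled.append((word.text, best.speaker))
    return labeled


if __name__ == "__main__":
    # Made-up example data, not real model output.
    words = [Word("hello", 0.0, 0.4), Word("there", 0.5, 0.9), Word("hi", 1.2, 1.5)]
    turns = [SpeakerTurn("SPEAKER_00", 0.0, 1.0), SpeakerTurn("SPEAKER_01", 1.0, 2.0)]
    for text, speaker in label_words(words, turns):
        print(f"{speaker}: {text}")
```

Maximum overlap is a simple rule that handles words straddling a turn boundary; a real pipeline may prefer a different tie-breaking or smoothing strategy.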
This MCP server provides a modular pipeline for audio analysis, using models such as OpenAI Whisper for transcription and pyannote.audio for speaker diarization. It supports GPU acceleration with automatic fallback to CPU, and includes a low-VRAM mode for resource-constrained environments. The server is designed to integrate with MCP clients such as Claude, exposing its audio processing capabilities as composable tools.
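The GPU-with-CPU-fallback and low-VRAM behavior described above could be sketched roughly as follows. This is an illustration under stated assumptions, not the server's actual logic: `RuntimeConfig`, `choose_runtime`, and the mapping from mode to Whisper model size are hypothetical. The only real library call used is `torch.cuda.is_available()`, and the sketch treats a missing PyTorch install as "no GPU".

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RuntimeConfig:
    """Where to run the models and which Whisper size to load (hypothetical)."""
    device: str      # "cuda" or "cpu"
    model_size: str  # a Whisper model name such as "base", "small", "large-v3"


def cuda_available() -> bool:
    """Return True only if PyTorch is installed and reports a usable CUDA device."""
    try:
        import torch
    except ImportError:
        return False
    return torch.cuda.is_available()


def choose_runtime(has_cuda: bool, low_vram: bool = False) -> RuntimeConfig:
    """Pick a device and model size.

    The size choices below are assumptions made for this sketch:
    a smaller model on CPU or in low-VRAM mode, a larger one otherwise.
    """
    if not has_cuda:
        # Fallback path: no GPU, so run on CPU with a small model.
        return RuntimeConfig(device="cpu", model_size="base")
    if low_vram:
        # GPU present but memory is tight: keep the GPU, shrink the model.
        return RuntimeConfig(device="cuda", model_size="small")
    return RuntimeConfig(device="cuda", model_size="large-v3")


if __name__ == "__main__":
    config = choose_runtime(cuda_available(), low_vram=True)
    print(f"Running on {config.device} with Whisper '{config.model_size}'")
```

Passing the CUDA check in as an argument keeps the decision logic a pure function, which makes it easy to test on machines without a GPU.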