#audio
25 results found
ElevenLabs MCP Server
Mirror of
MCP TTS Say
MCP Server Tool for Text To Speech
Video Extraction Server
MCP Video & Audio Text Extraction Server A Model Context Protocol (MCP) server that enables text extraction from various video platforms and audio files, allowing compatible host applications (like Claude Desktop, Cursor) to access video content and perform text transcription. What is it? MCP Video & Audio Text Extraction Server is a Model Context Protocol (MCP) server that can download videos from various platforms, extract audio, and convert it to text. The server utilizes OpenAI's Whisper model for high-quality audio-to-text conversion. How to use it? Clone the repository and install dependencies Ensure FFmpeg is installed Run the server Configure your MCP host application (like Claude Desktop) to use the server Key Features Support video downloads from multiple platforms including YouTube, Bilibili, TikTok, etc. Extract audio content from videos High-quality speech recognition using Whisper model Multi-language text recognition support Asynchronous processing for large files Standardized MCP tools interface Use Cases Provide text transcription capabilities for applications that need to process video content Batch process video content and extract text information Create custom applications requiring audio/video text extraction functionality Enable AI assistants to understand video content FAQ What are the system requirements to run the server? > Requires Python 3.9+, FFmpeg, minimum 8GB RAM, GPU acceleration recommended What should I know about first run? > The system will automatically download the Whisper model file (approximately 1GB), which may take several minutes to tens of minutes What audio formats are supported? > Supports common audio formats including mp3, wav, m4a, etc. This description maintains the core information from the original README while adopting a similar structure and style to the reference page. Would you like me to adjust or add anything to this description?
Audacity MCP Server
MCP server for Audacity
Pure Data MCP Server
A Model Context Protocol (MCP) server for Pure Data, an open-source visual programming language and patchable environment for real-time computer music.
MMAudio
AI-powered video-to-audio and text-to-audio generation using MMAudio's advanced AI technology.
Claude Desktop Real-time Audio MCP
Real-time microphone input MCP server for Claude Desktop on Windows - enabling live voice conversations with Claude through WASAPI audio capture and real-time speech recognition
Claude Desktop Real-time Audio MCP Server (Python Implementation)
Python-based Model Context Protocol (MCP) server for real-time microphone input to Claude Desktop on Windows. FastMCP + sounddevice + multiple STT engines for sub-500ms latency voice conversations.
gradio-transcript-mcp: A Gradio MCP Server for Audio/Video Transcription from URLs
Gradio demo cum MCP server to generate transcripts from Audio/Video
eShopLite 🛒
eShopLite is a set of reference .NET applications implementing an eCommerce site with features like Semantic Search, MCP, Reasoning models and more.
Sonos Mcp
MCP server for controlling Sonos speakers and playing audio streams.