#multimodal
8 results found
Multimodal Model Context Protocal Server
A multimodal mcp server
MCP servers to handle multimodal medical data
Process and prepare your multimodal medical data with natural language!
Voice MCP Client
A iOS/MacOS Swift MCP Client using voice interacting with python MCP servers both natively
👾 Digimon Engine
Digimon Engine — Multi-Agent, Multi-Player Framework for AI-Native Games and Agentic Metaverse
Rostro
Turn any language model into a multimodal powerhouse that can generate images, music, videos, 3D models and more on the fly. Rostro's tools are designed to be used by language models from the ground up, expanding capabilities with minimal context bloat.
Kultur.dev
Multimodal Cultural Intelligence Infrastructure for AI Agents. The only MCP server that analyzes text, images, and video for cultural risks across 200+ markets. 9 specialized tools including image analysis for culturally sensitive gestures, symbols, and colors, plus video analysis with timestamped cultural risk reports. Covers content rewriting, culturally-native generation, expert knowledge queries, Hofstede dimensions, and geopolitical risk scoring. Free tier available.
Superdocs
A structured-document editor for AI agents. SuperDocs gives your AI 21 MCP tools and 4 workflow prompts to make section-precise edits — bold a specific paragraph, replace a single table cell, restructure a heading — without disturbing surrounding content. Tables, borders, alternating row shading, fonts, and inline styling all survive AI edits AND round-trip exports across .docx, PDF, HTML, Markdown, and RTF. Other capabilities: pre-signed URL upload/download (no context bloat for files >100KB), compact response mode for editing 100-page documents efficiently (~140× token reduction), multimodal vision on attachments, human-in-the-loop approval for sensitive edits, and multi-language editing across 16+ languages. Free plan: 500 ops/month, no credit card required.