Submit

#vision

18 results found

🚀 OpenCV MCP Server

OpenCV MCP Server provides OpenCV's image and video processing capabilities through the Model Context Protocol (MCP). Access powerful computer vision tools for tasks ranging from basic image manipulation to advanced object detection and tracking.

M

MCP OpenVision

MCP Server using OpenRouter models to get descriptions for images

G

groundlight-mcp-server

MCP Server for Groundlight

M

MCP Server for CVDLT(Computer Vision & Deep Learning Tools)

The repo is based on Model Context procotol of Python SDK, including DL models in CV, and provide the abilities to the LLM or vLLM model

🚀 Wayland MCP Server

MCP Server for Wayland

M

MCPControl

MCP server for Windows OS automation

A

Apple RAG MCP

Transform your AI agents into Apple development experts! Apple RAG MCP gives you instant access to official Swift docs, design guidelines, and comprehensive Apple platform knowledge through cutting-edge RAG technology. With professional AI reranking and hybrid search across iOS, macOS, watchOS, tvOS, and visionOS documentation plus Apple Developer YouTube content, you'll get precise, contextual answers every time. Compatible with Cursor, Claude Desktop, and all MCP tools - start building smarter Apple apps today!

L

LibreChat

Enhanced ChatGPT Clone: Features Agents, DeepSeek, Anthropic, AWS, OpenAI, Assistants API, Azure, Groq, o1, GPT-4o, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message search, Code Interpreter, langchain, DALL-E-3, OpenAPI Actions, Functions, Secure Multi-User Auth, Presets, open-source for self-hosting. Active project.

A

AutoProvisioner MCP Server (open beta)

Mirror of

U

UI-TARS Desktop

A GUI Agent application based on UI-TARS(Vision-Language Model) that allows you to control your computer using natural language.

T

Trend Vision One MCP Server

The Trend Vision One Model Context Protocol (MCP) Server enables natural language interaction between your favourite AI tooling and the Trend Vision One web APIs. This allows users to harness the power of Large Language Models (LLM) to interpret and respond to security events.

U

UI-TARS Desktop 🚀

A GUI Agent application based on UI-TARS(Vision-Language Model) that allows you to control your computer using natural language.

V

Vision Mcp Server | 图片分析 Mcp

This MCP addresses the visual recognition limitations of text-based models by enabling accurate image description and identification, making it excellent for AI-assisted reference design interface analysis. It currently supports dropping links into the dialog box or placing images in the project folder for recognition. The tool can be integrated with MCP platforms like Claude Code, Cline, and Trae. Beyond programming applications, it also provides visual recognition capabilities for models that lack native image processing functionality. For visual models, users can select their preferred model from ModelScope community and replace it during MCP configuration setup. 📱 Daily Use Cases: Send screenshots to directly identify errors or issues Share image links or place screenshots in the project folder for AI-assisted layout optimization Submit product image links to generate promotional copy 该mcp可以解决文字模型图片识别的视觉的问题,可以准确识别描述图片,用来给AI看参考设计界面很nice~ 目前支持丢链接到对话框,以及把图片放到项目文件夹进行识别。 支持加入到Claude Code,Cline和Trae等mcp工具中。 除了编程外,如果你使用的模型本身不支持视觉图片识别,也可以使用~ 视觉模型可以自己去魔搭社区选一个自己喜欢的,在填写mcp配置的时候替换即可 📱 日常使用场景 - 截图发过去,直接告诉哪里出错了 - 丢过去一个图片链接或者截图放到项目文件夹内,让AI帮忙优化布局 - 发个产品图链接,让AI写推广文案

© 2025 MCP.so. All rights reserved.

Build with ShipAny.