Overview
What is mcp-server-webcrawl?
mcp-server-webcrawl is a tool that bridges the gap between web crawling and AI language models using the Model Context Protocol (MCP). It allows AI clients to search, filter, and analyze crawled web content, either autonomously or under user direction.
How to use mcp-server-webcrawl?
To use mcp-server-webcrawl, install it via pip with the command pip install mcp-server-webcrawl, then register it as an MCP server in Claude Desktop's configuration file (claude_desktop_config.json), pointing it at the output of a supported web crawler.
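
A minimal configuration sketch follows, assuming the server accepts --crawler and --datasrc arguments for selecting a crawler type and an archive location (verify both against the project documentation); the crawler name and path below are placeholders to adapt to your own setup:

```json
{
  "mcpServers": {
    "webcrawl": {
      "command": "mcp-server-webcrawl",
      "args": ["--crawler", "wget", "--datasrc", "/path/to/wget/archives/"]
    }
  }
}
```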
Key features of mcp-server-webcrawl?
- Compatibility with multiple web crawlers (WARC, wget, InterroBot, Katana, SiteOne)
- Full-text search interface with boolean support (see the query sketch after this list)
- Resource filtering by type and HTTP status
- Quick MCP configuration
- Support for Claude Desktop and upcoming ChatGPT integration
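
To illustrate the full-text search and filtering features above, the sketch below shows an MCP tools/call request an AI client might issue. The tool name (webcrawl_search), its arguments (sites, query, limit), and the field syntax (type:, status:) are assumptions based on typical usage of this server; check the tool schema the server actually advertises.

```json
{
  "jsonrpc": "2.0",
  "id": 1,
  "method": "tools/call",
  "params": {
    "name": "webcrawl_search",
    "arguments": {
      "sites": [1],
      "query": "privacy AND (cookie OR consent) AND type: html AND status: 200",
      "limit": 10
    }
  }
}
```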
Use cases of mcp-server-webcrawl?
- Analyzing web content for research purposes.
- Automating data extraction from websites.
- Enriching AI workflows with searchable, crawled web data.
FAQ about mcp-server-webcrawl?
- Is mcp-server-webcrawl free to use?
Yes! It is open-source and free for everyone.
- What are the system requirements?
It requires Python (>=3.10) and an MCP client such as Claude Desktop.
- Can it work with any web crawler?
Yes, it supports several crawlers and archive formats, including WARC, wget, InterroBot, Katana, and SiteOne; a wget example is sketched below.
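
As a starting point for the wget case, the sketch below shows one way to produce a local mirror the server could index; the wget flags are standard, but the expected archive layout (plain mirror vs. WARC output) should be confirmed in the mcp-server-webcrawl documentation.

```bash
# Hypothetical example: mirror a site into the directory configured as
# the data source above. All flags are standard wget options.
wget --mirror --adjust-extension --convert-links \
     --directory-prefix=/path/to/wget/archives/ \
     https://example.com/
```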