Doc Scraper MCP Server

A Model Context Protocol (MCP) server that provides documentation scraping. The server uses jina.ai's conversion service to convert web-based documentation into Markdown.

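Features

- Scrapes documentation from any URL
- Converts HTML documentation to Markdown
- Saves the converted documentation to a specified output path
- Integrates with the Model Context Protocol (MCP)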

Installation

Installing via Smithery

To install Doc Scraper for Claude Desktop automatically via Smithery:

npx -y @smithery/cli install @askjohngeorge/mcp-doc-scraper --client claude

Clone the repository:

git clone https://github.com/askjohngeorge/mcp-doc-scraper.git
cd mcp-doc-scraper

Create and activate a virtual environment:

python -m venv venv
source venv/bin/activate  # On Windows, use: venv\Scripts\activate

Install the dependencies:

pip install -e .

Usage

The server can be run with Python:

python -m mcp_doc_scraper
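An MCP client such as Claude Desktop typically launches a server from an `mcpServers` entry in its configuration file. The sketch below generates such an entry for the command above. The server key `doc-scraper` is a hypothetical name chosen for illustration, and the interpreter path should point at the virtual environment where the package is installed.

```python
import json
import sys


def make_server_entry(python_path: str, name: str = "doc-scraper") -> dict:
    """Build an `mcpServers` entry that starts the server as a module.

    `name` is a hypothetical key; any label the client accepts will do.
    """
    return {
        "mcpServers": {
            name: {
                "command": python_path,
                "args": ["-m", "mcp_doc_scraper"],
            }
        }
    }


if __name__ == "__main__":
    # sys.executable is the interpreter running this script; run it from
    # the activated virtual environment so the path matches the install.
    print(json.dumps(make_server_entry(sys.executable), indent=2))
```

The printed JSON can be merged into the client's existing configuration file rather than replacing it.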

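Tool description

The server provides a single tool:

Name: scrape_docs
Description: Scrapes documentation from a URL and saves it as Markdown
Input parameters:
- url: the URL of the documentation to scrape
- output_path: the path where the Markdown file should be saved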
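To make the tool's behavior concrete, here is a minimal sketch of the fetch-and-save flow, not the server's actual implementation. It assumes the conversion service is reached by prefixing the target URL with `https://r.jina.ai/` (the Jina Reader endpoint), and it uses the standard library's `urllib` instead of `aiohttp` so the example has no third-party dependencies. The helper names are hypothetical.

```python
import urllib.request
from pathlib import Path

# Assumption: Jina Reader returns Markdown for https://r.jina.ai/<target URL>.
READER_PREFIX = "https://r.jina.ai/"


def build_reader_url(url: str) -> str:
    """Return the reader URL for an http(s) documentation page."""
    if not url.startswith(("http://", "https://")):
        raise ValueError(f"expected an http(s) URL, got: {url!r}")
    return READER_PREFIX + url


def save_markdown(markdown: str, output_path: str) -> Path:
    """Write Markdown to output_path, creating parent directories if needed."""
    path = Path(output_path)
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(markdown, encoding="utf-8")
    return path


def scrape_docs(url: str, output_path: str, timeout: float = 30.0) -> Path:
    """Fetch the converted page and save it (synchronous, for illustration)."""
    with urllib.request.urlopen(build_reader_url(url), timeout=timeout) as resp:
        markdown = resp.read().decode("utf-8")
    return save_markdown(markdown, output_path)
```

Calling `scrape_docs("https://example.com/docs", "docs/example.md")` would perform a network request, so only the two helpers are safe to exercise offline.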

Project structure

doc_scraper/
├── __init__.py
├── __main__.py
└── server.py

Dependencies

aiohttp
mcp
pydantic

Development

To set up a development environment, install the development dependencies:

pip install -r requirements.txt

The server uses the Model Context Protocol (MCP). Be sure to familiarize yourself with the MCP documentation.
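As a pointer into the protocol, MCP clients invoke tools with a JSON-RPC 2.0 request whose method is `tools/call`. The sketch below builds such a request for `scrape_docs`. The example URL, output path, and the `make_tool_call` helper are illustrative assumptions, and a real client library would normally construct and send this message for you.

```python
import json


def make_tool_call(request_id: int, url: str, output_path: str) -> dict:
    """Build a JSON-RPC 2.0 `tools/call` request for the scrape_docs tool."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {
            "name": "scrape_docs",
            "arguments": {"url": url, "output_path": output_path},
        },
    }


# Illustrative values only; substitute a real documentation URL and path.
request = make_tool_call(1, "https://example.com/docs", "docs/example.md")
print(json.dumps(request, indent=2))
```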

License

MIT License
