The Simple Document Processing MCP Server provides comprehensive document processing capabilities for various file formats:
Read content from PDF, DOCX, TXT, HTML, CSV, and Excel files
Convert documents between formats (DOCX to PDF/HTML, HTML to TXT/Markdown, Excel to JSON, and between Markdown/HTML/XML/JSON)
PDF manipulation: Merge multiple PDFs or split PDFs by page ranges
Text processing: Split text by lines/delimiters, compare files, format text, and convert between encodings (UTF-8, Big5, GBK)
HTML processing: Clean HTML, extract resources (images, links, videos), format HTML, and convert to plain text or Markdown
Simple Document Processing MCP Server
A powerful Model Context Protocol (MCP) server providing comprehensive document processing capabilities.
Features
Document Reader
Read DOCX, PDF, TXT, HTML, CSV
Document Conversion
DOCX to HTML/PDF conversion
HTML to TXT/Markdown conversion
PDF manipulation (merge, split)
Text Processing
Multi-encoding transfer support (UTF-8, Big5, GBK)
Text formatting and cleaning
Text comparison and diff generation
Text splitting by lines or delimiter
HTML Processing
HTML cleaning and formatting
Resource extraction (images, links, videos)
Structure-preserving conversion
Related MCP server: Fetch MCP Server
Installation
Installing via Smithery
To install Document Processing Server for Claude Desktop automatically via Smithery:
Manual Installation
Usage
Cli
With Dive Desktop
Click "+ Add MCP Server" in Dive Desktop
Copy and paste this configuration:
Click "Save" to install the MCP server
License
MIT
Contributing
Welcome community participation and contributions! Here are ways to contribute:
⭐️ Star the project if you find it helpful
🐛 Submit Issues: Report problems or provide suggestions
🔧 Create Pull Requests: Submit code improvements
Contact
If you have any questions or suggestions, feel free to reach out:
📧 Email: reahtuoo310109@gmail.com
📧 GitHub: CabLate
🤝 Collaboration: Welcome to discuss project cooperation
📚 Technical Guidance: Sincere welcome for suggestions and guidance