MCP PDF Reader
A Model Context Protocol (MCP) server that provides tools for reading and processing PDF documents. Built with Docling for document conversion and text extraction.
Features
- MCP Server with tools for PDF document processing
- Document Text Extraction: Convert PDF content to clean Markdown format
- Document Discovery: List and access available PDF files
Tools
The server provides two main tools:
get_document_list
: Returns a list of all available PDF files in the data directoryget_document_text
: Extracts and returns the full text content of a specified PDF file in Markdown format
Install
Make sure you have uv
installed.
Clone the repository:
Install Python:
Create and activate a virtual environment:
Install dependencies:
Usage
Add PDF Documents
Place your PDF files in the data/
directory. The server will automatically detect and make them available through the tools.
Run MCP Server
Start the MCP server:
The server runs using stdio transport and can be integrated with any MCP-compatible client.
Development and Testing
Use the MCP inspector to test the server:
This will open a web interface where you can test the available tools and inspect their responses.
Use in VSCode/Cursor
You can use the MCP integration in your editor. Tools & Integrations
-> New MCP Server
and edit the mcp.json
file to include the following:
License
See LICENSE file for details.
This server cannot be installed
local-only server
The server can only run on the client's local machine because it depends on local resources.
A server that provides tools for reading and processing PDF documents, allowing users to list available PDFs and extract their content in Markdown format.
Related MCP Servers
- AsecurityFlicenseAqualityAn MCP server for converting Markdown documents to PDF files.Last updated -11010JavaScript
- AsecurityAlicenseAqualityA document conversion server that transforms various file formats (PDFs, documents, images, audio, web content) to Markdown with improved multilingual and UTF-8 support.Last updated -1009TypeScriptMIT License
- -securityAlicense-qualityA server that provides document processing capabilities using the Model Context Protocol, allowing conversion of documents to markdown, extraction of tables, and processing of document images.Last updated -15PythonMIT License
Textin MCP Serverofficial
AsecurityAlicenseAqualityA server that enables OCR capabilities to recognize text from images, PDFs, and Word documents, convert them to Markdown, and extract key information.Last updated -31918JavaScriptMIT License