Auto-Snap MCP 📸
Automated screenshot capture and document processing for MCP clients
Turn your screenshots into PDFs automatically! Auto-Snap lets your MCP client capture windows, process documents, and create PDFs with simple natural language commands.
🚀 Quick Start
Which Option Should I Choose?
- 🐳 Docker → Zero setup, works everywhere (recommended)
- 🪟 WSL2 → Windows users with WSL2 installed
- 🐧 Linux → Native Linux
Setup Steps
1. Get Auto-Snap:
2. Install dependencies (Linux only):
3. Configure Claude Desktop:
Edit ~/.claude/claude_desktop_config.json and add one of these:
🐳 Docker (Recommended - No setup needed!)
🪟 WSL2 (Windows with WSL2)
🐧 Linux Native
4. Restart Claude Desktop and verify it works:
- "List all my open windows" ← Start with this to test the connection
- "Capture this PDF and convert to images"
- "Take 5 screenshots and make them into a PDF"
✅ Working? You should see your windows listed. If not, check the 🚨 Not Working? section below.
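The configuration step above can be sketched in Python as a small helper that merges a server entry into the Claude Desktop config. This is a minimal sketch, not the project's own installer: the `mcpServers` layout with `command` and `args` is the usual Claude Desktop format, but the server name `auto-snap`, the image name `auto-snap-mcp`, and the script path are placeholders assumed for illustration, so substitute the values from the Auto-Snap repository.

```python
import json
from pathlib import Path

# Hypothetical entries: the image name and script path below are placeholders,
# not values taken from the Auto-Snap repository.
SERVER_ENTRIES = {
    "docker": {"command": "docker", "args": ["run", "-i", "--rm", "auto-snap-mcp"]},
    "linux": {"command": "python3", "args": ["/path/to/auto-snap-mcp/server.py"]},
}


def add_server(config: dict, name: str, entry: dict) -> dict:
    """Return a copy of config with entry registered under mcpServers[name]."""
    merged = dict(config)
    servers = dict(merged.get("mcpServers", {}))
    servers[name] = entry
    merged["mcpServers"] = servers
    return merged


def update_config_file(path: Path, name: str, entry: dict) -> None:
    """Read the config file if it exists, add the entry, and write it back."""
    config = json.loads(path.read_text()) if path.exists() else {}
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(json.dumps(add_server(config, name, entry), indent=2))


if __name__ == "__main__":
    # Print the merged result rather than writing it, so nothing is modified.
    print(json.dumps(add_server({}, "auto-snap", SERVER_ENTRIES["docker"]), indent=2))
```

Merging instead of overwriting keeps any other MCP servers already listed in the file.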
📁 Where Your Files Are Saved
By default, all your captures go to the auto-snap-captures folder in your home directory.
Full path examples:
- Linux: /home/username/auto-snap-captures/
- WSL2: /home/username/auto-snap-captures/
- Windows (Docker): C:\Users\username\auto-snap-captures\
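As a rough illustration of how the default location maps onto the paths listed above, the sketch below resolves an `auto-snap-captures` folder under the user's home directory. The function name `default_capture_dir` is an assumption made for this example and is not part of Auto-Snap.

```python
from pathlib import Path
from typing import Optional

# Folder name taken from the path examples in this README.
CAPTURE_DIR_NAME = "auto-snap-captures"


def default_capture_dir(home: Optional[Path] = None) -> Path:
    """Return <home>/auto-snap-captures, using the current user's home by default."""
    base = home if home is not None else Path.home()
    return base / CAPTURE_DIR_NAME


if __name__ == "__main__":
    print(default_capture_dir())
```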
✨ What It Does & Commands to Try
📋 Document Capture
- Screenshot any window or the entire screen
- Capture multi-page documents automatically
- Works with PDFs, presentations, web pages
🔍 Image Processing
- Extract text from screenshots (OCR)
- Enhance image quality automatically
- Process multiple images at once
📄 PDF Creation
- Convert screenshots to PDF instantly
- Organize files with smart naming
- Compress PDFs for smaller size
🎯 Commands to Try
"Capture this document as PDF" → Takes screenshots and creates a PDF
"Extract text from these images" → Runs OCR on screenshots
"Archive this presentation" → Screenshots all slides into one PDF
"List all my open windows" → See what you can capture
"Take 5 screenshots and make them into a PDF" → Multi-page capture
💡 Pro Tips:
- Be specific: "Capture the Chrome window" works better than "take a screenshot"
- Multi-step: "Screenshot this presentation and extract all the text"
- Batch work: "Process all images in my Downloads and make PDFs"
Want to change where files are saved? See the 🎨 Customize Your Captures section below.
🛠️ System Support
- ✅ Linux (native X11)
- ✅ WSL2 (Windows apps from Linux)
- ⚠️ macOS (with XQuartz - experimental)
🚨 Not Working?
Common fixes:
Still stuck?
- Make sure the config path is correct: ~/.claude/claude_desktop_config.json
- Restart Claude Desktop after config changes
- Check Claude Desktop logs for errors
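The checks above can be partly automated. The following sketch is an illustration under assumptions, not a tool shipped with Auto-Snap: it assumes the server is registered under the hypothetical key `auto-snap`, and it treats a missing `DISPLAY` variable as a likely problem because X11 screenshot capture needs a display to connect to.

```python
import json
import os
from pathlib import Path
from typing import List, Mapping


def diagnose(config_path: Path, env: Mapping[str, str],
             server_name: str = "auto-snap") -> List[str]:
    """Return human-readable problems found in the config and the environment."""
    problems = []
    if not config_path.exists():
        problems.append(f"Config file not found: {config_path}")
    else:
        try:
            config = json.loads(config_path.read_text())
        except json.JSONDecodeError as exc:
            problems.append(f"Config file is not valid JSON: {exc}")
        else:
            # "auto-snap" is an assumed key; use whatever name your config uses.
            if server_name not in config.get("mcpServers", {}):
                problems.append(f"No '{server_name}' entry under mcpServers")
    if not env.get("DISPLAY"):
        problems.append("DISPLAY is not set, so X11 capture has no display to use")
    return problems


if __name__ == "__main__":
    path = Path.home() / ".claude" / "claude_desktop_config.json"
    for problem in diagnose(path, os.environ):
        print("-", problem)
```

An empty result only means these basic checks passed; the Claude Desktop logs remain the place to look for runtime errors.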
🎨 Customize Your Captures
Set where files go:
Organize by date:
Custom file names:
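To make the three options above concrete, here is a minimal sketch of how an output folder, date-based subfolders, and a file name template could combine into a final path. The environment variable name `AUTO_SNAP_OUTPUT_DIR`, the default template, and both function names are hypothetical; check the Auto-Snap repository for the real setting names.

```python
import os
from datetime import date
from pathlib import Path
from typing import Mapping


def output_dir_from_env(env: Mapping[str, str]) -> Path:
    """Pick the output folder from a (hypothetical) environment variable."""
    fallback = Path.home() / "auto-snap-captures"
    return Path(env.get("AUTO_SNAP_OUTPUT_DIR", str(fallback)))


def capture_path(base: Path, day: date, index: int, by_date: bool = True,
                 template: str = "capture_{index:03d}.png") -> Path:
    """Build the file path for one capture.

    With by_date=True, files are grouped into a YYYY-MM-DD subfolder.
    The template may use {index} and {date}.
    """
    folder = base / day.isoformat() if by_date else base
    return folder / template.format(index=index, date=day.isoformat())


if __name__ == "__main__":
    print(capture_path(output_dir_from_env(os.environ), date.today(), 1))
```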
🌍 Multiple Languages
Add more OCR languages:
Then tell Claude: "Process this document in Spanish"
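A request such as "Process this document in Spanish" eventually has to become a language setting for the OCR engine. The sketch below assumes that engine is Tesseract, which this README does not confirm; Tesseract identifies languages by three-letter codes and joins several with `+`. The helper name and the small lookup table are illustrative only.

```python
# Three-letter codes as used by Tesseract language packs.
TESSERACT_CODES = {
    "english": "eng",
    "spanish": "spa",
    "french": "fra",
    "german": "deu",
}


def tesseract_lang(*names: str) -> str:
    """Turn language names into a Tesseract language string such as 'eng+spa'."""
    codes = []
    for name in names:
        key = name.strip().lower()
        if key not in TESSERACT_CODES:
            raise ValueError(f"No language code known for {name!r}")
        code = TESSERACT_CODES[key]
        if code not in codes:  # skip duplicates while keeping the given order
            codes.append(code)
    return "+".join(codes)


if __name__ == "__main__":
    print(tesseract_lang("English", "Spanish"))
```

Each language still needs its language pack installed on the system before OCR can use it.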
🔐 Privacy
- Everything runs locally on your machine
- No cloud services or uploads
- Only captures what you ask for
- Automatic cleanup of temp files
Ready to automate your screenshots? Install Auto-Snap and start talking to your MCP client about your documents! 🚀