Skip to main content
Glama

Open-Source MCP servers

Production-ready MCP servers that extend AI capabilities through file access, database connections, API integrations, and other contextual services.

9,472 servers. Last updated -

Matching MCP tools:

Matching MCP servers:

  • A
    security
    A
    license
    A
    quality
    This is a server implementation for performing Optical Character Recognition (OCR) using the Google Cloud Vision API. It is built on top of the FastMCP framework, which allows for the creation of modular and extensible command processing tools.
    Last updated -
    1
    1
    MIT License
    • Apple
  • -
    security
    F
    license
    -
    quality
    Enables integration between MCP clients and the Handwriting OCR service, allowing users to upload images and PDF documents, check processing status, and retrieve OCR results as Markdown.
    Last updated -
    9
    • Apple
    • Linux
  • -
    security
    F
    license
    -
    quality
    OCR images or pdfs, locally or by URLs by using Mistral OCR API (paid)
    Last updated -
    31
    • Linux
  • A
    security
    A
    license
    A
    quality
    Provides image recognition capabilities using Anthropic Claude Vision and OpenAI GPT-4 Vision APIs, supporting multiple image formats and offering optional text extraction via Tesseract OCR.
    Last updated -
    3
    27
    MIT License
    • Linux
    • Apple
  • A
    security
    F
    license
    A
    quality
    A lightweight server that provides detailed text analysis, counting total characters, characters without spaces, letters, numbers, and symbols for AI assistants like Claude Desktop and GitHub Copilot.
    Last updated -
    1
    2
    • Apple
  • A
    security
    A
    license
    A
    quality
    Provides tools for image, audio, and video recognition using Google's Gemini AI through the Model Context Protocol.
    Last updated -
    3
    11
    MIT License
    • Linux
    • Apple

Interested in MCP?

Join the MCP community for support and updates.

RedditDiscord
  • -
    security
    A
    license
    -
    quality
    Enables counting characters or bytes in text with options to include or exclude whitespace. Provides a simple tool for text analysis and length measurement.
    Last updated -
    1
    MIT License
  • -
    security
    A
    license
    -
    quality
    Provides voice recognition and text extraction capabilities with support for both stdio and MCP modes, processing audio files or base64 encoded data and returning structured results with language, emotion, and speaker information.
    Last updated -
    MIT License
  • -
    security
    F
    license
    -
    quality
    An MCP server that implements iterative refinement of responses through self-critique cycles, breaking the process into discrete steps to avoid timeouts and show progress.
    Last updated -
    9
  • -
    security
    F
    license
    -
    quality
    Enables AI assistants to analyze images from URLs or local files using xAI's Grok API. Provides detailed image descriptions, technical metadata extraction, and optical character recognition (OCR) capabilities.
    Last updated -
    2
  • -
    security
    F
    license
    -
    quality
    Enables Claude to read and analyze PDF documents with automatic OCR processing for scanned files. Features intelligent text extraction, caching for performance, and secure file access with search capabilities.
    Last updated -
    • Apple
  • -
    security
    F
    license
    -
    quality
    The fileAI MCP Server offers a robust set of tools to work with the fileAI file processing pipeline. It allows for uploading files, performing Optical Character Recognition (OCR), classifying documents, and extracting structured data. The server leverages the Model Context Protocol (MCP) to provide
    Last updated -
    2
  • -
    security
    A
    license
    -
    quality
    Extracts content from multiple video platforms (Douyin, Bilibili, Xiaohongshu, Zhihu) and generates intelligent knowledge graphs with OCR text recognition capabilities.
    Last updated -
    1
    MIT License
    • Apple
  • -
    security
    F
    license
    -
    quality
    Enables semantic search and contextual conversations with your Calibre ebook library using vector-based RAG technology. Supports project-based organization, multi-format book processing, and OCR capabilities for enhanced content extraction and retrieval.
    Last updated -
    7
  • -
    security
    F
    license
    -
    quality
    A Python MCP server for invoice and receipt processing that uses OCR technology to extract data from PDFs and images, offering AI assistants the ability to process, extract text from, and merge invoice documents.
    Last updated -
    2
    • Apple
  • -
    security
    A
    license
    -
    quality
    Multiple text-type recognition, handwriting recognition, and high-precision parsing of complex documents.
    Last updated -
    54,965
    Apache 2.0
    • Linux
    • Apple
  • -
    security
    A
    license
    -
    quality
    A multi-agent human-computer interaction system that enables natural interaction through integrated visual recognition, speech recognition, and speech synthesis capabilities.
    Last updated -
    19
    Apache 2.0
    • Linux
    • Apple
  • A
    security
    A
    license
    A
    quality
    Provides screenshot and OCR capabilities for macOS.
    Last updated -
    1
    56
    22
    MIT License
    • Apple
  • -
    security
    F
    license
    -
    quality
    A powerful speech-to-text MCP server that supports multiple audio formats and recognition engines including remote APIs (Bailian, OpenAI Whisper, iFLYTEK), Google Speech Recognition, and CMU Sphinx.
    Last updated -
  • -
    security
    F
    license
    -
    quality
    Provides comprehensive text analysis capabilities including character counting, word statistics, character type analysis, and text length validation for Korean and English text. Supports AI agents in analyzing and validating text content with detailed statistics and Unicode support.
    Last updated -
    • Apple
    • Linux
  • -
    security
    A
    license
    -
    quality
    Archive Agent is an open-source semantic file tracker with OCR + AI search (RAG) and MCP capability.
    Last updated -
    50
    GPL 3.0
    • Linux
  • -
    security
    A
    license
    -
    quality
    An MCP server that implements a conversational AI 'waifu' character using a text generation service with Redis queuing and GPU acceleration.
    Last updated -
    1
    MIT No Attribution
  • -
    security
    A
    license
    -
    quality
    A Model Context Protocol server that manages character knowledge and relationships for creative writing projects, offering semantic search and AI-powered analysis.
    Last updated -
    MIT License
  • A
    security
    A
    license
    A
    quality
    A server that generates MP3 audio files from text using Kokoro TTS technology with optional S3 upload capabilities.
    Last updated -
    1
    56
    Apache 2.0
    • Apple
  • -
    security
    F
    license
    -
    quality
    Provides basic text manipulation and analysis tools including word reversal and character counting. Designed for integration with Le Chat and other MCP-compatible clients.
    Last updated -
  • -
    security
    A
    license
    -
    quality
    A Model Context Protocol server that enables natural language queries to MySQL databases, powered by XiYanSQL text-to-SQL technology.
    Last updated -
    207
    Apache 2.0
    • Linux
    • Apple
  • -
    security
    F
    license
    -
    quality
    A MCP Server that gives AI assistants the ability to remember information about users across conversations using vector search technology.
    Last updated -
  • -
    security
    A
    license
    -
    quality
    Enables creation and management of structured game worlds for text adventures and RPGs with character creation, world generation, and natural language interaction through AI integration.
    Last updated -
    MIT License
  • A
    security
    A
    license
    A
    quality
    A server that enables OCR capabilities to recognize text from images, PDFs, and Word documents, convert them to Markdown, and extract key information.
    Last updated -
    3
    55
    23
    MIT License
  • -
    security
    A
    license
    -
    quality
    MCP server that provides computer control capabilities including mouse movements, keyboard actions, screenshot capture with OCR, and window management through a unified API.
    Last updated -
    37
    MIT License
  • -
    security
    A
    license
    -
    quality
    A high-performance server for analyzing GitHub user starred repositories, providing insights into development trends, technology adoption patterns, and timeline tracking.
    Last updated -
    2
    Apache 2.0