Files-DB-MCP: Vector Search for Code Projects

A local vector database system that provides LLM coding agents with fast, efficient search capabilities for software projects via the Model Context Protocol (MCP).

Features

  • Zero Configuration - Auto-detects project structure with sensible defaults

  • Real-Time Monitoring - Continuously watches for file changes

  • Vector Search - Semantic search for finding relevant code

  • MCP Interface - Compatible with Claude Code and other LLM tools

  • Open Source Models - Uses Hugging Face models for code embeddings

Installation

Option 1: Clone and Setup (Recommended)

# Using SSH (recommended if you have SSH keys set up with GitHub)
git clone git@github.com:randomm/files-db-mcp.git ~/.files-db-mcp && bash ~/.files-db-mcp/install/setup.sh

# Using HTTPS (if you don't have SSH keys set up)
git clone https://github.com/randomm/files-db-mcp.git ~/.files-db-mcp && bash ~/.files-db-mcp/install/setup.sh

Option 2: Automated Installation Script

curl -fsSL https://raw.githubusercontent.com/randomm/files-db-mcp/main/install/install.sh | bash
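
If you prefer not to pipe a remote script straight into bash, you can download and review it first:

# Download, inspect, then run the installer
curl -fsSL https://raw.githubusercontent.com/randomm/files-db-mcp/main/install/install.sh -o install.sh
less install.sh
bash install.sh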

Usage

After installation, run in any project directory:

files-db-mcp

The service will:

  1. Detect your project files

  2. Start indexing in the background

  3. Begin responding to MCP search queries immediately
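
You can sanity-check the service from another terminal. A minimal sketch, assuming the containers run via Docker Compose and carry "files-db-mcp" in their names (naming may differ in your setup):

# Start the service in any project directory
cd ~/projects/my-app && files-db-mcp

# In another terminal, confirm the containers are up
docker ps --filter "name=files-db-mcp"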

Requirements

  • Docker

  • Docker Compose
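
Both must be available before installation; you can verify them with:

# Check Docker and the Compose v2 plugin
# (older setups may use the standalone docker-compose binary instead)
docker --version
docker compose version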

Configuration

Files-DB-MCP works without configuration, but you can customize it with environment variables (see the example after this list):

  • EMBEDDING_MODEL - Change the embedding model (default: 'jinaai/jina-embeddings-v2-base-code' or project-specific model)

  • FAST_STARTUP - Set to 'true' to use a smaller model for faster startup (default: 'false')

  • QUANTIZATION - Enable/disable quantization (default: 'true')

  • BINARY_EMBEDDINGS - Enable/disable binary embeddings (default: 'false')

  • IGNORE_PATTERNS - Comma-separated list of files/dirs to ignore
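
For example, the variables can be set inline for a single run. The model name and ignore patterns below are illustrative values, not project defaults:

# Run once with a smaller model, quantization off, and extra ignore patterns
EMBEDDING_MODEL=sentence-transformers/all-MiniLM-L6-v2 \
QUANTIZATION=false \
IGNORE_PATTERNS="node_modules,dist,build,.git" \
files-db-mcp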

First-Time Startup

On first run, Files-DB-MCP downloads its embedding models, which may take several minutes depending on:

  • The size of the selected model (300–500 MB for high-quality models)

  • Your internet connection speed

Subsequent startups will be much faster as models are cached in a persistent Docker volume. For faster initial startup, you can:

# Use a smaller, faster model (~90 MB)
EMBEDDING_MODEL=sentence-transformers/all-MiniLM-L6-v2 files-db-mcp

# Or enable fast startup mode
FAST_STARTUP=true files-db-mcp

Model Caching

Files-DB-MCP automatically persists downloaded embedding models, so you only need to download them once:

  • Models are stored in a Docker volume called model_cache

  • This volume persists between container restarts and across different projects

  • The cache is shared across all projects using Files-DB-MCP on your machine

  • You don't need to download the model again for each project
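
Because the cache is an ordinary Docker volume, you can manage it with standard Docker commands. A sketch, assuming the volume appears under the name model_cache (Docker Compose may prefix it with a project name):

# Confirm the cache volume exists
docker volume ls | grep model_cache

# See where it is stored on disk
docker volume inspect model_cache

# Delete it to force a fresh model download on next startup
# (stop any running Files-DB-MCP containers first)
docker volume rm model_cache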

Claude Code Integration

Add to your Claude Code configuration:

{ "mcpServers": { "files-db-mcp": { "command": "python", "args": ["/path/to/src/claude_mcp_server.py", "--host", "localhost", "--port", "6333"] } } }

For details, see Claude MCP Integration.
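
Alternatively, recent Claude Code releases can register the server from the command line. A sketch, assuming your Claude Code version provides the claude mcp add subcommand, and reusing the path and port from the JSON above:

# Register the MCP server with Claude Code
# (assumes the claude mcp add subcommand is available)
claude mcp add files-db-mcp -- python /path/to/src/claude_mcp_server.py --host localhost --port 6333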

Documentation

Repository Structure

  • /src - Source code

  • /tests - Unit and integration tests

  • /docs - Documentation

  • /scripts - Utility scripts

  • /install - Installation scripts

  • /.docker - Docker configuration

  • /config - Configuration files

  • /ai-assist - AI assistance files

License

MIT License

Contributing

Contributions welcome! Please feel free to submit a pull request.

