This MCP server enables comprehensive management of RunPod cloud infrastructure through the Model Context Protocol, allowing AI assistants like Claude to programmatically interact with the RunPod REST API.
Capabilities:
Pod Management: Create, list, retrieve details, update, start, stop, and delete pods with configurable options including GPU type/count, container images, Docker settings, environment variables, ports, storage volumes, data centers, and container disk sizes. Filter pods by name, GPU type, compute type, and data center.
Serverless Endpoint Management: Create, list, retrieve details, update, and delete serverless endpoints with auto-scaling configurations (min/max workers, scaler type, idle timeout), GPU configuration, data center selection, and template-based deployment.
Template Management: Create, list, retrieve details, update, and delete templates for reusable container configurations with Docker settings, environment variables, volumes, ports, and serverless options.
Network Volume Management: Create, list, retrieve details, update (name and size), and delete network volumes (1-4000 GB) for persistent storage across data centers.
Container Registry Authentication: Create, list, retrieve details, and delete container registry authentication credentials (username and password) for accessing private Docker images.
Node.js: Required runtime environment for the MCP server implementation
Enables management of NVIDIA GPU-powered computing resources through RunPod's platform
Supports deployment of PyTorch environments through RunPod's container infrastructure
Facilitates creation and management of Ubuntu-based development environments on RunPod
Runpod MCP Server
This Model Context Protocol (MCP) server enables you to interact with the Runpod REST API through Claude or other MCP-compatible clients.
Features
The server provides tools for managing:
Pods: Create, list, get details, update, start, stop, and delete pods
Endpoints: Create, list, get details, update, and delete serverless endpoints
Templates: Create, list, get details, update, and delete templates
Network Volumes: Create, list, get details, update, and delete network volumes
Container Registry Authentications: Create, list, get details, and delete authentications
Quick Start
Prerequisites
Node.js 18 or higher
A Runpod account and an API key
Running with npx
You can run the server directly with npx, without installing it first.
Installing via Smithery
Smithery can install the server for Claude Desktop automatically.
Setting up with Claude for Desktop
Open Claude for Desktop
Edit the config file:
macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
Windows: %APPDATA%\Claude\claude_desktop_config.json
Add the server configuration under the mcpServers key
Restart Claude for Desktop
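The configuration step above can be sketched as a small TypeScript helper that builds the entry and prints it as JSON. This is a minimal sketch, not the project's official configuration: the package name is a placeholder to be replaced with the one from the project's install instructions, and the environment variable name `RUNPOD_API_KEY` and the `runpod` server label are assumptions. Only the `mcpServers` / `command` / `args` / `env` layout follows the standard Claude Desktop config format.

```typescript
// Shape of one server entry in claude_desktop_config.json.
interface McpServerEntry {
  command: string;
  args: string[];
  env: Record<string, string>;
}

interface ClaudeDesktopConfig {
  mcpServers: Record<string, McpServerEntry>;
}

// PLACEHOLDER: substitute the package name from the project's install instructions.
const PACKAGE_NAME = "<runpod-mcp-package>";

// ASSUMPTION: the server reads its key from an environment variable with this name.
const API_KEY_ENV_VAR = "RUNPOD_API_KEY";

// Build the config object; "runpod" is an arbitrary label chosen for this sketch.
function buildConfig(
  apiKey: string,
  packageName: string = PACKAGE_NAME
): ClaudeDesktopConfig {
  return {
    mcpServers: {
      runpod: {
        command: "npx",
        args: ["-y", packageName], // -y skips npx's install confirmation prompt
        env: { [API_KEY_ENV_VAR]: apiKey },
      },
    },
  };
}

// Serialize to the JSON text that would be merged into claude_desktop_config.json.
function renderConfig(apiKey: string): string {
  return JSON.stringify(buildConfig(apiKey), null, 2);
}
```

If the config file already lists other servers, merge the `runpod` entry into the existing `mcpServers` object rather than overwriting the file.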
Usage Examples
List all pods
Create a new pod
Create a serverless endpoint
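Each of these operations is ultimately carried out by the server as a call to the Runpod REST API. As a rough illustration of what "list all pods" might look like at the HTTP level, the sketch below builds (but does not send) such a request. The base URL, the `/pods` path, the Bearer authorization scheme, and the filter parameter names are all assumptions made for this example and should be checked against the official API reference; `listPodsRequest` is a hypothetical helper, not part of this server.

```typescript
// ASSUMPTION: base URL of the Runpod REST API; confirm against the API reference.
const BASE_URL = "https://rest.runpod.io/v1";

interface RequestSpec {
  url: string;
  method: "GET";
  headers: Record<string, string>;
}

// Filters named in the capabilities list; the parameter names are hypothetical.
interface PodFilters {
  name?: string;
  gpuTypeId?: string;
  dataCenterId?: string;
}

// Build a description of the HTTP request without performing any network I/O.
function listPodsRequest(apiKey: string, filters: PodFilters = {}): RequestSpec {
  const parts: string[] = [];
  if (filters.name !== undefined) {
    parts.push(`name=${encodeURIComponent(filters.name)}`);
  }
  if (filters.gpuTypeId !== undefined) {
    parts.push(`gpuTypeId=${encodeURIComponent(filters.gpuTypeId)}`);
  }
  if (filters.dataCenterId !== undefined) {
    parts.push(`dataCenterId=${encodeURIComponent(filters.dataCenterId)}`);
  }
  const query = parts.length > 0 ? `?${parts.join("&")}` : "";
  return {
    url: `${BASE_URL}/pods${query}`,
    method: "GET",
    headers: { Authorization: `Bearer ${apiKey}` },
  };
}
```

On Node.js 18 or higher the resulting spec can be passed to the built-in `fetch`, for example `fetch(spec.url, { method: spec.method, headers: spec.headers })`. Keeping request construction separate from sending makes it easy to inspect what would be sent before anything touches your account.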
Security Considerations
This server requires your Runpod API key, which grants full access to your Runpod account. For security:
Never share your API key
Be cautious about the operations you allow, especially destructive ones such as deleting pods, endpoints, or network volumes
Consider setting up a separate API key with limited permissions
Don't use this in a production environment without proper security measures
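One way to follow the first point in practice is to keep the key out of source files and read it from the environment at startup. The helpers below are a minimal sketch of that idea; the variable name `RUNPOD_API_KEY` is an assumption, and `requireApiKey` and `maskKey` are hypothetical names introduced for this example rather than functions provided by the server.

```typescript
// ASSUMPTION: illustrative variable name; use whatever name the server expects.
const KEY_VAR = "RUNPOD_API_KEY";

// Minimal environment shape, compatible with Node's process.env.
type Env = Record<string, string | undefined>;

// Return the API key from the environment, failing fast if it is missing or blank.
function requireApiKey(env: Env, varName: string = KEY_VAR): string {
  const value = env[varName];
  if (value === undefined || value.trim() === "") {
    throw new Error(`${varName} is not set; define it in your shell or MCP client config`);
  }
  return value.trim();
}

// Mask a key for log output so the full secret never appears in logs.
// Only the last four characters are kept; short keys are hidden entirely.
function maskKey(key: string): string {
  if (key.length <= 4) {
    return "****";
  }
  return "****" + key.slice(-4);
}
```

In a Node.js program this would typically be called as `requireApiKey(process.env)`, with `maskKey` applied before the key is written to any log or error message.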
License
Apache-2.0