This server provides AI-powered image generation and editing using Google's Gemini models with intelligent model selection between Gemini 2.5 Flash (fast, up to 1024px) and Gemini 3 Pro (high-quality, up to 4K).
Core Capabilities:
Generate new images from text prompts with automatic or manual model selection
Edit existing images using natural language instructions via file paths or Files API IDs
Multi-image conditioning - provide up to 3 input images to guide generation
Batch generation - create multiple images at once (up to 4 per request)
Aspect ratio control - specify output dimensions (1:1, 16:9, 9:16, 21:9, and more)
Negative prompts and custom system instructions for precise control
Advanced Pro Features (Gemini 3 Pro):
4K resolution output for professional-grade images
Google Search grounding for factually accurate images
Configurable thinking levels (LOW/HIGH) for complex compositions
Superior text rendering and media resolution control
File Management:
Upload files to Gemini Files API for large images (>20MB) or reusable assets
Track file metadata including URIs, expiration times, and display names
View output statistics and directory information
Maintenance & Cleanup:
Remove expired Files API entries and clean old local files
Check storage quota against the ~20GB Files API budget
Database hygiene and full cleanup operations with dry-run support
Output: Returns MCP image content blocks for direct client display with structured JSON metadata, local file storage, and smart templates for photography, design, and editing workflows.
Provides AI-powered image generation capabilities through Google's Gemini 2.5 Flash Image model, including file management via the Gemini Files API and smart prompt templates for photography, design, and editing
Nano Banana MCP Server 🍌
A production-ready Model Context Protocol (MCP) server that provides AI-powered image generation capabilities through Google's Gemini models with intelligent model selection.
⭐ NEW: Gemini 3 Pro Image Support! 🚀
Now featuring Nano Banana Pro - Google's latest and most powerful image generation model:
🏆 Professional 4K Quality: Generate stunning images up to 3840px resolution
🌐 Google Search Grounding: Access real-world knowledge for factually accurate images
🧠 Advanced Reasoning: Configurable thinking levels for complex compositions
🎯 Superior Text Rendering: Crystal-clear text in images at high resolution
🎨 Enhanced Understanding: Better context comprehension for complex prompts
Related MCP server: MCP Gemini API Server
✨ Features
🎨 Multi-Model AI Image Generation: Intelligent selection between Flash (speed) and Pro (quality) models
⚡ Gemini 2.5 Flash Image: Fast generation (1024px) for rapid prototyping
🏆 Gemini 3 Pro Image: High-quality up to 4K with Google Search grounding
🤖 Smart Model Selection: Automatically chooses optimal model based on your prompt
📐 Aspect Ratio Control ⭐ NEW: Specify output dimensions (1:1, 16:9, 9:16, 21:9, and more)
📋 Smart Templates: Pre-built prompt templates for photography, design, and editing
📁 File Management: Upload and manage files via Gemini Files API
🔍 Resource Discovery: Browse templates and file metadata through MCP resources
🛡️ Production Ready: Comprehensive error handling, logging, and validation
⚡ High Performance: Optimized architecture with intelligent caching
🚀 Quick Start
Prerequisites
Google Gemini API Key - Get one free here
Python 3.11+ (for development only)
Installation
Option 1: From MCP Registry (Recommended) This server is available in the Model Context Protocol Registry. Search for "nanobanana" or use the MCP name below with your MCP client.
mcp-name: io.github.zhongweili/nanobanana-mcp-server
Option 2: Using uvx
Option 3: Using pip
🔧 Configuration
Claude Desktop
Add to your claude_desktop_config.json:
Configuration file locations:
macOS:
~/Library/Application Support/Claude/claude_desktop_config.jsonWindows:
%APPDATA%\Claude\claude_desktop_config.json
Claude Code (VS Code Extension)
Install and configure in VS Code:
Install the Claude Code extension
Open Command Palette (
Cmd/Ctrl + Shift + P)Run "Claude Code: Add MCP Server"
Configure:
{ "name": "nanobanana", "command": "uvx", "args": ["nanobanana-mcp-server@latest"], "env": { "GEMINI_API_KEY": "your-gemini-api-key-here" } }
Cursor
Add to Cursor's MCP configuration:
Continue.dev (VS Code/JetBrains)
Add to your config.json:
Open WebUI
Configure in Open WebUI settings:
Gemini CLI / Generic MCP Client
🤖 Model Selection
Nano Banana supports two Gemini models with intelligent automatic selection:
🏆 Pro Model - Nano Banana Pro (Gemini 3 Pro Image) ⭐ NEW!
Google's latest and most advanced image generation model
Quality: Professional-grade, production-ready
Resolution: Up to 4K (3840px) - highest available
Speed: ~5-8 seconds per image
Special Features:
🌐 Google Search Grounding: Leverages real-world knowledge for accurate, contextual images
🧠 Advanced Reasoning: Configurable thinking levels (LOW/HIGH) for complex compositions
📐 Media Resolution Control: Fine-tune vision processing detail (LOW/MEDIUM/HIGH/AUTO)
📝 Superior Text Rendering: Exceptional clarity for text-in-image generation
🎨 Enhanced Context Understanding: Better interpretation of complex, narrative prompts
Best for: Production assets, marketing materials, professional photography, high-fidelity outputs, images requiring text, factual accuracy
Cost: Higher per image (premium quality)
⚡ Flash Model (Gemini 2.5 Flash Image)
Fast, reliable model for rapid iteration
Speed: Very fast (2-3 seconds)
Resolution: Up to 1024px
Quality: High quality for everyday use
Best for: Rapid prototyping, iterations, high-volume generation, drafts, sketches
Cost: Lower per image
🤖 Automatic Selection (Recommended)
By default, the server uses AUTO mode which intelligently analyzes your prompt and requirements:
Pro Model Selected When:
Quality keywords detected: "4K", "professional", "production", "high-res", "HD"
High resolution requested:
resolution="4k"orresolution="high"Google Search grounding enabled:
enable_grounding=TrueHigh thinking level requested:
thinking_level="HIGH"Multi-image conditioning with multiple input images
Flash Model Selected When:
Speed keywords detected: "quick", "draft", "sketch", "rapid"
High-volume batch generation:
n > 2Standard or lower resolution requested
No special Pro features required
Usage Examples
📐 Aspect Ratio Control ⭐ NEW!
Control the output image dimensions with the aspect_ratio parameter:
Supported Aspect Ratios:
1:1- Square (Instagram, profile pictures)4:3- Classic photo format3:4- Portrait orientation16:9- Widescreen (YouTube thumbnails, presentations)9:16- Mobile portrait (phone wallpapers, stories)21:9- Ultra-wide cinematic2:3,3:2,4:5,5:4- Various photo formats
Note: Aspect ratio works with both Flash and Pro models. For best results with specific aspect ratios at high resolution, use the Pro model with resolution="4k".
⚙️ Environment Variables
Configuration options:
🐛 Troubleshooting
Common Issues
"GEMINI_API_KEY not set"
Add your API key to the MCP server configuration in your client
Get a free API key at Google AI Studio
"Server failed to start"
Ensure you're using the latest version:
uvx nanobanana-mcp-server@latestCheck that your client supports MCP (Claude Desktop 0.10.0+)
"Permission denied" errors
The server creates images in
~/nanobanana-imagesby defaultEnsure write permissions to your home directory
Development Setup
For local development:
📄 License
MIT License - see LICENSE for details.
🆘 Support
Issues: GitHub Issues
Discussions: GitHub Discussions