Provides OpenAI-compatible text generation models for simple and advanced text generation with system prompts through the Pollinations API
🚀 MCPO - MCP Pollinations Proxy
A Docker-containerized MCP (Model Context Protocol) proxy that combines mcpo CLI tool with Pollinations MCP server, providing AI image, text, audio, and vision generation capabilities through standard REST endpoints.
🌟 Features
🎨 Multimodal AI Capabilities
Image Generation: Create stunning images from text prompts with 1024x1024 default resolution
Image-to-Image: Transform existing images using text descriptions
Vision Analysis: Analyze, describe, compare images and extract text (OCR)
Text Generation: Simple and advanced text generation with system prompts
Text-to-Speech: Convert text to speech with multiple voice options
Audio Generation: Create contextual audio responses
🔧 Technical Features
OpenAPI REST Endpoints: Standard HTTP/REST interface for all MCP capabilities
Docker Containerized: Easy deployment and consistent environment
Real-time Processing: Direct API integration with Pollinations services
Multiple Model Support: Access various AI models for different tasks
🚀 Quick Start
Prerequisites
Docker and Docker Compose
Port 7777 available
Installation & Usage
Clone the repository
Build and run the container
Access the service
Service runs on:
http://localhost:7777
OpenAPI docs:
http://localhost:7777/docs
API endpoints:
http://localhost:7777/api/...
Development Commands
🎯 API Endpoints
The service exposes Pollinations MCP server functionality through REST endpoints:
🖼️ Image Generation
POST /api/generateImage
- Generate image from text promptPOST /api/generateImageUrl
- Get image generation URLPOST /api/generateImageToImage
- Transform image with text promptGET /api/listImageModels
- List available image models
📝 Text Generation
POST /api/generateText
- Simple text generationPOST /api/generateAdvancedText
- Advanced text with system promptsGET /api/listTextModels
- List available text models
👁️ Vision & Analysis
POST /api/analyzeImageFromUrl
- Analyze image from URLPOST /api/analyzeImageFromData
- Analyze base64 image dataPOST /api/compareImages
- Compare two imagesPOST /api/extractTextFromImage
- OCR text extraction
🎵 Audio Generation
POST /api/sayText
- Text-to-speech conversionPOST /api/respondAudio
- Generate contextual audio responsesGET /api/listAudioVoices
- List available voices
🏗️ Architecture
Container Stack
Base: Node.js 18 Alpine Linux
Python: Installed for mcpo CLI tool
Port: 7777 exposed for HTTP access
Host: Configured to bind to 0.0.0.0
Service Flow
Container starts with
mcpo
CLI toolmcpo
proxies thepollinations-model-context-protocol
MCP serverMCP server capabilities become available via OpenAPI endpoints
External applications use standard HTTP/REST calls
📁 Project Structure
🔧 Configuration
Default Settings
Image Resolution: 1024x1024 pixels
Image Quality: Private=true, NoLogo=true, Enhance=true
Text Generation: OpenAI-compatible models
Audio Format: MP3 with Alloy voice
Vision Models: GPT-4o for image analysis
Environment Variables
The container automatically configures the MCP proxy without additional environment variables needed.
🎨 Usage Examples
Image Generation
Vision Analysis
Text-to-Speech
🤝 Contributing
Fork the repository
Create a feature branch (
git checkout -b feature/amazing-feature
)Commit your changes (
git commit -m 'Add amazing feature'
)Push to the branch (
git push origin feature/amazing-feature
)Open a Pull Request
📄 License
This project is licensed under the MIT License - see the LICENSE file for details.
🙏 Acknowledgments
Pollinations.AI for the amazing AI APIs
Model Context Protocol for the MCP standard
mcpo CLI tool for MCP to OpenAPI conversion
🔗 Links
Built with ❤️ using Docker, Node.js, and Python
This server cannot be installed
remote-capable server
The server can be hosted and run remotely because it primarily relies on remote services or has no dependency on the local environment.
A Docker-containerized MCP proxy that provides AI image generation, text generation, vision analysis, and text-to-speech capabilities through REST endpoints using Pollinations AI services. Enables multimodal AI interactions including image creation, transformation, OCR, and audio generation through standard HTTP APIs.