Enables browser automation with Databricks language models for web-based tasks via API.
Allows loading environment variables from .env files for configuring API keys and other sensitive information.
Enables interaction with GitHub through browser automation, including searching for repositories and accessing project information.
Provides integration with Google's language models for web browsing and automation tasks.
Supports using Hugging Face models for browser automation and web interaction tasks.
Integrates with LangChain components for creating browser automation agents that can execute web-based tasks.
Enables using NVIDIA's language models for browser automation through the MCP server.
Allows using Ollama's local language models for browser automation and web interaction.
Integrates with OpenAI's language models to enable browser automation and web interaction through natural language commands.
Provides Python client support for utilizing the browser automation capabilities of the MCP server.
Browser Use MCP Server
A FastMCP server that enables browser automation through natural language commands. This server allows Language Models to browse the web, fill out forms, click buttons, and perform other web-based tasks via a simple API.
Quick Start
1. Install the package
Install with a specific provider (e.g., OpenAI)
Or install all providers
Install Playwright browsers
2. Configure your MCP client
Add the browser-use-mcp server to your MCP client configuration:
Replace `"your-openai-api-key"` with your actual API key, or use an environment variable reference such as `process.env.OPENAI_API_KEY`.
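The configuration block itself did not survive in this copy of the README. As a rough sketch only, the snippet below builds an entry in the `mcpServers` layout that many MCP clients read, and takes the API key from the environment rather than hard-coding it. The executable name `browser-use-mcp`, the model name `gpt-4o`, and the helper `build_client_config` are assumptions for illustration; only the `--model` flag and `OPENAI_API_KEY` appear in this document.

```python
import json
import os


def build_client_config(api_key: str, model: str = "gpt-4o") -> dict:
    """Return an MCP client config dict with one browser-use-mcp entry.

    Assumptions (not confirmed by this README): the server is started with a
    `browser-use-mcp` command and the client reads the `mcpServers` layout.
    """
    return {
        "mcpServers": {
            "browser-use-mcp": {
                "command": "browser-use-mcp",  # assumed executable name
                "args": ["--model", model],    # --model is mentioned under Troubleshooting
                "env": {"OPENAI_API_KEY": api_key},
            }
        }
    }


if __name__ == "__main__":
    # Read the key from the environment; fall back to the placeholder from the text.
    key = os.environ.get("OPENAI_API_KEY", "your-openai-api-key")
    print(json.dumps(build_client_config(key), indent=2))
```

Generating the JSON this way keeps the real key out of any file you might commit; paste the printed output into your client's configuration only on the machine that runs it.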
3. Use it with your favorite MCP client
Example using mcp-use with Python
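The original example is missing from this copy, so what follows is a hedged reconstruction rather than the author's code. It assumes the `mcp-use` and `langchain-openai` packages are installed, that the server is launched with a `browser-use-mcp` command, and that `gpt-4o` is an acceptable model name; `run_task` is a hypothetical helper name.

```python
import asyncio
import os

# Hypothetical server entry; the command name and model are assumptions.
CONFIG = {
    "mcpServers": {
        "browser-use-mcp": {
            "command": "browser-use-mcp",
            "args": ["--model", "gpt-4o"],
            # Pass the key explicitly so the server subprocess can see it.
            "env": {"OPENAI_API_KEY": os.environ.get("OPENAI_API_KEY", "")},
        }
    }
}


async def run_task(prompt: str, config: dict = CONFIG) -> str:
    # Imported lazily so this module loads even when the optional
    # third-party packages (mcp-use, langchain-openai) are not installed.
    from langchain_openai import ChatOpenAI
    from mcp_use import MCPAgent, MCPClient

    client = MCPClient.from_dict(config)
    agent = MCPAgent(llm=ChatOpenAI(model="gpt-4o"), client=client, max_steps=30)
    try:
        return await agent.run(prompt)
    finally:
        # Shut down the server subprocess started for this client.
        await client.close_all_sessions()


if __name__ == "__main__":
    print(asyncio.run(run_task("Open github.com and search for browser automation repositories")))
```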
Using Claude for Desktop
- Open Claude for Desktop
- Go to Settings → Experimental features
- Enable Claude API Beta and OpenAPI schema for API
- Add the following configuration to your Claude Desktop config file:
  - Mac: `~/Library/Application Support/Claude/claude_desktop_config.json`
  - Windows: `%AppData%\Claude\claude_desktop_config.json`
- Start a new conversation with Claude and ask it to perform web tasks
Supported LLM Providers
The following LLM providers are supported for browser automation:
Provider | API Key Environment Variable |
---|---|
OpenAI | OPENAI_API_KEY |
Anthropic | ANTHROPIC_API_KEY |
Google | GOOGLE_API_KEY |
Cohere | COHERE_API_KEY |
Mistral AI | MISTRAL_API_KEY |
Groq | GROQ_API_KEY |
Together AI | TOGETHER_API_KEY |
AWS Bedrock | AWS_ACCESS_KEY_ID and AWS_SECRET_ACCESS_KEY |
Fireworks | FIREWORKS_API_KEY |
Azure OpenAI | AZURE_OPENAI_API_KEY and AZURE_OPENAI_ENDPOINT |
Vertex AI | GOOGLE_APPLICATION_CREDENTIALS |
NVIDIA | NVIDIA_API_KEY |
AI21 | AI21_API_KEY |
Databricks | DATABRICKS_HOST and DATABRICKS_TOKEN |
IBM watsonx.ai | WATSONX_API_KEY |
xAI | XAI_API_KEY |
Upstage | UPSTAGE_API_KEY |
Hugging Face | HUGGINGFACE_API_KEY |
Ollama | OLLAMA_BASE_URL |
Llama.cpp | LLAMA_CPP_SERVER_URL |
For more information check out: https://python.langchain.com/docs/integrations/chat/
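Because several providers need more than one variable, it can help to check the environment before launching the server. The sketch below copies a subset of the table into a dictionary and reports what is missing; `REQUIRED_ENV` and `missing_env` are illustrative names, not part of the server.

```python
import os
from typing import Mapping

# Copied from the provider table above (subset shown for brevity).
REQUIRED_ENV = {
    "OpenAI": ["OPENAI_API_KEY"],
    "Anthropic": ["ANTHROPIC_API_KEY"],
    "AWS Bedrock": ["AWS_ACCESS_KEY_ID", "AWS_SECRET_ACCESS_KEY"],
    "Azure OpenAI": ["AZURE_OPENAI_API_KEY", "AZURE_OPENAI_ENDPOINT"],
    "Databricks": ["DATABRICKS_HOST", "DATABRICKS_TOKEN"],
    "Ollama": ["OLLAMA_BASE_URL"],
}


def missing_env(provider: str, env: Mapping[str, str] = os.environ) -> list[str]:
    """Return the table's variables that are unset or empty for `provider`."""
    return [name for name in REQUIRED_ENV[provider] if not env.get(name)]


if __name__ == "__main__":
    for provider in REQUIRED_ENV:
        gaps = missing_env(provider)
        print(f"{provider}: {'ok' if not gaps else 'missing ' + ', '.join(gaps)}")
```

For example, `missing_env("Azure OpenAI", {"AZURE_OPENAI_API_KEY": "x"})` returns `['AZURE_OPENAI_ENDPOINT']`, since the endpoint is required alongside the key.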
You can create a `.env` file in the project directory with your API keys:
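The sample file is missing from this copy. As an illustration of the usual `KEY=VALUE` format, the sketch below embeds a placeholder `.env` as a string and parses it with the standard library only. The values are placeholders and `parse_env` is a hypothetical helper; in practice a library such as `python-dotenv` typically does this loading.

```python
import os

SAMPLE_ENV = """\
# Placeholder values; use the variable names from the provider table.
OPENAI_API_KEY=your-openai-api-key
ANTHROPIC_API_KEY=your-anthropic-api-key
"""


def parse_env(text: str) -> dict[str, str]:
    """Parse simple KEY=VALUE lines, skipping blank lines and # comments."""
    values = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop optional surrounding quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


if __name__ == "__main__":
    for key, value in parse_env(SAMPLE_ENV).items():
        os.environ.setdefault(key, value)  # do not override variables already set
```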
Troubleshooting
- API Key Issues: Ensure your API key is correctly set in your environment variables or `.env` file.
- Provider Not Found: Make sure you've installed the required provider package.
- Browser Automation Errors: Check that Playwright is correctly installed with `playwright install chromium`.
- Model Selection: If you get errors about an invalid model, try using the `--model` flag to specify a valid model for your provider.
- Debug Mode: Use `--debug` to enable more detailed logging that can help identify issues.
- MCP Client Configuration: Make sure your MCP client is correctly configured with the right command and environment variables.
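Two of the checks above can be automated before the server starts. This is a minimal sketch, assuming the server is started with a `browser-use-mcp` executable and that Playwright is installed as the `playwright` Python package; `preflight` and its injectable lookups are hypothetical and exist mainly to make the function easy to test.

```python
import importlib.util
import shutil
from typing import Callable, Optional


def preflight(
    command: str = "browser-use-mcp",  # assumed executable name
    which: Callable[[str], Optional[str]] = shutil.which,
    find_spec: Callable[[str], object] = importlib.util.find_spec,
) -> list[str]:
    """Return human-readable problems found before starting the server."""
    problems = []
    if which(command) is None:
        problems.append(f"'{command}' is not on PATH; check the MCP client command")
    if find_spec("playwright") is None:
        problems.append(
            "Playwright is not importable; install it, then run 'playwright install chromium'"
        )
    return problems


if __name__ == "__main__":
    for problem in preflight() or ["no problems found"]:
        print(problem)
```

An empty list only means the command and package were found; it does not verify that the Chromium browser binaries have been downloaded.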
License
MIT