analyze_screenshot
Analyzes webpage screenshots using AI (Gemma3) to describe visible content, identify elements, and provide structural insights. Capture full-page or specific sections for detailed examination.
Instructions
Take a screenshot and analyze it with AI (Gemma3) to describe what is visible on the page
Input Schema
Name | Required | Description | Default |
---|---|---|---|
detailed | No | Provide detailed structural analysis of the page | |
fullPage | No | Capture full scrollable page | |
model | No | AI model to use for analysis (default: gemma3:4b) | gemma3:4b |
path | No | Path to save screenshot (optional) | |
pretext | No | Optional context or specific instructions for what to look for in the analysis |
Input Schema (JSON Schema)
{
"properties": {
"detailed": {
"default": false,
"description": "Provide detailed structural analysis of the page",
"type": "boolean"
},
"fullPage": {
"default": false,
"description": "Capture full scrollable page",
"type": "boolean"
},
"model": {
"default": "gemma3:4b",
"description": "AI model to use for analysis (default: gemma3:4b)",
"type": "string"
},
"path": {
"description": "Path to save screenshot (optional)",
"type": "string"
},
"pretext": {
"description": "Optional context or specific instructions for what to look for in the analysis",
"type": "string"
}
},
"type": "object"
}