parse
Extract webpage content into clean, LLM-optimized Markdown by removing ads, navigation, and non-essential elements. Retrieve article title, main content, excerpt, byline, and site name using Mozilla's Readability algorithm.
Instructions
Extracts and transforms webpage content into clean, LLM-optimized Markdown. Returns article title, main content, excerpt, byline and site name. Uses Mozilla's Readability algorithm to remove ads, navigation, footers and non-essential elements while preserving the core content structure.
Input Schema
Name | Required | Description | Default |
---|---|---|---|
url | Yes | The website URL to parse |
Input Schema (JSON Schema)
{
"properties": {
"url": {
"description": "The website URL to parse",
"type": "string"
}
},
"required": [
"url"
],
"type": "object"
}