Skip to main content
Glama
by 8b-is
AI_OPTIMIZATION.md5.07 kB
# AI Optimization: Why Smart Tree Uses Hex, Compression, and Special Formats ## The Token Economy Problem 🪙 When AI assistants analyze directory structures, every character counts. Traditional tree outputs are human-friendly but wasteful for AI consumption. Smart Tree solves this with revolutionary optimization techniques. ## 🔢 Hex Format: Maximum Information Density ### Traditional Output (Wasteful) ``` 📁 my-project (2,358,272 bytes, modified 2024-12-19 15:30:00) ├── 📄 README.md (2,400 bytes, modified 2024-12-19 14:00:00) └── 📁 src (15 files, 1,024,000 bytes) └── 📄 main.rs (5,600 bytes, modified 2024-12-19 10:00:00) ``` **Character count: 201** ### Hex Format (Optimized) ``` 0 1fd 03e8 03e8 00240000 6853f4c0 d my-project 1 1b4 03e8 03e8 00000960 6853e980 f README.md 1 1fd 03e8 03e8 000fa000 6853f4c0 d src 2 1b4 03e8 03e8 000015e0 6853d480 f main.rs ``` **Character count: 134** (33% reduction!) ### Why Hex? - **Fixed-width fields**: Easy parsing, no delimiters needed - **Compact numbers**: Permissions (1fd), sizes (00240000), timestamps (6853f4c0) - **No fluff**: No icons, commas, or human formatting - **Predictable**: AI can learn the pattern instantly ## 🗜️ Compression: 10x Reduction ### Before Compression ``` Full directory tree: 500KB of text Tokens used: ~125,000 ``` ### After Compression (zlib) ``` Compressed output: 50KB Tokens used: ~12,500 (90% reduction!) ``` ### Smart Compression Features - Automatic with `AI_TOOLS=1` environment variable - Works with all output formats - Transparent to the AI (automatic decompression) - Preserves structure perfectly ## 🏷️ AI Tags: Context Without Confusion ### Special Markers ``` TREE_HEX_V1: # Version identifier CONTEXT: Rust: my-project # Project type detection HASH: 9b3b00cbcc1e8503 # Consistency verification END_AI # Clear boundaries ``` ### Why Tags Matter - **Version control**: AI knows which format to expect - **Context clues**: Project type helps AI understand structure - **Consistency**: Hash verifies nothing was truncated - **Clear boundaries**: No ambiguity about where data ends ## 📊 Real-World Impact ### Analyzing a Large Codebase #### Traditional Tree Output - Size: 2.5MB - Tokens: ~625,000 - Cost: $3.13 (GPT-4) - Time: 45 seconds #### Smart Tree AI Mode - Size: 250KB (compressed) - Tokens: ~62,500 - Cost: $0.31 (90% savings!) - Time: 5 seconds ## 🎯 Digest Mode: The Ultimate Optimization For quick directory fingerprinting: ``` HASH: 9b3b00cbcc1e8503 F:45 D:12 S:23fc00 TYPES: rs:35 toml:3 md:2 ``` One line tells you: - Directory fingerprint (HASH) - File count (F:45) - Directory count (D:12) - Total size in hex (S:23fc00) - File type distribution **Perfect for**: - Cache validation - Change detection - Quick summaries - Pre-analysis checks ## 🚀 Implementation in Your DXT ### Enable AI Optimizations ```javascript // In your manifest.json "env": { "AI_TOOLS": "1", // Auto-enable AI mode "DEFAULT_COMPRESSION": "1" // Always compress } // In your tool if (process.env.AI_TOOLS) { output = optimizeForAI(output); output = compress(output); } ``` ### Best Practices 1. **Use fixed-width formats** when possible 2. **Prefer hex** for numbers (more compact) 3. **Add version tags** to your output 4. **Include hashes** for consistency 5. **Compress by default** for AI consumers ## 📈 Benchmarks | Format | Size | Tokens | Relative Cost | |--------|------|--------|---------------| | Classic Tree | 1.2MB | 300K | 100% | | JSON | 2.1MB | 525K | 175% | | **Hex Mode** | 800KB | 200K | 67% | | **AI Mode + Compression** | 120KB | 30K | 10% | | **Digest** | 128B | 32 | 0.01% | ## 🎓 Lessons for DXT Developers 1. **Every character is money**: Optimize ruthlessly 2. **Structure > Readability**: AI doesn't need pretty output 3. **Compression is free performance**: Use it! 4. **Fixed formats parse faster**: Predictability helps AI 5. **Metadata matters**: Include type hints and context ## Example: Implementing AI Mode ```javascript // Bad: Human-friendly but token-heavy function formatSize(bytes) { return `${(bytes / 1024 / 1024).toFixed(2)} MB`; // "12.34 MB" } // Good: Compact and parseable function formatSizeHex(bytes) { return bytes.toString(16).padStart(8, '0'); // "00bc5a40" } // Best: With context marker function formatSizeAI(bytes) { return `S:${bytes.toString(16)}`; // "S:bc5a40" } ``` ## 🌟 The Smart Tree Advantage Smart Tree pioneered these optimizations: - **First** to use hex format for trees - **First** to integrate compression transparently - **First** to add AI-specific markers - **First** to provide digest mode When building your DXT, consider: - How can you reduce token usage? - What information is truly necessary? - Can you provide multiple formats? - Would compression help your users? --- *"In the economy of tokens, every byte saved is a dollar earned."* *- Trisha from Accounting (who really appreciates the cost savings!) 💰✨*

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/8b-is/smart-tree'

If you have feedback or need assistance with the MCP directory API, please join our Discord server