get_inference_usage
Retrieve usage statistics for serverless inference subscriptions, including token counts for chat models, character usage for text-to-speech, monthly allotment details, and overage information.
Instructions
Get usage statistics for a serverless inference subscription.
Args: subscription_id: The inference subscription ID or label
Returns: Detailed usage information including: - chat: Token usage for chat/completion models - audio: Character usage for text-to-speech models - monthly_allotment: Total tokens/characters allocated - overage: Usage exceeding the monthly limit
Input Schema
Name | Required | Description | Default |
---|---|---|---|
subscription_id | Yes |