Checking Usage

View your current token usage in the Geoff Dashboard.

Rate Limits

Rate limits are applied per API key and vary by plan:
PlanRequests/minRequests/dayConcurrent
Free105002
Pro10010,00010
EnterpriseCustomCustomCustom

Rate Limit Headers

Every API response includes rate limit information:
X-RateLimit-Limit: 100
X-RateLimit-Remaining: 95
X-RateLimit-Reset: 1620000000

Handling Rate Limits

When you exceed your rate limit, the API returns a 429 Too Many Requests response. Implement exponential backoff in your application:
import time
import requests

def make_request_with_retry(url, headers, data, max_retries=3):
    for attempt in range(max_retries):
        response = requests.post(url, headers=headers, json=data)
        if response.status_code == 429:
            wait_time = 2 ** attempt
            time.sleep(wait_time)
            continue
        return response
    raise Exception("Max retries exceeded")