Usage Analytics

Track and analyze your Aize Platform usage with comprehensive metrics and insights.

Overview

The Usage & Analytics page provides detailed insights into:

API request volumes
Token consumption
Cost breakdown
Model usage patterns
Performance metrics
Team activity (for Admins/Owners)

Key Metrics

Total Requests

What it shows:

Lifetime count of all API calls
Includes successful and failed requests
Updates in real time

Useful for:

Understanding overall usage volume
Capacity planning
Rate limit monitoring
Trend analysis

View breakdown by:

Time period (daily, weekly, monthly)
Model
API key
Team member (Admins/Owners)
Status code (success/error)

Total Tokens

What it shows:

Combined input + output tokens
Lifetime aggregation
Separate input/output breakdown available

Token types:

Input tokens: Text you send in prompts
Output tokens: Generated responses
Total tokens: Sum of both

Why it matters:

Output tokens typically cost 2-3x as much as input tokens, depending on the model
Helps optimize prompt engineering
Indicates data volume processed

Conversion guide:

1 token ≈ 4 characters of English text
1 token ≈ 0.75 words
100 tokens ≈ 75 words
1,000 tokens ≈ 750 words (roughly 1-2 pages)
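
The rules of thumb above can be turned into a quick estimator for budgeting prompts before you send them. This is a rough sketch only: the function names are illustrative, and real token counts depend on the model's tokenizer, so treat the results as approximations.

```python
import math

CHARS_PER_TOKEN = 4      # rule of thumb from the conversion guide
WORDS_PER_TOKEN = 0.75   # rule of thumb from the conversion guide


def estimate_tokens_from_chars(text: str) -> int:
    """Approximate token count from character length (about 4 characters per token)."""
    return math.ceil(len(text) / CHARS_PER_TOKEN)


def estimate_tokens_from_words(word_count: int) -> int:
    """Approximate token count from a word count (about 0.75 words per token)."""
    return math.ceil(word_count / WORDS_PER_TOKEN)


if __name__ == "__main__":
    print(estimate_tokens_from_words(750))         # about 1,000 tokens
    print(estimate_tokens_from_chars("a" * 400))   # about 100 tokens
```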

Total Cost

What it shows:

Lifetime spending in USD
Sum of all usage deductions
Excludes deposits and refunds

Cost components:

Request Cost = (Base Model Cost × Tokens) × (1 + Markup) + Fixed Fee
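
As a worked sketch of this formula, the snippet below treats the markup as a fractional rate added on top of the provider cost, which is the reading that matches the $0.10 / 25% / $0.01 example in Understanding Costs. The function name and the per-token price are illustrative assumptions, chosen only to reproduce that example, not actual platform rates.

```python
from decimal import Decimal


def request_cost(base_cost_per_token: Decimal, tokens: int,
                 markup_rate: Decimal, fixed_fee: Decimal) -> Decimal:
    """Customer cost for one request.

    Assumes markup_rate is a fraction (0.25 means 25%) applied on top of
    the provider cost, and fixed_fee is a flat per-request amount.
    """
    provider_cost = base_cost_per_token * tokens
    return provider_cost * (Decimal("1") + markup_rate) + fixed_fee


if __name__ == "__main__":
    # Hypothetical inputs: 1,000 tokens at $0.0001 each gives a $0.10 provider cost.
    cost = request_cost(Decimal("0.0001"), 1000, Decimal("0.25"), Decimal("0.01"))
    print(f"Customer cost: ${cost:.3f}")
```

Decimal is used instead of float so that small per-request amounts add up without binary rounding drift.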

View breakdown:

Cost per model
Cost per API key
Cost per team member
Cost trends over time

Current Spend

What it shows:

Current billing period spending
Resets based on organization tier
Progress towards any budget limits set

Color coding:

Green: Under 75% of budget
Yellow: 75-90% of budget
Red: Over 90% of budget
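
The thresholds above can be mirrored in your own monitoring scripts. A minimal sketch, assuming the 90% boundary itself counts as yellow (the list says "75-90%" and "over 90%"); the function name is illustrative.

```python
def budget_status(current_spend: float, budget_limit: float) -> str:
    """Map current spend to the dashboard colors described above."""
    if budget_limit <= 0:
        raise ValueError("budget_limit must be positive")
    used_percent = current_spend * 100 / budget_limit
    if used_percent < 75:
        return "green"
    if used_percent <= 90:
        return "yellow"
    return "red"


if __name__ == "__main__":
    for spend in (50, 80, 95):
        print(spend, budget_status(spend, 100))
```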

Request Logs

Detailed view of every API call made to the gateway.

Log Record Information

Each request record includes:

Request Identification:

Request ID: Unique UUID for tracing
Timestamp: Exact date and time
API Key: Which key was used (label shown)
User: Member who made the request (team keys only)

Model Information:

Model Alias: What you requested (e.g., "gpt-4")
Provider Model: Actual model used (e.g., "gpt-4-0613")
Endpoint: API endpoint called (/v1/chat/completions)

Token Usage:

Input Tokens: Tokens in your prompt
Output Tokens: Tokens in response
Total Tokens: Combined count

Cost Breakdown:

Provider Cost: What we pay the AI provider
Customer Cost: What you're charged
Markup: Our markup (transparent)
Fixed Fee: Per-request fee (if applicable)

Performance:

Latency: Time to complete request (milliseconds)
Status Code: HTTP response code (200, 401, 402, etc.)
Error Message: Details if request failed

Metadata (expandable):

Model parameters (temperature, max_tokens, etc.)
Full request/response (if logging enabled)
Provider-specific details

Filtering Logs

By Time Period:

Last hour
Last 24 hours
Last 7 days
Last 30 days
Custom date range

By Status:

All requests
Successful only (2xx)
Client errors (4xx)
Server errors (5xx)
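
If you keep your own copy of request records (for example, logged by your application), the same status grouping can be reproduced locally. This sketch assumes each record is a dict with a "status_code" key; that shape is hypothetical and is not the platform's log schema.

```python
from typing import Dict, List


def status_class(status_code: int) -> str:
    """Return the status family, e.g. 200 -> '2xx', 429 -> '4xx'."""
    return f"{status_code // 100}xx"


def filter_by_status(records: List[Dict], wanted_class: str) -> List[Dict]:
    """Keep only records whose status code falls in the wanted family."""
    return [r for r in records if status_class(r["status_code"]) == wanted_class]


if __name__ == "__main__":
    sample = [{"status_code": 200}, {"status_code": 429}, {"status_code": 503}]
    print(filter_by_status(sample, "4xx"))
```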

By Model:

Filter to specific model
Compare across models
Identify most-used models

By API Key:

View logs for specific key
Useful for debugging
Track key usage patterns

By User (Admins/Owners only):

See team member activity
Identify high-usage users
Track individual costs

"My Logs Only" (Members):

Toggle to see only your requests
Hides team activity
Privacy-focused view

Pagination

Page size: 50 records per page
Load more: Click to see older records
Export: Download as CSV (coming soon)

Cost Analysis

Understanding Costs

Provider Cost vs Customer Cost:

Provider Cost: $0.10      (what we pay)
Markup (25%):   $0.025    (our margin)
Fixed Fee:      $0.01     (per-request fee)
─────────────────────────
Customer Cost:  $0.135    (what you pay)

Why the difference?

Infrastructure costs
Support and maintenance
Payment processing fees
Service reliability guarantees
Additional features (caching, routing, fallbacks)

Transparency:

Both costs shown in request logs
Clear markup percentage
No hidden fees
Predictable pricing

Cost Optimization Tips

1. Choose the Right Model:

Simple tasks: Use GPT-3.5 Turbo (~10x cheaper than GPT-4)
Complex reasoning: Use GPT-4 or Claude
Code generation: Consider GPT-4 Turbo
General chat: GPT-3.5 Turbo is usually sufficient

2. Optimize Prompts:

Be concise but clear
Remove unnecessary examples
Use system messages efficiently
Avoid redundant instructions

3. Control Output Length:

Set appropriate max_tokens
Don't request more than needed
Output tokens cost more than input

4. Use Streaming:

Same cost, better UX
Users see results faster
Can cancel early once the output is sufficient

5. Implement Caching:

Cache common responses
Reduce repeat requests
Significant savings for high-traffic apps
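
One way to cache on the client side is to key responses by a hash of the model and prompt, so identical requests are served from memory instead of being billed again. This is a minimal in-memory sketch: ResponseCache and the send_request callable are hypothetical stand-ins for your own API client, and caching like this only makes sense when repeated prompts should return the same answer.

```python
import hashlib
import json
from typing import Callable, Dict


class ResponseCache:
    """In-memory cache keyed by a hash of (model, prompt)."""

    def __init__(self, send_request: Callable[[str, str], str]) -> None:
        self._send_request = send_request   # hypothetical client call
        self._store: Dict[str, str] = {}
        self.hits = 0
        self.misses = 0

    @staticmethod
    def _key(model: str, prompt: str) -> str:
        payload = json.dumps({"model": model, "prompt": prompt}, sort_keys=True)
        return hashlib.sha256(payload.encode("utf-8")).hexdigest()

    def complete(self, model: str, prompt: str) -> str:
        key = self._key(model, prompt)
        if key in self._store:
            self.hits += 1                   # served locally, no new request
            return self._store[key]
        self.misses += 1
        response = self._send_request(model, prompt)
        self._store[key] = response
        return response
```

For production use you would likely add an expiry time and a size limit; both are left out here to keep the sketch short.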

6. Set Budget Limits:

Per-key monthly limits
Team member budgets
Alert before overspending

Cost Per Request Analysis

Average cost:

Average = Total Cost / Total Requests
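
The same average can be computed per model from your own records to see which models drive the figure. A small sketch, assuming records are dicts with hypothetical "model" and "customer_cost" keys.

```python
from collections import defaultdict
from typing import Dict, List


def average_cost(total_cost: float, total_requests: int) -> float:
    """Average = Total Cost / Total Requests, returning 0.0 when there are no requests."""
    return total_cost / total_requests if total_requests else 0.0


def average_cost_by_model(records: List[Dict]) -> Dict[str, float]:
    """Group records by model and compute the average cost per request for each."""
    totals = defaultdict(lambda: [0.0, 0])   # model -> [cost sum, request count]
    for record in records:
        entry = totals[record["model"]]
        entry[0] += record["customer_cost"]
        entry[1] += 1
    return {model: average_cost(cost, count) for model, (cost, count) in totals.items()}
```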

Track trends:

Is average increasing? (using more expensive models?)
Sudden spikes? (longer outputs or errors?)
Compare against budget expectations

Example averages by model:

GPT-3.5 Turbo: $0.001 - $0.005 per request
GPT-4: $0.01 - $0.05 per request
Claude 3 Opus: $0.015 - $0.075 per request

Performance Analytics

Latency Metrics

What we measure:

Time from request received to response complete
Includes:
- API processing time
- Model inference time
- Network latency
- Queue wait time (if any)

Typical latencies:

Fast models (GPT-3.5): 500-2000ms
Standard models (GPT-4): 2000-5000ms
Large outputs: Can be 10+ seconds

Monitoring:

Average latency per model
P50, P95, P99 percentiles (coming soon)
Identify slow requests
Track performance trends
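
Until percentile views are available in the dashboard, P50/P95/P99 can be computed from latency values you collect yourself. The sketch below uses the nearest-rank method, which is one common definition among several; the function name is illustrative.

```python
import math
from typing import Sequence


def percentile(latencies_ms: Sequence[float], p: float) -> float:
    """Nearest-rank percentile: the smallest value with at least p% of samples at or below it."""
    if not latencies_ms:
        raise ValueError("latencies_ms must not be empty")
    if not 0 < p <= 100:
        raise ValueError("p must be in the range (0, 100]")
    ordered = sorted(latencies_ms)
    rank = max(1, math.ceil(p * len(ordered) / 100))
    return ordered[rank - 1]


if __name__ == "__main__":
    samples = [500, 2000, 1200, 4800, 900]
    for p in (50, 95, 99):
        print(f"P{p}: {percentile(samples, p)} ms")
```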

Success Rate

HTTP Status Codes:

2xx Success: Request completed successfully
4xx Client Error: Problem with your request
5xx Server Error: Issue on our side or the provider's

Track success rate:

Success Rate = (2xx responses / Total responses) × 100%

Target: > 99% success rate
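
The formula and target above can be checked against any list of status codes you have collected. A minimal sketch with illustrative function names:

```python
from typing import Sequence


def success_rate(status_codes: Sequence[int]) -> float:
    """Percentage of responses with a 2xx status code."""
    if not status_codes:
        raise ValueError("status_codes must not be empty")
    successes = sum(1 for code in status_codes if 200 <= code < 300)
    return successes * 100 / len(status_codes)


def meets_target(status_codes: Sequence[int], target_percent: float = 99.0) -> bool:
    """True when the success rate is above the target (default: above 99%)."""
    return success_rate(status_codes) > target_percent
```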

Common errors:

401 Unauthorized: Invalid API key
402 Payment Required: Insufficient balance
403 Forbidden: Model not allowed for key
429 Too Many Requests: Rate limit exceeded
500 Internal Server Error: Temporary issue
503 Service Unavailable: Provider down

Error Analysis

View errors:

Go to Request Logs
Filter by status code (4xx, 5xx)
Review error messages
Identify patterns

Common patterns:

Spikes in 429: Need rate limit increase
401 errors: Key was revoked or blocked
402 errors: Wallet needs refill
500 errors: Contact support if persistent

Team Analytics (Admin/Owner)

Per-Member Usage

View individual member activity:

Go to Request Logs
Filter by team member
See their metrics:
- Total requests
- Token usage
- Cost incurred
- Models used
- Error rates

Use cases:

Identify power users
Budget allocation
Training needs
Cost attribution

Per-Key Analytics

Track API key usage:

Go to Request Logs
Filter by API key
Analyze:
- Usage patterns
- Cost trends
- Error rates
- Model preferences

Use cases:

Identify problematic keys
Optimize key limits
Debug integration issues
Plan key rotation

Team Spending Trends

Monitor team costs:

Daily spending breakdown
Week-over-week trends
Month-over-month growth
Per-member cost attribution
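
Week-over-week and month-over-month figures are percentage changes between two period totals. The sketch below shows the arithmetic on a list of daily spend values; the function names are illustrative and the data would come from your own records.

```python
from typing import List, Optional


def weekly_totals(daily_spend: List[float]) -> List[float]:
    """Sum daily spend into consecutive 7-day buckets (the last bucket may be partial)."""
    return [sum(daily_spend[i:i + 7]) for i in range(0, len(daily_spend), 7)]


def percent_change(previous: float, current: float) -> Optional[float]:
    """Percentage change from previous to current; None when previous is zero."""
    if previous == 0:
        return None
    return (current - previous) * 100 / previous


if __name__ == "__main__":
    weeks = weekly_totals([10.0] * 7 + [12.5] * 7)
    print(percent_change(weeks[0], weeks[1]))
```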

Set team budgets:

Organization-wide limits
Per-key limits
Per-member quotas (via key limits)
Alert on threshold breaches

Export and Reporting

Data Export

Available formats:

CSV (coming soon)
JSON (coming soon)
API access (coming soon)

What you can export:

Request logs
Cost reports
Usage summaries
Token breakdowns

Use cases:

Financial reconciliation
Custom analytics
Business intelligence
Compliance reporting

Scheduled Reports

Coming soon:

Daily usage summaries
Weekly cost reports
Monthly spending analysis
Custom report schedules

Access Control

Who Can View Analytics?

Metric	Member	Admin	Owner
Own requests	✅	✅	✅
Own costs	✅	✅	✅
Team requests	❌	✅	✅
Team costs	❌	✅	✅
All API keys	❌	✅	✅
Cost breakdowns	❌	✅	✅
Export data	❌	✅	✅
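
If you need to mirror these rules in an internal tool, the table maps naturally to a role-to-permissions lookup. The permission identifiers below are illustrative names derived from the table rows, not platform identifiers.

```python
MEMBER_PERMISSIONS = {"own_requests", "own_costs"}
ADMIN_PERMISSIONS = MEMBER_PERMISSIONS | {
    "team_requests", "team_costs", "all_api_keys", "cost_breakdowns", "export_data",
}

# Admins and Owners have identical analytics access in the table above.
PERMISSIONS = {
    "member": MEMBER_PERMISSIONS,
    "admin": ADMIN_PERMISSIONS,
    "owner": ADMIN_PERMISSIONS,
}


def can_view(role: str, permission: str) -> bool:
    """Return True if the role may view the given analytics item; unknown roles get nothing."""
    return permission in PERMISSIONS.get(role.lower(), set())
```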

Privacy

Member privacy:

Members see only own activity
Cannot view team spending
No access to other members' logs
Perfect for contractors/consultants

Team transparency:

Admins/Owners see all activity
Required for cost management
Enables proper budgeting
Supports accountability

Best Practices

Regular Monitoring

Daily:

Check wallet balance
Review any errors
Monitor spending trends

Weekly:

Analyze per-key usage
Review team activity
Check for anomalies

Monthly:

Full cost analysis
Model usage review
Budget vs actual comparison
Optimize based on trends

Setting Up Alerts

Wallet alerts:

Low balance threshold ($10 default)
Email when triggered
Adjust in Organization Settings

API key alerts:

Monthly limit warnings (75%, 90%, 100%)
Automatic blocking at limit
Email notifications
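
The warning levels above can be modelled as thresholds that fire once when spend first crosses them. A sketch under that assumption; how the platform actually decides when to send emails is not specified here, and the function names are illustrative.

```python
from typing import List

ALERT_THRESHOLDS = (75, 90, 100)   # percent of the monthly key limit


def newly_crossed_thresholds(previous_spend: float, current_spend: float,
                             monthly_limit: float) -> List[int]:
    """Thresholds crossed between the previous and current spend readings."""
    if monthly_limit <= 0:
        raise ValueError("monthly_limit must be positive")
    previous_pct = previous_spend * 100 / monthly_limit
    current_pct = current_spend * 100 / monthly_limit
    return [t for t in ALERT_THRESHOLDS if previous_pct < t <= current_pct]


def is_blocked(current_spend: float, monthly_limit: float) -> bool:
    """A key is blocked once spend reaches its monthly limit."""
    return current_spend >= monthly_limit
```

Comparing against the previous reading avoids sending the same warning on every check.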

Usage alerts (coming soon):

Daily spending thresholds
Unusual activity detection
Error rate spikes

Cost Governance

For small teams:

Set reasonable monthly limits per key
Monitor weekly
Review monthly for trends

For larger teams:

Per-key limits strictly enforced
Per-member quotas
Approval process for high-limit keys
Monthly cost reviews with team leads

Troubleshooting

Missing Logs

Problem: Recent requests don't appear in the logs

Possible causes:

Filters applied (check filter settings)
"My Logs Only" toggled (if you're looking for team logs)
Page not refreshed (refresh to see latest)

Solution: Clear filters and refresh

Costs Don't Match Expectations

Problem: Costs higher than anticipated

Investigation steps:

Check Request Logs for high-cost requests
Identify which models are being used (GPT-4 vs GPT-3.5)
Look at token counts (unexpectedly long outputs?)
Check for errors (retries can double costs)
Review API key usage (is a key being overused?)

Solutions:

Switch to cheaper models where appropriate
Reduce max_tokens parameter
Optimize prompts to be more concise
Fix error causes to prevent retries
Set stricter API key limits

High Error Rates

Problem: Many failed requests

Check:

Filter logs by error status (4xx, 5xx)
Read error messages
Identify patterns (same key? same model? same time?)

Common fixes:

401: Check API key is valid
402: Refill wallet
429: Reduce request rate or contact support for a limit increase
500/503: Wait and retry, or contact support if persistent
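
The fixes above imply that only some errors are worth retrying: 429, 500, and 503 may succeed later, while 401, 402, and 403 need a change on your side first. A sketch of that policy with exponential backoff; the backoff schedule is an assumption, not a documented platform recommendation, and the names are illustrative.

```python
RETRYABLE_STATUS_CODES = {429, 500, 503}

# Errors that will keep failing until the underlying cause is fixed.
FIX_FIRST = {
    401: "Check that the API key is valid",
    402: "Refill the wallet",
    403: "Use a model that is allowed for this key",
}


def should_retry(status_code: int) -> bool:
    """Retry only errors that can clear up on their own."""
    return status_code in RETRYABLE_STATUS_CODES


def backoff_delay(attempt: int, base_seconds: float = 1.0,
                  max_seconds: float = 30.0) -> float:
    """Seconds to wait before retry number `attempt` (0-based): 1, 2, 4, 8, ... capped at the maximum."""
    return min(max_seconds, base_seconds * 2 ** attempt)
```

Capping the number of attempts as well as the delay keeps repeated retries from inflating request volume.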
