Total Requests Today

12,847
+23% from yesterday

Average Response Time

245ms
-12% improvement

Cost Savings

$2,341
+18% this month

Success Rate

99.7%
+0.3% uptime

Connected AI Providers

OpenAI

GPT-4, GPT-3.5, DALL-E, Whisper

Online • 3.2k requests/hr

Anthropic

Claude-3, Claude-2, Claude Instant

Online • 1.8k requests/hr

Google AI

Gemini Pro, PaLM, Bard API

Rate Limited • 892 requests/hr

Azure OpenAI

GPT-4, GPT-3.5 (Enterprise)

Online • 2.1k requests/hr

Quick Actions

Add New Provider Security Settings View Analytics

Recent Requests

POST /v1/chat/completions
200
245ms
POST /v1/embeddings
200
123ms
POST /v1/chat/completions
429
12ms
GET /v1/models
200
89ms
POST /v1/images/generations
200
3.2s

Performance Analytics Dashboard

Request Latency

ā“˜
245ms
P99: 380ms P90: 290ms P50: 180ms

Time to First Token (TTFS)

ā“˜
156ms
P99: 280ms P90: 210ms P50: 120ms

Inter-Token Latency (ITL)

ā“˜
45ms
P99: 89ms P90: 67ms P50: 32ms

Error Rate

ā“˜
0.3%
4xx: 0.2% 5xx: 0.1% Timeout: 0.0%

Monthly Cost Analysis & Savings

This Month: $2,341 ↓ -32% vs last month
šŸ’°
Total Savings
$1,127
+47% vs last month
⚔
Cost per Request
$0.0182
-28% vs last month
šŸ“ˆ
Optimization Rate
94.2%
+12% vs last month

Month-over-Month Spend Comparison

Last 6 Months
Monthly Spend ($)
Months (Gateway Implementation →)

Gateway Configuration Impact

🚦

Rate Limiting

127 triggers
2.3% of requests throttled
āš–ļø

Load Balancing

1,847 switches
Optimal distribution achieved
šŸ”„

Fallback Usage

23 fallbacks
0.4% fallback rate
šŸ›”ļø

Guardrails

8 blocks
Content policy enforced
šŸ’°

Budget Controls

3 alerts
87% of monthly budget used
šŸ‘„

Role-Based Access

247 sessions
12 active roles configured

Real-time Request Flow

~47 req/sec
Requests/sec
Time (Last 60 seconds)