Infrastructure Demo

Resilience in action

Watch Presence's multi-provider LLM gateway handle real failures in real time. No reloads. No downtime. Just seamless failover.

Live System

Multi-provider LLM Gateway

Live traffic flow visualization

Streaming

Click a provider node to kill it

Inject faults, watch recovery

Active

Failed

groq

Primary

Calls real backend endpoints. TrueFoundry routes automatically when a provider fails.

1
Groq (primary)
Fastest — llama3-70b at ~120ms. Handles all traffic while healthy.
2
Gemini (fallback)
Auto-promoted if Groq errors or latency spikes. gemini-flash-latest.
3
OpenAI (tertiary)
Last resort. gpt-4o-mini. Ensures availability even if two providers fail.
4
Zero user downtime
The companion chat continues responding — no error state shown to user.

Provider Status

PRIMARY

⚡

Groq

Active

Latency

120ms

Requests

served

Uptime99.9%

llama3-70b

✦

Gemini

Standby

Latency

340ms

Requests

served

Uptime98.7%

gemini-flash-latest

◎

OpenAI

Standby

Latency

580ms

Requests

served

Uptime99.5%

gpt-4o-mini