Voice AI @ 18¢ < 1¢ a minute.
No magic. Just the engineering
most voice AI skipped.
Voice AI that gets cheaper every call. Drop-in for every major voice agent API — same code, one tenth the bill.
# inside claude code /plugin marketplace add glimpass/voice /plugin install voice@glimpass
// ~/.cursor/mcp.json { "mcpServers": { "glimpass": { "command": "npx", "args": ["-y", "@glimpass/mcp"] } } }
# ~/.codex/config.toml [mcp_servers.glimpass] command = "npx" args = ["-y", "@glimpass/mcp"]
Most voice AI charges you the same on call 1 and call 100,000.
We charge less every time the same conversation happens.
Built like a database, not a model.
I built this after watching my own voice-AI bill go to inference returning the same answers, in the same voice, to the same questions — over and over.
The cache underneath isn't a wrapper around someone else's library. It's an original algorithm, designed for voice agents from the first line — informed by systems I've shipped through 20+ million production calls.
And everything else — done right.
The cache is the part you'll talk about.
The rest is the part you'll forget exists — because it just works.
You don't migrate. You just change the URL.
Your existing voice agent code just works.
We expose the same APIs every major voice platform exposes — same routes, same field names, same SDKs. Point your client at our base URL, swap the API key, ship.
No new mental model. No JSON migration. No flow rebuilding. Your runbooks don't change.
Honest billing. You win, we win.
Bring your own LLM, TTS, and STT keys. We sit in front, measure exactly what we save you, and bill 20% of the delta — itemized per cache hit, deducted in real time. Save nothing? Pay nothing.
- real-time deduction · powered by our observability layer
- per-call audit trail · every dollar reconciled
- methodology doc shared at signup
- flat-rate quote available for procurement on request
For teams running their own LLM and TTS. Glimpass sits in front of your inference cluster and intercepts cached turns before they ever hit a GPU. Same hardware budget, an order of magnitude more concurrent calls.
- 10× concurrent capacity on the same hardware
- 0 data leaves your network · air-gapped install
- SOC-friendly audit logs, exportable on request
- quote, deployment plan, methodology doc — within 48h
AI stack only. Telephony pass-through (~$0.008/min) billed at carrier cost.
Tell us about your campaign.
We reply within 48 hours. No SDR call. No demo. We send you a quote and a methodology doc — you decide from there.