Blog

Latest news and updates from LLM Gateway

LLM Gateway deployed across AWS, GCP, and Azure

How to Deploy LLM Gateway on Cloud Platforms

What it takes to run LLM Gateway in production on AWS, GCP, or Azure — the components you need, how they fit together, and why Kubernetes is the path we recommend once you outgrow a single box.

June 20, 2026

Secure API key rotation with LLM Gateway

API Key Rotation: How We Secure Your API Keys

Guides Engineering

Rotating API keys shouldn't interrupt production AI traffic. Learn how LLM Gateway enables secure rotation of both provider keys and gateway keys.

June 18, 2026

Building a Slack Q&A Bot with LLM Gateway and Chat SDK

A walkthrough of our new open-source template: a Slack bot that streams AI answers, keeps thread context, and searches the web — backed by LLM Gateway so you can switch between 280+ models with one API key.

June 18, 2026

LLM Gateway SOC 2 Type II announcement

LLM Gateway Is Now SOC 2 Type II Compliant

LLM Gateway has successfully completed its SOC 2 Type II audit. Here's what that means, why it matters for teams routing LLM traffic through us, and how to request our report.

June 11, 2026

Sound waves flowing out of a single API endpoint into multiple speech models

Speech Generation Is Live: ElevenLabs, OpenAI & Gemini TTS Through One API

Nine text-to-speech models from ElevenLabs, OpenAI, and Google are now one OpenAI-compatible API call away — plus a new Audio Studio in the Playground to compare voices and models side by side.

June 10, 2026

A digital wallet streaming credit tokens into an AI chat interface

Stripe for AI: Embed AI + Credit Purchases in Your App

Our new LLM SDK lets your end-users buy credits inside your app and chat with any model — billed through LLM Gateway, with your markup as margin. Here's how it works and how to ship it in ~40 lines.

June 7, 2026

LLM Gateway vs Portkey: An Honest Comparison

Looking for a Portkey alternative? A straightforward comparison of LLM Gateway and Portkey — features, pricing, deployment, and trade-offs — so you can pick the right AI gateway for your stack.

May 26, 2026

What Is LLM Orchestration? Patterns, Tools & When You Need One

LLM orchestration is the layer that coordinates models, providers, and steps into one reliable workflow. A practical guide to the patterns, the tools, and when you need an LLM orchestrator.

May 26, 2026

ByteDance Seedance video models now live on LLM Gateway

ByteDance Seedance Lands on LLM Gateway: Three Video Models, One API

All three ByteDance Seedance video generation models — 2.0, 2.0 Fast, and 1.5 Pro with native audio — are now live on LLM Gateway. Same API key. Same billing. Same dashboard.

May 17, 2026

Embeddings on LLM Gateway — turn meaning into vectors

Embeddings on LLM Gateway: One API for Vectors and Chat

Generate vectors for semantic search, clustering, and RAG through the same gateway you already use for chat. OpenAI-compatible, drop-in, and tracked alongside your model spend.

May 15, 2026

LLM Gateway vs Direct API: When the Provider SDK Stops Scaling

Calling OpenAI or Anthropic directly is the right first call. Here's the honest case for when a gateway starts paying for itself — and when you don't need one yet.

April 26, 2026

Prompt Caching Explained: How to Cut LLM Costs by 30–99%

How LLM response caching actually works, where it helps, where it doesn't, and how to turn it on without rewriting your app.

April 25, 2026

How to Estimate LLM Token Costs Before You Ship

A practical guide to forecasting LLM costs: the token formula, real-world examples across GPT-5.4, Claude, and Gemini, and a free calculator to run the numbers.

April 24, 2026

LLM Gateway vs LiteLLM: An Honest Comparison

A straightforward comparison of LLM Gateway and LiteLLM — features, operational cost, and trade-offs — so you can pick the right one for your stack.

April 23, 2026

LLM Gateway Is Now a Built-in Provider in OpenCode

No config files, no env vars. OpenCode ships LLM Gateway as a first-class provider — select it, paste your key, and start coding with 280+ models.

April 18, 2026

How to Choose the Right LLM for Your Use Case in 2026

A practical framework for picking the right model — based on task type, budget, latency requirements, and context window — instead of chasing benchmarks.

April 11, 2026

How We Handle LLM Provider Failover at Scale

A deep dive into the routing, retry, and failover systems that keep LLM Gateway reliable when upstream providers go down.

April 11, 2026

LLM Gateway vs OpenRouter: An Honest Comparison

A straightforward comparison of LLM Gateway and OpenRouter — features, pricing, and trade-offs — so you can pick the right one for your stack.

April 11, 2026

LLM Guardrails Explained: Prompt Injection, PII Detection & Content Moderation

What LLM guardrails are, why they matter in production, and how to implement content safety without building it yourself.

April 11, 2026

7 Best AI Gateways in 2026 (Compared)

An honest comparison of the top AI gateways — features, pricing, and trade-offs — so you can pick the right one for your stack.

April 9, 2026

Up to 30% Off DeepSeek Models on LLM Gateway

LLM Gateway offers exclusive discounts on DeepSeek V3.2, V3.1, and R1 through partner providers — up to 30% off base pricing, applied automatically to every request.

March 28, 2026

Top 10 Cheapest Providers for DeepSeek V3.2 in 2026

We compared DeepSeek V3.2 pricing across every major API provider. Here's the definitive ranking — and how our Token Cost Calculator can help you estimate exact savings.

March 28, 2026

Q1 2026 Feature Roundup

Q1 2026: Video Gen, Image Studio & Enterprise Features

Three months of updates: video generation, Image Studio, sessions, GPT-5.4 family, enterprise guardrails, 5+ new providers, and much more.

March 23, 2026

How We Cut Our LLM Costs 60% With Request Routing

A practical breakdown of how intelligent routing, caching, and model selection through an LLM gateway can dramatically reduce your AI infrastructure costs.

February 14, 2026

OpenAI vs Anthropic vs Google: Real Cost Comparison 2026

Side-by-side pricing comparison of GPT-5, Claude Opus 4.6, and Gemini 2.5 Pro with real cost calculations for production workloads.

February 11, 2026

20% Off All Alibaba Cloud Qwen Models on LLM Gateway

LLM Gateway partners with Alibaba Cloud to bring you 20% off 26 Qwen AI models — including Qwen3 Max, Qwen3 Coder, QwQ reasoning, vision-language, and image generation models.

February 10, 2026

Getting Started with LLM Gateway in 5 Minutes

A step-by-step guide to making your first LLM API request through LLM Gateway — from signup to seeing results in your dashboard.

February 8, 2026

The Hidden Cost of LLM Vendor Lock-in

Why building directly against a single LLM provider's API is riskier than you think, and how a gateway layer protects your AI investment.

February 5, 2026

Why Your AI App Needs a Gateway Layer

What an LLM gateway does, why it matters, and how it lets you ship AI features faster by abstracting away provider complexity.

February 3, 2026

Enterprise AI Gateway connecting data sources, applications, and users to AI models

Beyond Proxies: Why Enterprises Need a Unified AI Gateway

Learn why simple LLM proxies aren't enough and how a unified AI gateway delivers centralized access control, cost visibility, compliance, and security.

January 25, 2026

What is an LLM Gateway?

Learn what an LLM Gateway is, why you need one, and how it simplifies integrating, managing, and deploying large language models in production.

January 25, 2026

Deploying Next.js on GCP

How we deploy our Next.js apps on Google Cloud Platform without relying on Vercel.

December 27, 2025

Q4 2025 Feature Roundup

Quarter 4 2025: Major Platform Updates

Three months of updates: 15+ new models, team management, referral program, tiered pricing, data retention, new providers, and much more.

December 18, 2025

LLM Gateway Cyber Monday Credits Promotion

Cyber Monday at LLM Gateway – 50% Off Credits

Use the CYBERMONDAY promo code to get 50% off credits for a limited time.

December 1, 2025

LLM Gateway Black Friday Credits Promotion

Black Friday at LLM Gateway – 50% Off Credits

Use the BLACKFRIDAY promo code to get 50% off credits for a limited time.

November 28, 2025

LLM Gateway Playground

Introducing LLM Gateway Chat — Test 280+ Models in One Interface

Compare GPT-5, Claude, Gemini, and 280+ other models side by side. Generate images, test prompts, and find the best model for your use case.

October 14, 2025

Configure Claude Code with LLM Gateway

How to Configure Claude Code to Use Any Model via LLM Gateway

Use GPT-5, Gemini, or any model with Claude Code. Three environment variables, zero code changes.

September 8, 2025

LLM Gateway

Custom OpenAI-Compatible Providers Are Now Supported

Connect your internal LLM deployments or any OpenAI-compatible API to LLM Gateway—and get the same analytics, caching, and routing.

May 10, 2025

How to Self-Host LLM Gateway

Run LLM Gateway on your own infrastructure in under 5 minutes. Full control, zero platform fees.

May 1, 2025

Introducing LLM Gateway

One API for 280+ models across 35+ providers. Route requests, track costs, and switch models without changing your code.

April 12, 2025