OpenAI

GPT-5.4 Mini

GPT-5.4 Mini is a GPT-5 model from OpenAI, released in 2026-03. It costs $0.75 / $4.5 per 1M, has a 400k-token context window, and is best for chatbot, extraction, cheap-production. Last verified 2026-04-19.

Spec sheet

Pricing

Input
$0.75 / 1M
Output
$4.5 / 1M
Cached input
$0.075 / 1M
Batch discount
50%
Free tier
No

Context & speed

Context window
400k tokens
Max output
16k tokens
Throughput
~180 tok/s
Time to first token
~320 ms
Speed tier
fast

Capabilities

Tool use
Yes
Structured output
Yes
Prompt caching
Yes
Extended thinking
Yes
Vision input
Yes
Audio in / out
No
Fine-tuning
Yes

Deployment

Open weights
No
On-prem
No
HIPAA eligible
Yes
Zero retention
Yes
Regions
us, eu

Estimated monthly cost

Assumes typical token shape: 2k input, 600 output per call. Prompt caching is excluded from these figures.

10k calls/mo
$42.00
per month
100k calls/mo
$420.00
per month
1M calls/mo
$4.2k
per month

When to use GPT-5.4 Mini

Sweet spot

  • chatbot
  • extraction
  • cheap production
  • high throughput

Known trade-offs

  • below flagship on hardest tasks

Works with

OpenAI SDKAzure OpenAICursorVercel AI SDKLangChainLlamaIndexOpenRouter

FAQ — GPT-5.4 Mini

How much does GPT-5.4 Mini cost?

GPT-5.4 Mini costs $0.75 / $4.5 per 1M tokens on the OpenAI API. Cached input reads cost $0.075 per 1M, cutting the input bill by roughly 90% on repeat system prompts. The batch API offers a 50% discount for async workloads.

What is the context window of GPT-5.4 Mini?

GPT-5.4 Mini has a 400k-token context window with up to 16k tokens of output. That's enough for long reports, extended chat histories, or structured document analysis.

Does GPT-5.4 Mini have a free tier?

No — Near-free at $0.75/$4.50 per 1M; starter credits only.

Is GPT-5.4 Mini HIPAA / EU / on-prem friendly?

GPT-5.4 Mini is HIPAA-eligible, available in EU regions, and is API-only. Zero data retention is available for enterprise customers.

What is GPT-5.4 Mini best for?

GPT-5.4 Mini is best for chatbot, extraction, cheap production, high throughput. Trade-offs to be aware of: below flagship on hardest tasks.

Which tools and SDKs work with GPT-5.4 Mini?

GPT-5.4 Mini integrates with OpenAI SDK, Azure OpenAI, Cursor, Vercel AI SDK, LangChain, LlamaIndex, OpenRouter. Most major AI frameworks support it either natively or through OpenAI-compatible endpoints.