GPT-5.4
GPT-5.4 is a GPT-5 model from OpenAI, released in 2026-03. It costs $2.5 / $15 per 1M, has a 400k-token context window, and is best for general, reasoning, agents. Last verified 2026-04-19.
Spec sheet
Pricing
- Input
- $2.5 / 1M
- Output
- $15 / 1M
- Cached input
- $0.25 / 1M
- Batch discount
- 50%
- Free tier
- No
Context & speed
- Context window
- 400k tokens
- Max output
- 32k tokens
- Throughput
- ~105 tok/s
- Time to first token
- ~550 ms
- Speed tier
- balanced
Capabilities
- Tool use
- Yes
- Structured output
- Yes
- Prompt caching
- Yes
- Extended thinking
- Yes
- Vision input
- Yes
- Audio in / out
- No
- Fine-tuning
- Yes
Deployment
- Open weights
- No
- On-prem
- No
- HIPAA eligible
- Yes
- Zero retention
- Yes
- Regions
- us, eu
Estimated monthly cost
Assumes typical token shape: 2k input, 600 output per call. Prompt caching is excluded from these figures.
When to use GPT-5.4
Sweet spot
- general
- reasoning
- agents
- content writing
- vision
- coding
Known trade-offs
- behind Claude Sonnet on agent/tool benchmarks
Works with
Compare GPT-5.4 to other models
FAQ — GPT-5.4
How much does GPT-5.4 cost?
GPT-5.4 costs $2.5 / $15 per 1M tokens on the OpenAI API. Cached input reads cost $0.25 per 1M, cutting the input bill by roughly 90% on repeat system prompts. The batch API offers a 50% discount for async workloads.
What is the context window of GPT-5.4?
GPT-5.4 has a 400k-token context window with up to 32k tokens of output. That's enough for long reports, extended chat histories, or structured document analysis.
Does GPT-5.4 have a free tier?
No — Free $5 credit for new accounts; paid thereafter.
Is GPT-5.4 HIPAA / EU / on-prem friendly?
GPT-5.4 is HIPAA-eligible, available in EU regions, and is API-only. Zero data retention is available for enterprise customers.
What is GPT-5.4 best for?
GPT-5.4 is best for general, reasoning, agents, content writing. Trade-offs to be aware of: behind Claude Sonnet on agent/tool benchmarks.
Which tools and SDKs work with GPT-5.4?
GPT-5.4 integrates with OpenAI SDK, Azure OpenAI, Cursor, Vercel AI SDK, LangChain, LlamaIndex, OpenRouter. Most major AI frameworks support it either natively or through OpenAI-compatible endpoints.