Question 1

Codestral 25.08 vs Grok 4.20 — which is better?

Accepted Answer

Codestral 25.08 and Grok 4.20 are both current production-tier models. Codestral 25.08 is meaningfully cheaper at $0.30 / $0.90 per 1M. Grok 4.20 has a 2M context window — about 8× the 256k of Codestral 25.08. Grok 4.20 leads on reasoning, general knowledge, long-context retrieval. The right pick depends on your use case — see "When to choose each" above for a data-driven decision.

Question 2

How does Codestral 25.08 pricing compare to Grok 4.20?

Accepted Answer

Codestral 25.08 costs $0.30 / $0.90 per 1M vs Grok 4.20 at $2 / $6 per 1M. Codestral 25.08 is cheaper on output tokens by roughly 567%. Both support prompt caching, which reduces effective cost by 80-90% on repeat system prompts.

Question 3

Does Codestral 25.08 or Grok 4.20 have the bigger context window?

Accepted Answer

Grok 4.20 has a 2M-token context window — 8× the 256k context of Codestral 25.08. Enough for entire codebases, books, or multi-document RAG.

Question 4

Is there a free tier for Codestral 25.08 or Grok 4.20?

Accepted Answer

Codestral 25.08: yes — Free tier for prototyping with rate limits. Grok 4.20: no — X Premium includes Grok web chat; API is paid.

Question 5

Which is better for coding — Codestral 25.08 or Grok 4.20?

Accepted Answer

Grok 4.20 leads on coding benchmarks (Codestral 25.08: 90/100, Grok 4.20: 91/100). For production coding agents also weigh tool-use performance — Codestral 25.08 scores 78, Grok 4.20 scores 88.

Metric	Mistral AI Codestral 25.08	xAI Grok 4.20
Input price (per 1M)	$0.3	$2
Output price (per 1M)	$0.9	$6
Context window	256k tokens	2M tokens
Speed tier	fast	balanced
Open weights	Yes	No
EU region	Yes	No
Free tier	La Plateforme	No
Prompt caching	No	Yes
Vision input	No	Yes
Extended thinking	No	Yes

Codestral 25.08 vs Grok 4.20

Specs side by side

When to choose each

Choose Codestral 25.08 if…

Choose Grok 4.20 if…

Benchmark delta

Codestral 25.08 leads on

Grok 4.20 leads on

FAQ — Codestral 25.08 vs Grok 4.20