
xAI: Grok 4.20 vs Mistral: Mistral Small 4

Side-by-side comparison of specs, pricing, benchmark scores, and task rankings. Updated 2026-06-22.

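| | xAI: Grok 4.20 | Mistral: Mistral Small 4 |
|---|---|---|
| Vendor | x-ai | mistralai |
| Quality Score | 100 | 100 |
| Benchmark Score | 61.5 | 6.1 |
| Input Price | $1.25/M | $0.15/M |
| Output Price | $2.50/M | $0.60/M |
| Context Window | 2,000,000 | 262,144 |
| Max Output | - | - |
| Tool Calling | | |
| Structured Output | | |
| Reasoning Mode | | |
| Vision | | |
| Audio | - | - |

Benchmark Scores

| Benchmark | xAI: Grok 4.20 | Mistral: Mistral Small 4 |
|---|---|---|
| ai_index | 61.0 | 7.7 |
| eqbench | 55.8 | - |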
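
To make the price gap concrete, the sketch below estimates what a workload would cost on each model from the input and output prices listed above, read as US dollars per million tokens. The workload itself (1,000 requests of 4,000 input tokens and 500 output tokens each) is a hypothetical chosen for illustration, and the function name `workload_cost` is not part of any vendor API.

```python
# Prices in USD per million tokens, copied from the comparison above.
PRICES = {
    "xAI: Grok 4.20": {"input": 1.25, "output": 2.50},
    "Mistral: Mistral Small 4": {"input": 0.15, "output": 0.60},
}


def workload_cost(model, requests, input_tokens, output_tokens):
    """Estimate the USD cost of `requests` calls with the given per-call token counts."""
    price = PRICES[model]
    total_input = requests * input_tokens
    total_output = requests * output_tokens
    return (total_input * price["input"] + total_output * price["output"]) / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: 1,000 requests, 4,000 input and 500 output tokens each.
    for model in PRICES:
        cost = workload_cost(model, requests=1000, input_tokens=4000, output_tokens=500)
        print(f"{model}: ${cost:.2f}")
```

Under that assumed workload the estimate comes to about $6.25 for Grok 4.20 and about $0.90 for Mistral Small 4. Real bills can differ if a provider applies caching discounts, tiered pricing, or reasoning-token charges, none of which are listed on this page.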

Who wins by task?

| Task | xAI: Grok 4.20 | Mistral: Mistral Small 4 |
|---|---|---|
| SQL Generation | 144 | 132 |
| Code Review | 150 | 127 |
| Code Completion | 122 | 129 |
| Code Refactoring | 153 | 129 |
| Bug Fixing | 154 | 131 |
| Unit Test Generation | 135 | 122 |
| Code Documentation | 141 | 127 |
| Regex Writing | 127 | 120 |
| CI/CD Pipelines | 131 | 118 |
| Frontend Component Design | 131 | 122 |
| Data Analysis | 136 | 125 |
| CSV / Spreadsheet Cleanup | 139 | 128 |
| ETL Scripting | 142 | 123 |
| JSON Extraction | 123 | 131 |
| Bulk Data Labeling | 120 | 129 |
| OCR / Document Parsing | 135 | 128 |
| Table Extraction from PDFs | 135 | 128 |
| Long-Document Summarization | 154 | 130 |
| Short-Form Summarization | 119 | 124 |
| Blog Post Writing | 132 | 120 |

Scores reflect capability match, benchmark data, and pricing for each task.
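
For a quick read of the task scores, the sketch below tallies which model has the higher score on each task. It only compares the published numbers and does not reproduce the scoring methodology, which this page does not spell out. The variable names are illustrative.

```python
# Task scores copied from the comparison: (Grok 4.20, Mistral Small 4).
SCORES = {
    "SQL Generation": (144, 132),
    "Code Review": (150, 127),
    "Code Completion": (122, 129),
    "Code Refactoring": (153, 129),
    "Bug Fixing": (154, 131),
    "Unit Test Generation": (135, 122),
    "Code Documentation": (141, 127),
    "Regex Writing": (127, 120),
    "CI/CD Pipelines": (131, 118),
    "Frontend Component Design": (131, 122),
    "Data Analysis": (136, 125),
    "CSV / Spreadsheet Cleanup": (139, 128),
    "ETL Scripting": (142, 123),
    "JSON Extraction": (123, 131),
    "Bulk Data Labeling": (120, 129),
    "OCR / Document Parsing": (135, 128),
    "Table Extraction from PDFs": (135, 128),
    "Long-Document Summarization": (154, 130),
    "Short-Form Summarization": (119, 124),
    "Blog Post Writing": (132, 120),
}

# Count the tasks on which each model has the strictly higher score.
wins = {"grok": 0, "mistral": 0, "tie": 0}
for grok, mistral in SCORES.values():
    if grok > mistral:
        wins["grok"] += 1
    elif mistral > grok:
        wins["mistral"] += 1
    else:
        wins["tie"] += 1

# Tasks where Mistral Small 4 leads, in table order.
mistral_leads = [task for task, (g, m) in SCORES.items() if m > g]

print(wins)
print(mistral_leads)
```

On these numbers Grok 4.20 leads on 16 of the 20 tasks, while Mistral Small 4 leads on Code Completion, JSON Extraction, Bulk Data Labeling, and Short-Form Summarization.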

Related comparisons

MoonshotAI: Kimi K2.7 Code vs xAI: Grok 4.20
MoonshotAI: Kimi K2.7 Code vs Mistral: Mistral Small 4
Qwen: Qwen3.7 Plus vs xAI: Grok 4.20
Qwen: Qwen3.7 Plus vs Mistral: Mistral Small 4
MiniMax: MiniMax M3 vs xAI: Grok 4.20
MiniMax: MiniMax M3 vs Mistral: Mistral Small 4
StepFun: Step 3.7 Flash vs xAI: Grok 4.20
StepFun: Step 3.7 Flash vs Mistral: Mistral Small 4