Qwen: Qwen3.6 35B A3B vs xAI: Grok 4.20

Side-by-side comparison of specs, pricing, benchmark scores, and task rankings. Updated 2026-06-22.

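| Spec | Qwen: Qwen3.6 35B A3B | xAI: Grok 4.20 |
| --- | --- | --- |
| Vendor | qwen | x-ai |
| Quality Score | 100 | 100 |
| Benchmark Score | 51.8 | 61.5 |
| Input Price | $0.14/M | $1.25/M |
| Output Price | $1.00/M | $2.50/M |
| Context Window | 262,144 | 2,000,000 |
| Max Output | 262,144 | - |
| Tool Calling | | |
| Structured Output | | |
| Reasoning Mode | | |
| Vision | | |
| Audio | - | - |

Benchmark Scores

| Benchmark | Qwen: Qwen3.6 35B A3B | xAI: Grok 4.20 |
| --- | --- | --- |
| ai_index | 52.2 | 61.0 |
| ai_index_agentic | 35.3 | - |
| ai_index_coding | 69.1 | - |
| eqbench | - | 55.8 |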
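
To make the price gap concrete, the listed input and output prices can be turned into a per-request estimate. The sketch below assumes that "/M" means US dollars per one million tokens; the workload of 10,000 input tokens and 2,000 output tokens is a hypothetical example, not a figure from this page, and `request_cost` is an illustrative helper name.

```python
# Prices copied from the spec table; "/M" is ASSUMED to mean USD per 1M tokens.
PRICES = {
    "Qwen3.6 35B A3B": {"input": 0.14, "output": 1.00},
    "Grok 4.20": {"input": 1.25, "output": 2.50},
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost of one request at the listed per-million prices."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: 10,000 input tokens and 2,000 output tokens.
    for model in PRICES:
        print(f"{model}: ${request_cost(model, 10_000, 2_000):.4f}")
```

Under these assumptions the same request costs roughly five times more on Grok 4.20 than on Qwen3.6 35B A3B; the ratio shifts with the mix of input and output tokens, since the input prices differ far more than the output prices.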

Who wins by task?

| Task | Qwen: Qwen3.6 35B A3B | xAI: Grok 4.20 |
| --- | --- | --- |
| SQL Generation | 154 | 144 |
| Code Review | 147 | 150 |
| Code Completion | 130 | 122 |
| Code Refactoring | 144 | 153 |
| Bug Fixing | 156 | 154 |
| Unit Test Generation | 140 | 135 |
| Code Documentation | 133 | 141 |
| Regex Writing | 129 | 127 |
| CI/CD Pipelines | 132 | 131 |
| Frontend Component Design | 137 | 131 |
| Data Analysis | 151 | 136 |
| CSV / Spreadsheet Cleanup | 141 | 139 |
| ETL Scripting | 138 | 142 |
| JSON Extraction | 143 | 123 |
| Bulk Data Labeling | 133 | 120 |
| OCR / Document Parsing | 138 | 135 |
| Table Extraction from PDFs | 138 | 135 |
| Long-Document Summarization | 142 | 154 |
| Short-Form Summarization | 128 | 119 |
| Blog Post Writing | 130 | 132 |

Scores reflect capability match, benchmark data, and pricing for each task.
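
One way to read the task table is to count how many tasks each model leads and to find where the gap is widest. The sketch below only compares the published per-task scores; it does not reproduce how those scores are computed, and the helper names `tally_wins` and `largest_gap` are illustrative.

```python
from collections import Counter

# (task, Qwen3.6 35B A3B score, Grok 4.20 score), copied from the task table.
TASK_SCORES = [
    ("SQL Generation", 154, 144),
    ("Code Review", 147, 150),
    ("Code Completion", 130, 122),
    ("Code Refactoring", 144, 153),
    ("Bug Fixing", 156, 154),
    ("Unit Test Generation", 140, 135),
    ("Code Documentation", 133, 141),
    ("Regex Writing", 129, 127),
    ("CI/CD Pipelines", 132, 131),
    ("Frontend Component Design", 137, 131),
    ("Data Analysis", 151, 136),
    ("CSV / Spreadsheet Cleanup", 141, 139),
    ("ETL Scripting", 138, 142),
    ("JSON Extraction", 143, 123),
    ("Bulk Data Labeling", 133, 120),
    ("OCR / Document Parsing", 138, 135),
    ("Table Extraction from PDFs", 138, 135),
    ("Long-Document Summarization", 142, 154),
    ("Short-Form Summarization", 128, 119),
    ("Blog Post Writing", 130, 132),
]


def tally_wins(rows):
    """Count how many tasks each model leads; equal scores count as ties."""
    wins = Counter()
    for _task, qwen, grok in rows:
        if qwen > grok:
            wins["Qwen3.6 35B A3B"] += 1
        elif grok > qwen:
            wins["Grok 4.20"] += 1
        else:
            wins["tie"] += 1
    return wins


def largest_gap(rows):
    """Return the row with the biggest absolute score difference."""
    return max(rows, key=lambda row: abs(row[1] - row[2]))


if __name__ == "__main__":
    print(dict(tally_wins(TASK_SCORES)))
    print(largest_gap(TASK_SCORES))
```

On the scores listed here, Qwen3.6 35B A3B leads in 14 of the 20 tasks and Grok 4.20 in 6, with the widest margin on JSON Extraction (143 vs 123).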
