StepFun: Step 3.7 Flash vs Mistral: Mistral Medium 3.5

Side-by-side comparison of specs, pricing, benchmark scores, and task rankings. Updated 2026-06-22.

	StepFun: Step 3.7 Flash	Mistral: Mistral Medium 3.5
Vendor	stepfun	mistralai
Quality Score	100	100
Benchmark Score	48.0	52.2
Input Price	$0.20/M	$1.50/M
Output Price	$1.15/M	$7.50/M
Context Window	256,000	262,144
Max Output	256,000	-
Tool Calling	✓	✓
Structured Output	✓	✓
Reasoning Mode	✓	✓
Vision	✓	✓
Audio	-	-
Benchmark Scores
ai_index	49.1	49.4
ai_index_agentic	35.5	31.3
ai_index_coding	61.6	77.4
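
The price rows above are quoted per million tokens, so the gap on a real bill depends on the input/output mix. The following is a minimal sketch of that arithmetic using the listed rates; the workload size (2,000,000 input tokens, 500,000 output tokens) and the names `PRICES` and `workload_cost` are assumptions made for illustration, not figures or identifiers from this page.

```python
# USD per million tokens, copied from the Input Price / Output Price rows above.
PRICES = {
    "StepFun: Step 3.7 Flash": {"input": 0.20, "output": 1.15},
    "Mistral: Mistral Medium 3.5": {"input": 1.50, "output": 7.50},
}


def workload_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of a workload at the listed per-million-token rates."""
    rates = PRICES[model]
    total = input_tokens * rates["input"] + output_tokens * rates["output"]
    return total / 1_000_000


# Hypothetical workload (an assumption, not taken from the page).
INPUT_TOKENS = 2_000_000
OUTPUT_TOKENS = 500_000

for model in PRICES:
    cost = workload_cost(model, INPUT_TOKENS, OUTPUT_TOKENS)
    print(f"{model}: ${cost:.3f}")
```

For this assumed workload the listed rates work out to roughly $0.98 for Step 3.7 Flash and $6.75 for Mistral Medium 3.5. A more output-heavy mix widens the absolute gap, since the output prices differ by more than the input prices.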

Who wins by task?

Task	StepFun: Step 3.7 Flash	Mistral: Mistral Medium 3.5
SQL Generation	152	153
Code Review	145	147
Code Completion	129	116
Code Refactoring	143	144
Bug Fixing	154	155
Unit Test Generation	138	140
Code Documentation	132	131
Regex Writing	129	128
CI/CD Pipelines	131	132
Frontend Component Design	135	137
Data Analysis	149	151
CSV / Spreadsheet Cleanup	140	142
ETL Scripting	137	138
JSON Extraction	142	134
Bulk Data Labeling	133	123
OCR / Document Parsing	137	139
Table Extraction from PDFs	137	139
Short-Form Summarization	128	119
Blog Post Writing	129	129

Scores reflect capability match, benchmark data, and pricing for each task.
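
To read the task table at a glance, it can help to tally which model scores higher per task and where the gap is widest. The sketch below does only that, using the scores as listed; it does not reproduce the site's scoring method, which is not described here, and the names `TASK_SCORES`, `tally`, and `widest_gap` are hypothetical.

```python
# (task, Step 3.7 Flash score, Mistral Medium 3.5 score), copied from the task table.
TASK_SCORES = [
    ("SQL Generation", 152, 153),
    ("Code Review", 145, 147),
    ("Code Completion", 129, 116),
    ("Code Refactoring", 143, 144),
    ("Bug Fixing", 154, 155),
    ("Unit Test Generation", 138, 140),
    ("Code Documentation", 132, 131),
    ("Regex Writing", 129, 128),
    ("CI/CD Pipelines", 131, 132),
    ("Frontend Component Design", 135, 137),
    ("Data Analysis", 149, 151),
    ("CSV / Spreadsheet Cleanup", 140, 142),
    ("ETL Scripting", 137, 138),
    ("JSON Extraction", 142, 134),
    ("Bulk Data Labeling", 133, 123),
    ("OCR / Document Parsing", 137, 139),
    ("Table Extraction from PDFs", 137, 139),
    ("Short-Form Summarization", 128, 119),
    ("Blog Post Writing", 129, 129),
]


def tally(scores):
    """Count tasks where each model has the higher score, plus ties."""
    counts = {"step": 0, "mistral": 0, "tie": 0}
    for _task, step, mistral in scores:
        if step > mistral:
            counts["step"] += 1
        elif mistral > step:
            counts["mistral"] += 1
        else:
            counts["tie"] += 1
    return counts


def widest_gap(scores):
    """Return (task, step_minus_mistral) for the largest absolute score difference."""
    task, step, mistral = max(scores, key=lambda row: abs(row[1] - row[2]))
    return task, step - mistral


print(tally(TASK_SCORES))       # {'step': 6, 'mistral': 12, 'tie': 1}
print(widest_gap(TASK_SCORES))  # ('Code Completion', 13)
```

On these numbers Mistral Medium 3.5 leads on more tasks, but mostly by one or two points, while Step 3.7 Flash's leads are fewer and larger (Code Completion, Bulk Data Labeling, Short-Form Summarization, JSON Extraction).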

Related comparisons

MoonshotAI: Kimi K2.7 Code vs StepFun: Step 3.7 Flash
MoonshotAI: Kimi K2.7 Code vs Mistral: Mistral Medium 3.5
Qwen: Qwen3.7 Plus vs StepFun: Step 3.7 Flash
Qwen: Qwen3.7 Plus vs Mistral: Mistral Medium 3.5
MiniMax: MiniMax M3 vs StepFun: Step 3.7 Flash
MiniMax: MiniMax M3 vs Mistral: Mistral Medium 3.5
StepFun: Step 3.7 Flash vs xAI: Grok Build 0.1
StepFun: Step 3.7 Flash vs Google: Gemini 3.5 Flash