head-to-head

StepFun: Step 3.7 Flash vs Mistral: Mistral Small 4

Side-by-side comparison of specs, pricing, benchmark scores, and task rankings. Updated 2026-06-22.

StepFun: Step 3.7 Flash Mistral: Mistral Small 4
Vendorstepfunmistralai
Quality Score100100
Benchmark Score48.06.1
Input Price$0.20/M$0.15/M
Output Price$1.15/M$0.60/M
Context Window256,000262,144
Max Output256,000-
Tool Calling
Structured Output
Reasoning Mode
Vision
Audio--
Benchmark Scores
ai_index49.17.7
ai_index_agentic35.5-
ai_index_coding61.6-

Who wins by task?

TaskStepFun: Step 3.7 FlashMistral: Mistral Small 4
SQL Generation 152 132
Code Review 145 127
Code Completion 129 129
Code Refactoring 143 129
Bug Fixing 154 131
Unit Test Generation 138 122
Code Documentation 132 127
Regex Writing 129 120
CI/CD Pipelines 131 118
Frontend Component Design 135 122
Data Analysis 149 125
CSV / Spreadsheet Cleanup 140 128
ETL Scripting 137 123
JSON Extraction 142 131
Bulk Data Labeling 133 129
OCR / Document Parsing 137 128
Table Extraction from PDFs 137 128
Long-Document Summarization 141 130
Short-Form Summarization 128 124
Blog Post Writing 129 120

Scores reflect capability match + benchmark data + pricing for each task. Methodology →

Related comparisons

MoonshotAI: Kimi K2.7 Code vs StepFun: Step 3.7 Flash MoonshotAI: Kimi K2.7 Code vs Mistral: Mistral Small 4 Qwen: Qwen3.7 Plus vs StepFun: Step 3.7 Flash Qwen: Qwen3.7 Plus vs Mistral: Mistral Small 4 MiniMax: MiniMax M3 vs StepFun: Step 3.7 Flash MiniMax: MiniMax M3 vs Mistral: Mistral Small 4 StepFun: Step 3.7 Flash vs xAI: Grok Build 0.1 StepFun: Step 3.7 Flash vs Google: Gemini 3.5 Flash