head-to-head

StepFun: Step 3.7 Flash vs Mistral: Mistral Medium 3.5

Side-by-side comparison of specs, pricing, benchmark scores, and task rankings. Updated 2026-06-22.

StepFun: Step 3.7 Flash Mistral: Mistral Medium 3.5
Vendorstepfunmistralai
Quality Score100100
Benchmark Score48.052.2
Input Price$0.20/M$1.50/M
Output Price$1.15/M$7.50/M
Context Window256,000262,144
Max Output256,000-
Tool Calling
Structured Output
Reasoning Mode
Vision
Audio--
Benchmark Scores
ai_index49.149.4
ai_index_agentic35.531.3
ai_index_coding61.677.4

Who wins by task?

TaskStepFun: Step 3.7 FlashMistral: Mistral Medium 3.5
SQL Generation 152 153
Code Review 145 147
Code Completion 129 116
Code Refactoring 143 144
Bug Fixing 154 155
Unit Test Generation 138 140
Code Documentation 132 131
Regex Writing 129 128
CI/CD Pipelines 131 132
Frontend Component Design 135 137
Data Analysis 149 151
CSV / Spreadsheet Cleanup 140 142
ETL Scripting 137 138
JSON Extraction 142 134
Bulk Data Labeling 133 123
OCR / Document Parsing 137 139
Table Extraction from PDFs 137 139
Short-Form Summarization 128 119
Blog Post Writing 129 129

Scores reflect capability match + benchmark data + pricing for each task. Methodology →

Related comparisons

MoonshotAI: Kimi K2.7 Code vs StepFun: Step 3.7 Flash MoonshotAI: Kimi K2.7 Code vs Mistral: Mistral Medium 3.5 Qwen: Qwen3.7 Plus vs StepFun: Step 3.7 Flash Qwen: Qwen3.7 Plus vs Mistral: Mistral Medium 3.5 MiniMax: MiniMax M3 vs StepFun: Step 3.7 Flash MiniMax: MiniMax M3 vs Mistral: Mistral Medium 3.5 StepFun: Step 3.7 Flash vs xAI: Grok Build 0.1 StepFun: Step 3.7 Flash vs Google: Gemini 3.5 Flash