head-to-head

StepFun: Step 3.7 Flash vs Google: Gemini 3.5 Flash

Side-by-side comparison of specs, pricing, benchmark scores, and task rankings. Updated 2026-06-22.

StepFun: Step 3.7 Flash Google: Gemini 3.5 Flash
Vendorstepfungoogle
Quality Score100100
Benchmark Score48.084.5
Input Price$0.20/M$1.50/M
Output Price$1.15/M$9.00/M
Context Window256,0001,048,576
Max Output256,00065,536
Tool Calling
Structured Output
Reasoning Mode
Vision
Audio-
Benchmark Scores
ai_index49.182.8
ai_index_agentic35.561.8
ai_index_coding61.6100.0

Who wins by task?

TaskStepFun: Step 3.7 FlashGoogle: Gemini 3.5 Flash
SQL Generation 152 168
Code Review 145 164
Code Completion 129 119
Code Refactoring 143 162
Bug Fixing 154 176
Unit Test Generation 138 152
Code Documentation 132 141
Regex Writing 129 133
CI/CD Pipelines 131 143
Frontend Component Design 135 145
Data Analysis 149 167
CSV / Spreadsheet Cleanup 140 153
ETL Scripting 137 152
JSON Extraction 142 138
Bulk Data Labeling 133 124
OCR / Document Parsing 137 146
Table Extraction from PDFs 137 146
Long-Document Summarization 141 157
Short-Form Summarization 128 121
Blog Post Writing 129 138

Scores reflect capability match + benchmark data + pricing for each task. Methodology →

Related comparisons

MoonshotAI: Kimi K2.7 Code vs StepFun: Step 3.7 Flash MoonshotAI: Kimi K2.7 Code vs Google: Gemini 3.5 Flash Qwen: Qwen3.7 Plus vs StepFun: Step 3.7 Flash Qwen: Qwen3.7 Plus vs Google: Gemini 3.5 Flash MiniMax: MiniMax M3 vs StepFun: Step 3.7 Flash MiniMax: MiniMax M3 vs Google: Gemini 3.5 Flash StepFun: Step 3.7 Flash vs xAI: Grok Build 0.1 StepFun: Step 3.7 Flash vs Google: Gemini 3.1 Flash Lite