head-to-head

StepFun: Step 3.7 Flash vs Qwen: Qwen3.6 Plus

Side-by-side comparison of specs, pricing, benchmark scores, and task rankings. Updated 2026-06-22.

StepFun: Step 3.7 Flash Qwen: Qwen3.6 Plus
Vendorstepfunqwen
Quality Score100100
Benchmark Score48.068.1
Input Price$0.20/M$0.33/M
Output Price$1.15/M$1.95/M
Context Window256,0001,000,000
Max Output256,00065,536
Tool Calling
Structured Output
Reasoning Mode
Vision
Audio--
Benchmark Scores
ai_index49.165.3
ai_index_agentic35.545.5
ai_index_coding61.690.0

Who wins by task?

TaskStepFun: Step 3.7 FlashQwen: Qwen3.6 Plus
SQL Generation 152 163
Code Review 145 158
Code Completion 129 132
Code Refactoring 143 157
Bug Fixing 154 168
Unit Test Generation 138 148
Code Documentation 132 140
Regex Writing 129 132
CI/CD Pipelines 131 139
Frontend Component Design 135 141
Data Analysis 149 159
CSV / Spreadsheet Cleanup 140 151
ETL Scripting 137 148
JSON Extraction 142 146
Bulk Data Labeling 133 134
OCR / Document Parsing 137 144
Table Extraction from PDFs 137 144
Long-Document Summarization 141 154
Short-Form Summarization 128 130
Blog Post Writing 129 135

Scores reflect capability match + benchmark data + pricing for each task. Methodology →

Related comparisons

MoonshotAI: Kimi K2.7 Code vs StepFun: Step 3.7 Flash MoonshotAI: Kimi K2.7 Code vs Qwen: Qwen3.6 Plus Qwen: Qwen3.7 Plus vs StepFun: Step 3.7 Flash Qwen: Qwen3.7 Plus vs Qwen: Qwen3.6 Plus MiniMax: MiniMax M3 vs StepFun: Step 3.7 Flash MiniMax: MiniMax M3 vs Qwen: Qwen3.6 Plus StepFun: Step 3.7 Flash vs xAI: Grok Build 0.1 StepFun: Step 3.7 Flash vs Google: Gemini 3.5 Flash