head-to-head
MiniMax: MiniMax M3 vs StepFun: Step 3.7 Flash
Side-by-side comparison of specs, pricing, benchmark scores, and task rankings. Updated 2026-06-23.
| MiniMax: MiniMax M3 | StepFun: Step 3.7 Flash | |
|---|---|---|
| Vendor | minimax | stepfun |
| Quality Score | 100 | 100 |
| Benchmark Score | 78.2 | 48.0 |
| Input Price | $0.30/M | $0.20/M |
| Output Price | $1.20/M | $1.15/M |
| Context Window | 1,048,576 | 256,000 |
| Max Output | 512,000 | 256,000 |
| Tool Calling | ✓ | ✓ |
| Structured Output | ✓ | ✓ |
| Reasoning Mode | ✓ | ✓ |
| Vision | ✓ | ✓ |
| Audio | - | - |
| Benchmark Scores | ||
| ai_index | 73.3 | 49.1 |
| ai_index_agentic | 58.3 | 35.5 |
| ai_index_coding | 96.6 | 61.6 |
Who wins by task?
| Task | MiniMax: MiniMax M3 | StepFun: Step 3.7 Flash |
|---|---|---|
| SQL Generation | 167 | 152 |
| Code Review | 162 | 145 |
| Code Completion | 133 | 129 |
| Code Refactoring | 160 | 143 |
| Bug Fixing | 173 | 154 |
| Unit Test Generation | 151 | 138 |
| Code Documentation | 142 | 132 |
| Regex Writing | 134 | 129 |
| CI/CD Pipelines | 141 | 131 |
| Frontend Component Design | 143 | 135 |
| Data Analysis | 164 | 149 |
| CSV / Spreadsheet Cleanup | 153 | 140 |
| ETL Scripting | 151 | 137 |
| JSON Extraction | 147 | 142 |
| Bulk Data Labeling | 135 | 133 |
| OCR / Document Parsing | 145 | 137 |
| Table Extraction from PDFs | 145 | 137 |
| Long-Document Summarization | 156 | 141 |
| Short-Form Summarization | 131 | 128 |
| Blog Post Writing | 138 | 129 |
Scores reflect capability match + benchmark data + pricing for each task. Methodology →
Related comparisons
MoonshotAI: Kimi K2.7 Code vs MiniMax: MiniMax M3
MoonshotAI: Kimi K2.7 Code vs StepFun: Step 3.7 Flash
Qwen: Qwen3.7 Plus vs MiniMax: MiniMax M3
Qwen: Qwen3.7 Plus vs StepFun: Step 3.7 Flash
MiniMax: MiniMax M3 vs xAI: Grok Build 0.1
MiniMax: MiniMax M3 vs Google: Gemini 3.5 Flash
MiniMax: MiniMax M3 vs Google: Gemini 3.1 Flash Lite
MiniMax: MiniMax M3 vs xAI: Grok 4.3