Cost · best for
Best AI model for Self-Hosted / Local (2026)
Open-weights models you can run yourself. Ranked from 346 live models on the OpenRouter catalog, weighted for low cost.
| # | Model | Score | In / 1M | Out / 1M | Context | |
|---|---|---|---|---|---|---|
| 1 | Pareto Code Routeropenrouter/pareto-code | 600118 | $-1000000.00 | $-1000000.00 | 200,000 | Try → |
| 2 | Body Builder (beta)openrouter/bodybuilder | 600118 | $-1000000.00 | $-1000000.00 | 128,000 | Try → |
| 3 | Auto Routeropenrouter/auto | 600118 | $-1000000.00 | $-1000000.00 | 2,000,000 | Try → |
| 4 | Google: Gemma 4 26B A4B (free)google/gemma-4-26b-a4b-it:free | 118 | Free | Free | 262,144 | Try → |
| 5 | Google: Gemma 4 31B (free)google/gemma-4-31b-it:free | 118 | Free | Free | 262,144 | Try → |
| 6 | Google: Gemma 4 26B A4B google/gemma-4-26b-a4b-it | 117 | $0.07 | $0.35 | 262,144 | Try → |
| 7 | Qwen: Qwen3.5-9Bqwen/qwen3.5-9b | 117 | $0.10 | $0.15 | 262,144 | Try → |
| 8 | Qwen: Qwen3.5-Flashqwen/qwen3.5-flash-02-23 | 117 | $0.07 | $0.26 | 1,000,000 | Try → |
| 9 | ByteDance Seed: Seed 1.6 Flashbytedance-seed/seed-1.6-flash | 117 | $0.07 | $0.30 | 262,144 | Try → |
| 10 | OpenAI: GPT-5 Nanoopenai/gpt-5-nano | 117 | $0.05 | $0.40 | 400,000 | Try → |
| 11 | Google: Gemini 2.0 Flash Litegoogle/gemini-2.0-flash-lite-001 | 117 | $0.07 | $0.30 | 1,048,576 | Try → |
| 12 | Google: Gemma 4 31Bgoogle/gemma-4-31b-it | 117 | $0.13 | $0.38 | 262,144 | Try → |
| 13 | Mistral: Mistral Small 4mistralai/mistral-small-2603 | 117 | $0.15 | $0.60 | 262,144 | Try → |
| 14 | ByteDance Seed: Seed-2.0-Minibytedance-seed/seed-2.0-mini | 117 | $0.10 | $0.40 | 262,144 | Try → |
| 15 | xAI: Grok 4.1 Fastx-ai/grok-4.1-fast | 117 | $0.20 | $0.50 | 2,000,000 | Try → |
How we ranked these
For Self-Hosted / Local, we weight models on low cost. Higher means better. Scores combine OpenRouter's model metadata (context length, modality support, tool calling, structured output, reasoning capability) with public pricing. See full methodology →