Google: Gemma 4 26B A4B (free)
Gemma 4 26B A4B is a multimodal model from Google that accepts image, text, and video inputs. It supports tool use and reasoning, and offers a context window of 262,144 tokens with a maximum of 32,768 output tokens per response. Structured output support is unconfirmed. The model is free to use, with no input or output costs. For teams evaluating cost-sensitive workflows, the zero-dollar price point makes it easy to test at scale without budget risk. That said, there is currently no independent benchmark coverage to gauge where it stands against other models in the same tier. Shortlisting it makes sense if you need multimodal input handling or long-context processing and want to run your own evaluations, but buyers who rely on published benchmarks to make decisions will find the current evidence base insufficient.
- Model ID
- google/gemma-4-26b-a4b-it:free
- Vendor
- Tokenizer
- Gemma
- Input Modalities
- image, text, video
- Output Modalities
- text
- Max Output
- 32,768 tokens
- Tool Calling
- ✓ supported
- Structured Output
- ✓ supported
- Reasoning Mode
- ✓ supported
- Vision
- ✓ accepts images
- Audio
- no
- Moderated
- no
Strong choice for
Social Media Posts
Voice Assistant Backend
Cheap Bulk Inference
Self-Hosted / Local
Real-Time Chat
Category rankings
Where Google: Gemma 4 26B A4B (free) places across the 6 categories it ranks in. How we rank →
| # | Category | Score |
|---|---|---|
| #2 | Social Media PostsWriting · of 25 ranked | 120 |
| #2 | Voice Assistant BackendVoice · of 25 ranked | 124 |
| #2 | Cheap Bulk InferenceCost · of 25 ranked | 138 |
| #2 | Self-Hosted / LocalCost · of 25 ranked | 118 |
| #3 | Real-Time ChatLatency · of 25 ranked | 118 |
| #10 | Video Auto-TaggingVideo · of 25 ranked | 123 |