google

Google: Gemma 4 26B A4B (free)

Gemma 4 26B A4B is a multimodal model from Google that accepts image, text, and video inputs. It supports tool use and reasoning, and offers a context window of 262,144 tokens with a maximum of 32,768 output tokens per response. Structured output support is unconfirmed. The model is free to use, with no input or output costs. For teams evaluating cost-sensitive workflows, the zero-dollar price point makes it easy to test at scale without budget risk. That said, there is currently no independent benchmark coverage to gauge where it stands against other models in the same tier. Shortlisting it makes sense if you need multimodal input handling or long-context processing and want to run your own evaluations, but buyers who rely on published benchmarks to make decisions will find the current evidence base insufficient.

Quality Score
100/100
price + capability + benchmarks
Input Price
Free
per 1M tokens
Output Price
Free
per 1M tokens
Context Window
262,144
tokens
Model ID
google/gemma-4-26b-a4b-it:free
Vendor
google
Tokenizer
Gemma
Input Modalities
image, text, video
Output Modalities
text
Max Output
32,768 tokens
Tool Calling
✓ supported
Structured Output
✓ supported
Reasoning Mode
✓ supported
Vision
✓ accepts images
Audio
no
Moderated
no

Strong choice for

Category rankings

Where Google: Gemma 4 26B A4B (free) places across the 6 categories it ranks in. How we rank →

#CategoryScore
#2 Social Media PostsWriting · of 25 ranked 120
#2 Voice Assistant BackendVoice · of 25 ranked 124
#2 Cheap Bulk InferenceCost · of 25 ranked 138
#2 Self-Hosted / LocalCost · of 25 ranked 118
#3 Real-Time ChatLatency · of 25 ranked 118
#10 Video Auto-TaggingVideo · of 25 ranked 123

Similar models