Google: Gemma 4 31B (free)
Gemma 4 31B is a Google model available here at no cost. It accepts text, images, and video as inputs, giving it multimodal range that text-only free alternatives lack. Its context window reaches 262,144 tokens, and it supports both tool use and reasoning, making it viable for agentic workflows and multi-step tasks. Outputs are capped at 32,768 tokens per completion. Structured output support is unconfirmed based on available data. The free pricing makes Gemma 4 31B worth shortlisting for developers prototyping on a budget or teams evaluating multimodal pipelines before committing to paid options. The significant caveat is benchmark coverage: no independent scores are available yet, so there is no external data to confirm how it ranks against comparable models on coding, reasoning, or instruction-following tasks. Users should treat performance as unproven and run their own task-specific tests before relying on it in production.
- Model ID
- google/gemma-4-31b-it:free
- Vendor
- Tokenizer
- Gemma
- Input Modalities
- image, text, video
- Output Modalities
- text
- Max Output
- 32,768 tokens
- Tool Calling
- ✓ supported
- Structured Output
- ✓ supported
- Reasoning Mode
- ✓ supported
- Vision
- ✓ accepts images
- Audio
- no
- Moderated
- no
Strong choice for
Social Media Posts
Voice Assistant Backend
Cheap Bulk Inference
Self-Hosted / Local
Real-Time Chat
Category rankings
Where Google: Gemma 4 31B (free) places across the 6 categories it ranks in. How we rank →
| # | Category | Score |
|---|---|---|
| #3 | Social Media PostsWriting · of 25 ranked | 120 |
| #3 | Voice Assistant BackendVoice · of 25 ranked | 124 |
| #3 | Cheap Bulk InferenceCost · of 25 ranked | 138 |
| #3 | Self-Hosted / LocalCost · of 25 ranked | 118 |
| #5 | Real-Time ChatLatency · of 25 ranked | 118 |
| #12 | Video Auto-TaggingVideo · of 25 ranked | 123 |