google

Google: Gemma 4 31B (free)

Gemma 4 31B is a Google model available here at no cost. It accepts text, images, and video as inputs, giving it multimodal range that text-only free alternatives lack. Its context window reaches 262,144 tokens, and it supports both tool use and reasoning, making it viable for agentic workflows and multi-step tasks. Outputs are capped at 32,768 tokens per completion. Structured output support is unconfirmed based on available data. The free pricing makes Gemma 4 31B worth shortlisting for developers prototyping on a budget or teams evaluating multimodal pipelines before committing to paid options. The significant caveat is benchmark coverage: no independent scores are available yet, so there is no external data to confirm how it ranks against comparable models on coding, reasoning, or instruction-following tasks. Users should treat performance as unproven and run their own task-specific tests before relying on it in production.

Quality Score
100/100
price + capability + benchmarks
Input Price
Free
per 1M tokens
Output Price
Free
per 1M tokens
Context Window
262,144
tokens
Model ID
google/gemma-4-31b-it:free
Vendor
google
Tokenizer
Gemma
Input Modalities
image, text, video
Output Modalities
text
Max Output
32,768 tokens
Tool Calling
✓ supported
Structured Output
✓ supported
Reasoning Mode
✓ supported
Vision
✓ accepts images
Audio
no
Moderated
no

Strong choice for

Category rankings

Where Google: Gemma 4 31B (free) places across the 6 categories it ranks in. How we rank →

#CategoryScore
#3 Social Media PostsWriting · of 25 ranked 120
#3 Voice Assistant BackendVoice · of 25 ranked 124
#3 Cheap Bulk InferenceCost · of 25 ranked 138
#3 Self-Hosted / LocalCost · of 25 ranked 118
#5 Real-Time ChatLatency · of 25 ranked 118
#12 Video Auto-TaggingVideo · of 25 ranked 123

Similar models