google

Google: Gemma 4 31B (free)

Gemma 4 31B is a free multimodal model from Google that accepts text, images, and video as input. Its context window runs to 262,144 tokens, which accommodates long documents or extended conversations without truncation. Responses are capped at 8,192 output tokens. The model supports tool use and reasoning, making it usable for agentic workflows and multi-step tasks, though structured output support is unconfirmed. For comparison purposes, the zero cost makes Gemma 4 31B worth shortlisting for developers prototyping multimodal or tool-integrated applications on a tight budget. The significant caveat is that it currently has no independent benchmark coverage, so its quality relative to paid alternatives is unverified. Users willing to evaluate it against their own tasks will pay nothing to find out, but those who need a reliable quality baseline before committing should wait for third-party benchmark data to emerge.

Query via API → View on google → Estimate cost

Quality Score

100/100

price + capability + benchmarks

Input Price

Free

per 1M tokens

Output Price

Free

per 1M tokens

Context Window

262,144

tokens

Model ID: google/gemma-4-31b-it:free
Vendor: google
Tokenizer: Gemma
Input Modalities: image, text, video
Output Modalities: text
Max Output: 8,192 tokens
Tool Calling: ✓ supported
Structured Output: ✓ supported
Reasoning Mode: ✓ supported
Vision: ✓ accepts images
Audio: no
Moderated: yes

Strong choice for

Writing

Category rankings

Where Google: Gemma 4 31B (free) places across the 6 categories it ranks in. How we rank →

#	Category	Score
#3	Social Media PostsWriting · of 25 ranked	120
#3	Voice Assistant BackendVoice · of 25 ranked	124
#3	Cheap Bulk InferenceCost · of 25 ranked	138
#3	Self-Hosted / LocalCost · of 25 ranked	118
#5	Real-Time ChatLatency · of 25 ranked	118
#12	Video Auto-TaggingVideo · of 25 ranked	123

Similar models

google

Google: Gemma 4 31B (free)

Strong choice for

Social Media Posts

Voice Assistant Backend

Cheap Bulk Inference

Self-Hosted / Local

Real-Time Chat

Category rankings

Similar models

Google: Gemini 3.5 Flash

Google: Gemini 3.1 Flash Lite

Google: Gemma 4 26B A4B (free)

Google: Gemma 4 26B A4B

Google: Gemma 4 31B

Google: Gemini 3.1 Flash Lite Preview