Google: Nano Banana Pro (Gemini 3 Pro Image Preview)
Google's Nano Banana Pro is a paid multimodal model from Google that accepts both text and image inputs, with a 65,536-token context window and up to 32,768 tokens of output. It supports reasoning, but does not support tool use, and structured output support is unconfirmed. If your workflow requires function calling or guaranteed structured responses, this model is not a safe choice based on available information. On pricing, input runs $2.00 per million tokens and output $12.00 per million, placing it in the mid-to-upper cost range for image-capable models. There is no independent benchmark coverage yet, so performance relative to competitors is genuinely unknown rather than just unreported. Buyers who need a vision-capable model with reasoning and can tolerate that uncertainty may find it worth evaluating, but anyone making a cost-justified decision should wait for third-party results before committing.
- Model ID
- google/gemini-3-pro-image-preview
- Vendor
- Tokenizer
- Gemini
- Input Modalities
- image, text
- Output Modalities
- image, text
- Max Output
- 32,768 tokens
- Tool Calling
- not supported
- Structured Output
- ✓ supported
- Reasoning Mode
- ✓ supported
- Vision
- ✓ accepts images
- Audio
- no
- Moderated
- no
Category rankings
Where Google: Nano Banana Pro (Gemini 3 Pro Image Preview) places across the 1 category it ranks in. How we rank →
| # | Category | Score |
|---|---|---|
| #7 | Image GenerationVision · of 8 ranked | 86 |