inclusionai

inclusionAI: Ling-2.6-flash

Ling-2.6-flash is a text-only model from inclusionAI with a 262,144-token context window and support for tool use. It does not support reasoning modes or structured output, and accepts no image or audio input. The 32,768-token output ceiling is adequate for most document-length tasks, and the long context makes it suitable for processing large codebases or lengthy documents in a single pass. At $0.01 per million input tokens and $0.03 per million output tokens, this is a low-cost option worth considering for high-volume text workloads where budget matters more than top-tier performance. However, its blended benchmark score of 30.9 is based on only one independent benchmark, so performance claims are not well corroborated. Buyers who need broad capability validation before committing should treat that score as a limited signal and test the model against their own use cases before scaling.

Quality Score
95/100
price + capability + benchmarks
Input Price
$0.01
per 1M tokens
Output Price
$0.03
per 1M tokens
Context Window
262,144
tokens
Model ID
inclusionai/ling-2.6-flash
Vendor
inclusionai
Tokenizer
Other
Input Modalities
text
Output Modalities
text
Max Output
32,768 tokens
Tool Calling
✓ supported
Structured Output
✓ supported
Reasoning Mode
not supported
Vision
text only
Audio
no
Moderated
no

Similar models