Kandinsky 5.0 Image
is a high-resolution text-to-image generation model (6B parameters)
that achieves state-of-the-art visual quality and prompt alignment. It
outperforms leading open-source models like FLUX.1[dev] and Qwen-Image
in aesthetic realism and compositional accuracy.
We provide a comprehensive suite of optimized variants for different workflows:
- RL-finetuned model — delivers the highest visual fidelity and realism.
- SFT-soup model — excels in prompt following and overall visual quality.
- Pretrain checkpoint — designed for researchers to conduct further fine-tuning and experimentation.
Additionally, we provide Kandinsky 5.0 Image Editing, a specialized variant derived from the base Image model and fine-tuned on an instruction dataset for precise, context-aware image editing
(e.g., inpainting, object replacement, style transfer).
All models generate images
at resolutions up to 1024x1024 pixels.
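
The 1024x1024 limit can be applied to a requested size before generation. The following is a minimal sketch, not part of the Kandinsky codebase: `fit_resolution` is a hypothetical helper, it assumes the limit applies to each side independently, and the rounding to a multiple of 16 is an assumption commonly made for latent diffusion models rather than a documented requirement of this model.

```python
from typing import Tuple

MAX_SIDE = 1024  # from the text above: resolutions up to 1024x1024


def fit_resolution(
    width: int,
    height: int,
    max_side: int = MAX_SIDE,
    multiple: int = 16,  # assumption: sides should be divisible by 16
) -> Tuple[int, int]:
    """Shrink (width, height) to fit within max_side, keeping the aspect ratio.

    Hypothetical helper for illustration only.
    """
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")

    longest = max(width, height)
    if longest > max_side:
        # Integer arithmetic avoids floating-point rounding surprises.
        width = width * max_side // longest
        height = height * max_side // longest

    def snap(value: int) -> int:
        # Round down to the nearest multiple, but never below one multiple.
        return max(multiple, value // multiple * multiple)

    return snap(width), snap(height)


if __name__ == "__main__":
    for size in [(1920, 1080), (800, 600), (1024, 1024)]:
        print(size, "->", fit_resolution(*size))
```

Sizes already within the limit are only rounded down to the assumed multiple, while larger requests are scaled down proportionally.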