Qwen-VL is Alibaba's series of vision-language models, with particular strengths in document understanding, multilingual OCR, and fine-grained visual grounding. The Qwen2.5-VL variant performs competitively with leading closed-source models and focuses on practical document and image analysis.