
Alibaba launches Qwen2.5-VL, its new AI language model to take on DeepSeek and ChatGPT: What is it, features and more

Soon after DeepSeek’s announcement, another Chinese company, Alibaba, launched its latest large language model (LLM), Qwen2.5. The company has positioned the new model as a competitor to DeepSeek-V3 and other leading AI models such as GPT-4 and Gemini.

January 29, 2025 / 16:46 IST
Alibaba Qwen 2.5

Alibaba claims the model outperforms OpenAI’s GPT-4o, Meta’s Llama 3.1-405B, and DeepSeek-V3. The launch was announced on the first day of the Lunar New Year.

What is Alibaba Qwen 2.5?
Qwen 2.5 is Alibaba’s latest artificial intelligence model, designed to compete with DeepSeek-V3 and OpenAI’s GPT-4o. It is pre-trained on large-scale multilingual and multimodal data, then post-trained on high-quality data to align with human preferences. It supports natural language understanding, text generation, vision and audio processing, AI tool use, and chatbot role-play. The latest update also improves structured data comprehension and long-text generation.


Qwen 2.5: Features

Qwen 2.5 is a family of dense, decoder-only language models ranging from 0.5 billion to 72 billion parameters, available in base and instruct variants. The models are pre-trained on an 18-trillion-token dataset and support 29 languages, including Chinese, English, Spanish, and Arabic. They can process up to 128,000 tokens of context and generate outputs of up to 8,000 tokens.
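
For readers wondering what those token limits mean in practice, the short Python sketch below works through the arithmetic. It is illustrative only: the function names are hypothetical, it calls no Alibaba or Qwen API, and it assumes that the prompt and the generated output share the same 128,000-token window, which the figures above do not confirm.

```python
# Limits as reported in the article; treated here as illustrative figures.
CONTEXT_WINDOW_TOKENS = 128_000  # tokens the model can process in context
MAX_OUTPUT_TOKENS = 8_000        # tokens the model can generate


def max_prompt_tokens(reserved_output: int = MAX_OUTPUT_TOKENS) -> int:
    """Return the prompt budget left after reserving room for the output.

    Assumption (not from the article): prompt and output share one window.
    """
    if not 0 <= reserved_output <= MAX_OUTPUT_TOKENS:
        raise ValueError("reserved_output must be between 0 and MAX_OUTPUT_TOKENS")
    return CONTEXT_WINDOW_TOKENS - reserved_output


def fits_in_context(prompt_tokens: int, reserved_output: int = MAX_OUTPUT_TOKENS) -> bool:
    """Check whether a prompt of the given token count fits the budget."""
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    return prompt_tokens <= max_prompt_tokens(reserved_output)


if __name__ == "__main__":
    print(max_prompt_tokens())       # budget when the full output is reserved
    print(fits_in_context(100_000))  # a 100,000-token prompt
    print(fits_in_context(125_000))  # a 125,000-token prompt
```

Under that assumption, reserving the full 8,000-token output leaves 120,000 tokens for the prompt, so a 100,000-token document would fit while a 125,000-token one would not.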