Moneycontrol PRO
Outskill Gea AI
Outskill Gea AI
HomeTechnologyAlibaba launches Qwen2.5-VL, its new AI language model to take on DeepSeek and ChatGPT: What is it, features and more

Alibaba launches Qwen2.5-VL, its new AI language model to take on DeepSeek and ChatGPT: What is it, features and more

Soon after DeepSeek’s announcement, another Chinese company – Alibaba – launched its latest LLM (Large Language Model) called Qwen2.5. The new LLM from the company positioned it as a competitor to DeepSeek-V3 and other leading AI models like GPT-4 and Gemini.

January 29, 2025 / 16:46 IST
Alibaba Qwen 2.5

Soon after DeepSeek’s announcement, another Chinese company – Alibaba – launched its latest LLM (Large Language Model) called Qwen2.5. The new LLM from the company positioned it as a competitor to DeepSeek-V3 and other leading AI models like GPT-4 and Gemini. The company claims the model outperforms OpenAI’s GPT-4o, Meta’s Llama 3.1-405B, and DeepSeek-V3. The launch was announced on the first day of the Lunar New Year.

What is Alibaba Qwen 2.5?

Qwen 2.5 is Alibaba’s latest artificial intelligence model, designed to compete with DeepSeek-V3 and OpenAI’s GPT-4o. Qwen 2.5 is pre-trained on large-scale multilingual and multimodal data and post-trained on quality data to align with human preferences. It supports natural language understanding, text generation, vision and audio processing, AI tool use, and chatbot role-play. The latest update includes enhancements in structured data comprehension and long-text generation.

Qwen 2.5: Features

Qwen 2.5 consists of dense, decoder-only language models ranging from 0.5 billion to 72 billion parameters, available in base and instruct variants. The model is pre-trained on an 18-trillion-token dataset, supporting multilingual capabilities in 29 languages, including Chinese, English, Spanish, and Arabic. It can process up to 128,000 tokens in context and generate outputs of up to 8,000 tokens.

In addition to language capabilities, Qwen 2.5 improves document parsing, object detection, and video understanding. It enhances accuracy in identifying objects across multiple formats, including JSON,

Additionally, the Qwen 2.5 comes with improved video understanding, taking a step up against the competition’s image understanding through ultra-long video processing and fine-grained video grounding. The model applies dynamic resolution and frame rate training for better temporal understanding, helping it extract event segments efficiently. It also integrates a streamlined vision encoder that improves both training and inference speeds.

Competitive landscape

DeepSeek-V3’s rise has influenced AI market trends, leading to upgrades from Alibaba, ByteDance, and Tencent. DeepSeek’s open-source model previously triggered an AI price war in China, prompting cost reductions among competitors. Alibaba’s latest model aims to provide an alternative with advanced multimodal functions and optimised inference speeds.

Invite your friends and family to sign up for MC Tech 3, our daily newsletter that breaks down the biggest tech and startup stories of the day

Moneycontrol News
first published: Jan 29, 2025 03:54 pm

Discover the latest Business News, Sensex, and Nifty updates. Obtain Personal Finance insights, tax queries, and expert opinions on Moneycontrol or download the Moneycontrol App to stay updated!

Subscribe to Tech Newsletters

  • On Saturdays

    Find the best of Al News in one place, specially curated for you every weekend.

  • Daily-Weekdays

    Stay on top of the latest tech trends and biggest startup news.

Advisory Alert: It has come to our attention that certain individuals are representing themselves as affiliates of Moneycontrol and soliciting funds on the false promise of assured returns on their investments. We wish to reiterate that Moneycontrol does not solicit funds from investors and neither does it promise any assured returns. In case you are approached by anyone making such claims, please write to us at grievanceofficer@nw18.com or call on 02268882347
CloseGen AI Masterclass