We collect cookies to analyze our website traffic and performance; we never collect any personal data. Cookie Policy
Accept
NEW YORK DAWN™NEW YORK DAWN™NEW YORK DAWN™
Notification Show More
Font ResizerAa
  • Home
  • Trending
  • New York
  • World
  • Politics
  • Business
    • Business
    • Economy
    • Real Estate
  • Crypto & NFTs
  • Tech
  • Lifestyle
    • Lifestyle
    • Food
    • Travel
    • Fashion
    • Art
  • Health
  • Sports
  • Entertainment
Reading: Alibaba releases Qwen with Questions, an open reasoning mannequin that beats o1-preview
Share
Font ResizerAa
NEW YORK DAWN™NEW YORK DAWN™
Search
  • Home
  • Trending
  • New York
  • World
  • Politics
  • Business
    • Business
    • Economy
    • Real Estate
  • Crypto & NFTs
  • Tech
  • Lifestyle
    • Lifestyle
    • Food
    • Travel
    • Fashion
    • Art
  • Health
  • Sports
  • Entertainment
Follow US
NEW YORK DAWN™ > Blog > Technology > Alibaba releases Qwen with Questions, an open reasoning mannequin that beats o1-preview
Alibaba releases Qwen with Questions, an open reasoning mannequin that beats o1-preview
Technology

Alibaba releases Qwen with Questions, an open reasoning mannequin that beats o1-preview

Last updated: November 29, 2024 3:45 pm
Editorial Board Published November 29, 2024
Share
SHARE

Chinese language e-commerce large Alibaba has launched the newest mannequin in its ever-expanding Qwen household. This one is called Qwen with Questions (QwQ), and serves as the newest open supply competitor to OpenAI’s o1 reasoning mannequin.

Like different giant reasoning fashions (LRMs), QwQ makes use of further compute cycles throughout inference to overview its solutions and proper its errors, making it extra appropriate for duties that require logical reasoning and planning like math and coding.

What’s Qwen with Questions (OwQ?) and might or not it’s used for business functions?

Alibaba has launched a 32-billion-parameter model of QwQ with a 32,000-token context. The mannequin is at present in preview, which implies a higher-performing model is more likely to comply with.

Based on Alibaba’s exams, QwQ beats o1-preview on the AIME and MATH benchmarks, which consider mathematical problem-solving talents. It additionally outperforms o1-mini on GPQA, a benchmark for scientific reasoning. QwQ is inferior to o1 on the LiveCodeBench coding benchmarks however nonetheless outperforms different frontier fashions resembling GPT-4o and Claude 3.5 Sonnet.

Instance output of Qwen with Questions

QwQ doesn’t include an accompanying paper that describes the info or the method used to coach the mannequin, which makes it tough to breed the mannequin’s outcomes. Nevertheless, for the reason that mannequin is open, in contrast to OpenAI o1, its “thinking process” is just not hidden and can be utilized to make sense of how the mannequin causes when fixing issues.

Alibaba has additionally launched the mannequin underneath an Apache 2.0 license, which implies it may be used for business functions.

‘We discovered something profound’

Based on a weblog submit that was printed together with the mannequin’s launch, “Through deep exploration and countless trials, we discovered something profound: when given time to ponder, to question, and to reflect, the model’s understanding of mathematics and programming blossoms like a flower opening to the sun… This process of careful reflection and self-questioning leads to remarkable breakthroughs in solving complex problems.”

That is similar to what we learn about how reasoning fashions work. By producing extra tokens and reviewing their earlier responses, the fashions usually tend to right potential errors. Marco-o1, one other reasoning mannequin not too long ago launched by Alibaba may also include hints of how QwQ could be working. Marco-o1 makes use of Monte Carlo Tree Search (MCTS) and self-reflection at inference time to create totally different branches of reasoning and select the very best solutions. The mannequin was educated on a combination of chain-of-thought (CoT) examples and artificial information generated with MCTS algorithms.

Alibaba factors out that QwQ nonetheless has limitations resembling mixing languages or getting caught in round reasoning loops. The mannequin is obtainable for obtain on Hugging Face and a web-based demo could be discovered on Hugging Face Areas.

The LLM age offers strategy to LRMs: Massive Reasoning Fashions

The discharge of o1 has triggered rising curiosity in creating LRMs, despite the fact that not a lot is understood about how the mannequin works underneath the hood apart from utilizing inference-time scale to enhance the mannequin’s responses. 

There are actually a number of Chinese language opponents to o1. Chinese language AI lab DeepSeek not too long ago launched R1-Lite-Preview, its o1 competitor, which is at present solely obtainable by way of the corporate’s on-line chat interface. R1-Lite-Preview reportedly beats o1 on a number of key benchmarks.

One other not too long ago launched mannequin is LLaVA-o1, developed by researchers from a number of universities in China, which brings the inference-time reasoning paradigm to open-source imaginative and prescient language fashions (VLMs). 

The concentrate on LRMs comes at a time of uncertainty about the way forward for mannequin scaling legal guidelines. Experiences point out that AI labs resembling OpenAI, Google DeepMind, and Anthropic are getting diminishing returns on coaching bigger fashions. And creating bigger volumes of high quality coaching information is changing into more and more tough as fashions are already being educated on trillions of tokens gathered from the web. 

In the meantime, inference-time scale provides another which may present the following breakthrough in enhancing the talents of the following technology of AI fashions. There are studies that OpenAI is utilizing o1 to generate artificial reasoning information to coach the following technology of its LLMs. The discharge of open reasoning fashions is more likely to stimulate progress and make the house extra aggressive.

VB Day by day

By subscribing, you comply with VentureBeat’s Phrases of Service.

An error occured.

You Might Also Like

Between utopia and collapse: Navigating AI’s murky center future

From chatbots to collaborators: How AI brokers are reshaping enterprise work

Kayak and Expedia race to construct AI journey brokers that flip social posts into itineraries

From 30 days to 1: Chevron’s cloud migration ROI in actual numbers

Enterprise giants Atlassian, Intuit, and AWS are planning for a world the place brokers name the APIs

TAGGED:Alibababeatsmodelo1previewopenquestionsQwenreasoningreleases
Share This Article
Facebook Twitter Email Print

Follow US

Find US on Social Medias
FacebookLike
TwitterFollow
YoutubeSubscribe
TelegramFollow
Popular News
Now it’s TikTok father or mother ByteDance’s flip for a reasoning AI: enter Seed-Considering-v1.5!
Technology

Now it’s TikTok father or mother ByteDance’s flip for a reasoning AI: enter Seed-Considering-v1.5!

Editorial Board April 11, 2025
Republicans Who Assailed Biden’s Stimulus Bill Are Embracing the Money
Who Is Behind QAnon? Linguistic Detectives Find Fingerprints
From Actress to Chef: How Chloe-Charlotte Crampton Discovered Her True Calling
The FBI is disbanding certainly one of its Washington-based public corruption squads, AP sources say

You Might Also Like

Capital One builds agentic AI to supercharge auto gross sales
Technology

Capital One builds agentic AI to supercharge auto gross sales

July 7, 2025
Capital One builds agentic AI to supercharge auto gross sales
Technology

Vivid Knowledge beat Elon Musk and Meta in court docket — now its $100M AI platform is taking over Huge Tech

July 7, 2025
HOLY SMOKES! A brand new, 200% sooner DeepSeek R1-0528 variant seems from German lab TNG Expertise Consulting GmbH
Technology

HOLY SMOKES! A brand new, 200% sooner DeepSeek R1-0528 variant seems from German lab TNG Expertise Consulting GmbH

July 7, 2025
Capital One builds agentic AI to supercharge auto gross sales
Technology

Mud hits $6M ARR serving to enterprises construct AI brokers that truly do stuff as a substitute of simply speaking

July 7, 2025

Categories

  • Health
  • Sports
  • Politics
  • Entertainment
  • Technology
  • World
  • Art

About US

New York Dawn is a proud and integral publication of the Enspirers News Group, embodying the values of journalistic integrity and excellence.
Company
  • About Us
  • Newsroom Policies & Standards
  • Diversity & Inclusion
  • Careers
  • Media & Community Relations
  • Accessibility Statement
Contact Us
  • Contact Us
  • Contact Customer Care
  • Advertise
  • Licensing & Syndication
  • Request a Correction
  • Contact the Newsroom
  • Send a News Tip
  • Report a Vulnerability
Term of Use
  • Digital Products Terms of Sale
  • Terms of Service
  • Privacy Policy
  • Cookie Settings
  • Submissions & Discussion Policy
  • RSS Terms of Service
  • Ad Choices
© 2024 New York Dawn. All Rights Reserved.
Welcome Back!

Sign in to your account

Lost your password?