Baidu unveils proprietary ERNIE 5 beating GPT-5 performance on charts, document understanding and more
Technology

Last updated: November 13, 2025 9:36 pm
Editorial Board Published November 13, 2025

Mere hours after OpenAI updated its flagship foundation model GPT-5 to GPT-5.1, promising reduced token usage overall and a more pleasant personality with more preset options, Chinese search giant Baidu unveiled its next-generation foundation model, ERNIE 5.0, alongside a suite of AI product upgrades and strategic international expansions.

The goal: to position itself as a global contender in the increasingly competitive enterprise AI market.

Announced at the company’s Baidu World 2025 event, ERNIE 5.0 is a proprietary, natively omni-modal model designed to jointly process and generate content across text, images, audio, and video.

Unlike Baidu’s recently released ERNIE-4.5-VL-28B-A3B-Thinking, which is open source under an enterprise-friendly and permissive Apache 2.0 license, ERNIE 5.0 is a proprietary model and is available only via Baidu’s ERNIE Bot website (I needed to select it manually from the model picker dropdown) and the Qianfan cloud platform application programming interface (API) for enterprise customers.

Alongside the model release, Baidu introduced major updates to its digital human platform, no-code tools, and general-purpose AI agents, all targeted at expanding its AI footprint beyond China.

The company also introduced ERNIE 5.0 Preview 1022, a variant optimized for text-intensive tasks, alongside the general preview model that balances across modalities.

Baidu emphasized that ERNIE 5.0 represents a shift in how intelligence is deployed at scale, with CEO Robin Li stating: “When you internalize AI, it becomes a native capability and transforms intelligence from a cost into a source of productivity.”

Where ERNIE 5.0 outshines GPT-5 and Gemini 2.5 Pro

ERNIE 5.0’s benchmark results suggest that Baidu has achieved parity, or near-parity, with the top Western foundation models across a wide spectrum of tasks.

In public benchmark slides shared during the Baidu World 2025 event, ERNIE 5.0 Preview outperformed or matched OpenAI’s GPT-5-High and Google’s Gemini 2.5 Pro in multimodal reasoning, document understanding, and image-based QA, while also demonstrating strong language modeling and code execution abilities.

The company emphasized its ability to handle joint inputs and outputs across modalities, rather than relying on post-hoc modality fusion, which it framed as a technical differentiator.

On visual tasks, ERNIE 5.0 achieved leading scores on OCRBench, DocVQA, and ChartQA, three benchmarks that test document recognition, comprehension, and structured data reasoning.

Baidu claims the model beat both GPT-5-High and Gemini 2.5 Pro on these document and chart-based benchmarks, areas it describes as core to enterprise applications like automated document processing and financial analysis.

In image generation, ERNIE 5.0 tied or exceeded Google’s Veo3 across categories including semantic alignment and image quality, according to Baidu’s internal GenEval-based evaluation. Baidu claimed that the model’s multimodal integration allows it to generate and interpret visual content with greater contextual awareness than models relying on modality-specific encoders.

For audio and speech tasks, ERNIE 5.0 demonstrated competitive results on MM-AU and TUT2017 audio understanding benchmarks, as well as question answering from spoken language inputs. Its audio performance, while not as heavily emphasized as vision or text, suggests a broad capability footprint intended to support full-spectrum multimodal applications.

In language tasks, the model showed strong results on instruction following, factual question answering, and mathematical reasoning, core areas that define the enterprise utility of large language models.

The Preview 1022 variant of ERNIE 5.0, tailored for textual performance, showed even stronger language-specific results in early developer access. While Baidu doesn’t claim broad superiority in general language reasoning, its internal evaluations suggest that ERNIE 5.0 Preview 1022 closes the gap with top-tier English-language models and outperforms them in Chinese-language performance.

While Baidu did not release full benchmark details or raw scores publicly, its performance positioning suggests a deliberate attempt to frame ERNIE 5.0 not as a niche multimodal system but as a flagship model competitive with the largest closed models in general-purpose reasoning.

Where Baidu claims a clear lead is in structured document understanding, visual chart reasoning, and integration of multiple modalities into a single, native modeling architecture. Independent verification of these results remains pending, but the breadth of claimed capabilities positions ERNIE 5.0 as a serious alternative in the multimodal foundation model landscape.

Enterprise Pricing Strategy

ERNIE 5.0 is positioned at the premium end of Baidu’s model pricing structure. The company has released specific pricing for API usage on its Qianfan platform, aligning the cost with other top-tier offerings from Chinese competitors like Alibaba.

| Model | Input Cost (per 1K tokens) | Output Cost (per 1K tokens) | Source |
|---|---|---|---|
| ERNIE 5.0 | $0.00085 (¥0.006) | $0.0034 (¥0.024) | Qianfan |
| ERNIE 4.5 Turbo (ex.) | $0.00011 (¥0.0008) | $0.00045 (¥0.0032) | Qianfan |
| Qwen3 (Coder ex.) | $0.00085 (¥0.006) | $0.0034 (¥0.024) | Qianfan |

The contrast in cost between ERNIE 5.0 and earlier models such as ERNIE 4.5 Turbo underscores Baidu’s strategy to differentiate between high-volume, low-cost models and high-capability models designed for complex tasks and multimodal reasoning.
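To make that contrast concrete, here is a minimal Python sketch of the arithmetic, using only the per-1K-token figures quoted above for the two ERNIE models. The dictionary layout and the function names `per_million` and `price_ratio` are illustrative choices, not part of any Baidu or Qianfan tooling.

```python
# Per-1K-token list prices quoted in this article: (input, output).
# The structure and names below are illustrative assumptions.
PRICES_PER_1K = {
    "ERNIE 5.0": {"usd": (0.00085, 0.0034), "cny": (0.006, 0.024)},
    "ERNIE 4.5 Turbo": {"usd": (0.00011, 0.00045), "cny": (0.0008, 0.0032)},
}


def per_million(price_per_1k: float) -> float:
    """Convert a per-1K-token price into a per-1M-token price."""
    return round(price_per_1k * 1000, 6)


def price_ratio(model_a: str, model_b: str, currency: str = "cny") -> tuple:
    """Return (input ratio, output ratio) of model_a's price to model_b's."""
    a_in, a_out = PRICES_PER_1K[model_a][currency]
    b_in, b_out = PRICES_PER_1K[model_b][currency]
    return round(a_in / b_in, 2), round(a_out / b_out, 2)


if __name__ == "__main__":
    usd_in, usd_out = PRICES_PER_1K["ERNIE 5.0"]["usd"]
    print("ERNIE 5.0 per 1M tokens (USD):", per_million(usd_in), per_million(usd_out))
    print("ERNIE 5.0 vs 4.5 Turbo (CNY):", price_ratio("ERNIE 5.0", "ERNIE 4.5 Turbo"))
```

On the yuan figures, ERNIE 5.0 comes out at 7.5 times the price of ERNIE 4.5 Turbo for both input and output. The dollar figures give slightly different ratios because they are rounded conversions.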

Compared to U.S. alternatives, it remains mid-range in pricing:

| Model | Input (per 1M tokens) | Output (per 1M tokens) | Source |
|---|---|---|---|
| GPT-5.1 | $1.25 | $10.00 | OpenAI |
| ERNIE 5.0 | $0.85 | $3.40 | Qianfan |
| ERNIE 4.5 Turbo (ex.) | $0.11 | $0.45 | Qianfan |
| Claude Opus 4.1 | $15.00 | $75.00 | Anthropic |
| Gemini 2.5 Pro | $1.25 (≤200k) / $2.50 (>200k) | $10.00 (≤200k) / $15.00 (>200k) | Google Vertex AI Pricing |
| Grok 4 (grok-4-0709) | $3.00 | $15.00 | xAI API |
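As a rough illustration of what these list prices mean for a given job, the sketch below estimates the cost of a hypothetical workload from the per-1M-token figures in the comparison above. The workload size and the function name `workload_cost` are assumptions made for the example, and the Gemini entry uses only the ≤200k tier. Real bills also depend on caching, batching discounts, and tokenizer differences, none of which are modeled here.

```python
# Per-1M-token list prices in USD from the comparison above: (input, output).
# Gemini 2.5 Pro is represented by its <=200k-token tier only.
PRICES_PER_1M = {
    "GPT-5.1": (1.25, 10.00),
    "ERNIE 5.0": (0.85, 3.40),
    "ERNIE 4.5 Turbo": (0.11, 0.45),
    "Claude Opus 4.1": (15.00, 75.00),
    "Gemini 2.5 Pro (<=200k)": (1.25, 10.00),
    "Grok 4": (3.00, 15.00),
}


def workload_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate list-price cost in USD for a given token volume."""
    in_price, out_price = PRICES_PER_1M[model]
    return (input_tokens * in_price + output_tokens * out_price) / 1_000_000


def rank_by_cost(input_tokens: int, output_tokens: int) -> list:
    """Return (model, cost) pairs sorted from cheapest to most expensive."""
    costs = [(m, workload_cost(m, input_tokens, output_tokens)) for m in PRICES_PER_1M]
    return sorted(costs, key=lambda pair: pair[1])


if __name__ == "__main__":
    # Hypothetical workload: 2M input tokens and 500K output tokens.
    for model, cost in rank_by_cost(2_000_000, 500_000):
        print(f"{model:<26} ${cost:,.2f}")
```

For this hypothetical workload, ERNIE 5.0 works out to about $3.40, against about $7.50 for GPT-5.1 and $67.50 for Claude Opus 4.1.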

Global Expansion: Products and Platforms

In tandem with the model release, Baidu is expanding internationally:

  • GenFlow 3.0, now with 20M+ users, is the company’s largest general-purpose AI agent and features enhanced memory and multimodal task handling.

  • Famou, a self-evolving agent capable of dynamically solving complex problems, is now commercially available via invite.

  • MeDo, the international version of Baidu’s no-code builder Miaoda, is live globally via medo.dev.

  • Oreate, a productivity workspace with document, slide, image, video, and podcast support, has reached over 1.2M users worldwide.

Baidu’s digital human platform, already rolled out in Brazil, is also part of the global push. According to company data, 83% of livestreamers during this year’s “Double 11” shopping event in China used Baidu’s digital human tech, contributing to a 91% increase in GMV.

Meanwhile, Baidu’s autonomous ride-hailing service Apollo Go has surpassed 17 million rides, operating driverless fleets in 22 cities and claiming the title of the world’s largest robotaxi network.

Open-Source Vision-Language Model Garners Industry Attention

Two days before the flagship ERNIE 5.0 event, Baidu also released an open-source multimodal model under the Apache 2.0 license: ERNIE-4.5-VL-28B-A3B-Thinking.

As reported by my colleague Michael Nuñez at VentureBeat, the model activates just 3 billion parameters while maintaining a total of 28 billion, using a Mixture-of-Experts (MoE) architecture for efficient inference.
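For readers unfamiliar with the idea, the toy Python sketch below shows generic top-k expert routing, the mechanism that lets an MoE model run only a fraction of its parameters per token. It is a textbook-style illustration only, not Baidu’s architecture: the router scores, the number of experts, the value of k, and every function name here are assumptions made for the example.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of router scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def route_top_k(router_scores, k):
    """Pick the k highest-scoring experts and renormalize their weights."""
    probs = softmax(router_scores)
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)
    return [(i, probs[i] / norm) for i in top]


def moe_forward(x, experts, router_scores, k):
    """Combine the outputs of only the selected experts; the rest never run."""
    return sum(weight * experts[i](x) for i, weight in route_top_k(router_scores, k))


# Parameter counts reported in the article.
ACTIVE_PARAMS = 3e9
TOTAL_PARAMS = 28e9
ACTIVE_FRACTION = ACTIVE_PARAMS / TOTAL_PARAMS  # roughly 0.107

if __name__ == "__main__":
    # Four toy "experts" that simply scale their input by a fixed factor.
    experts = [lambda x, s=s: s * x for s in (1.0, 2.0, 3.0, 4.0)]
    print(route_top_k([0.0, 2.0, 1.0, -1.0], k=2))
    print(moe_forward(1.0, experts, [0.0, 2.0, 1.0, -1.0], k=2))
    print(f"Active share of parameters: {ACTIVE_FRACTION:.1%}")
```

With 3 billion of 28 billion parameters active, roughly 11% of the weights take part in any single forward pass, which is where the inference savings come from.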

Key technical innovations include:

  • “Thinking with Images”, which enables dynamic zoom-based visual analysis

  • Support for chart interpretation, document understanding, visual grounding, and temporal awareness in video

  • Runtime on a single 80GB GPU, making it accessible to mid-sized organizations

  • Full compatibility with Transformers, vLLM, and Baidu’s FastDeploy toolkits
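The single-GPU claim can be sanity-checked with back-of-the-envelope arithmetic. The Python sketch below estimates the memory needed for weights alone at a few common numeric precisions. The article does not say which precision the released checkpoint uses, so the precisions listed are assumptions, and the estimate ignores activations, KV cache, and framework overhead.

```python
TOTAL_PARAMS = 28e9      # total parameter count reported in the article
GPU_MEMORY_GB = 80       # single-GPU budget cited in the article

# Bytes per parameter for common precisions (assumed, not stated by Baidu).
BYTES_PER_PARAM = {"fp32": 4, "bf16": 2, "int8": 1}


def weight_memory_gb(num_params: float, bytes_per_param: int) -> float:
    """Memory for model weights only, in decimal gigabytes."""
    return num_params * bytes_per_param / 1e9


def fits_on_gpu(num_params: float, bytes_per_param: int, gpu_gb: float) -> bool:
    """True if the weights alone fit within the GPU memory budget."""
    return weight_memory_gb(num_params, bytes_per_param) <= gpu_gb


if __name__ == "__main__":
    for name, nbytes in BYTES_PER_PARAM.items():
        size = weight_memory_gb(TOTAL_PARAMS, nbytes)
        verdict = "fits" if fits_on_gpu(TOTAL_PARAMS, nbytes, GPU_MEMORY_GB) else "does not fit"
        print(f"{name}: {size:.0f} GB of weights, {verdict} in {GPU_MEMORY_GB} GB")
```

Under these assumptions, 16-bit weights take about 56 GB, leaving headroom on an 80GB card, while 32-bit weights (about 112 GB) would not fit.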

This release adds pressure on closed-source competitors. With Apache 2.0 licensing, ERNIE-4.5-VL-28B-A3B-Thinking becomes a viable foundation model for commercial applications without licensing restrictions, something few high-performing models in this class offer.

Community Feedback and Baidu’s Response

Following the launch of ERNIE 5.0, developer and AI evaluator Lisan al Gaib (@scaling01) posted a mixed review on X. While initially impressed by the model’s benchmark performance, they reported a persistent issue where ERNIE 5.0 would repeatedly invoke tools, even when explicitly instructed not to, during SVG generation tasks.

“ERNIE 5.0 benchmarks looked insane until I tested it… unfortunately it’s RL braindamaged or they have a serious issue with their chat platform / system prompt,” Lisan wrote.

Within hours, Baidu’s developer-focused support account, @ErnieforDevs, responded:

“Thanks for the feedback! It’s a known bug — certain syntax can consistently trigger it. We’re working on a fix. You can try rephrasing or changing the prompt to avoid it for now.”

The quick turnaround reflects Baidu’s increasing emphasis on developer communication, especially as it courts international users through both proprietary and open-source offerings.

Outlook for Baidu and its ERNIE foundational LLM family

Baidu’s ERNIE 5.0 marks a strategic escalation in the global foundation model race. With performance claims that put it on par with the most advanced systems from OpenAI and Google, and a mix of premium pricing and open-access alternatives, Baidu is signaling its ambition to become not just a domestic AI leader, but a credible global infrastructure provider.

At a time when enterprise AI users are increasingly demanding multimodal performance, flexible licensing, and deployment efficiency, Baidu’s two-track approach (premium hosted APIs and open-source releases) could broaden its appeal across both corporate and developer communities.

Whether the company’s performance claims hold up under third-party testing remains to be seen. But in a landscape shaped by rising costs, model complexity, and compute bottlenecks, ERNIE 5.0 and its supporting ecosystem give Baidu a competitive position in the next wave of AI deployment.

New York Dawn is a proud and integral publication of the Enspirers News Group, embodying the values of journalistic integrity and excellence.