We collect cookies to analyze our website traffic and performance; we never collect any personal data. Cookie Policy
Accept
NEW YORK DAWN™NEW YORK DAWN™NEW YORK DAWN™
Notification Show More
Font ResizerAa
  • Home
  • Trending
  • New York
  • World
  • Politics
  • Business
    • Business
    • Economy
    • Real Estate
  • Crypto & NFTs
  • Tech
  • Lifestyle
    • Lifestyle
    • Food
    • Travel
    • Fashion
    • Art
  • Health
  • Sports
  • Entertainment
Reading: Galileo launches Agentic Evaluations to repair AI agent errors earlier than they value you
Share
Font ResizerAa
NEW YORK DAWN™NEW YORK DAWN™
Search
  • Home
  • Trending
  • New York
  • World
  • Politics
  • Business
    • Business
    • Economy
    • Real Estate
  • Crypto & NFTs
  • Tech
  • Lifestyle
    • Lifestyle
    • Food
    • Travel
    • Fashion
    • Art
  • Health
  • Sports
  • Entertainment
Follow US
NEW YORK DAWN™ > Blog > Technology > Galileo launches Agentic Evaluations to repair AI agent errors earlier than they value you
Galileo launches Agentic Evaluations to repair AI agent errors earlier than they value you
Technology

Galileo launches Agentic Evaluations to repair AI agent errors earlier than they value you

Last updated: January 24, 2025 2:51 pm
Editorial Board Published January 24, 2025
Share
SHARE

Galileo, a San Francisco-based startup, is betting that the way forward for synthetic intelligence depends upon belief. At the moment, the corporate launched a brand new product, Agentic Evaluations, to handle a rising problem on this planet of AI: ensuring the more and more advanced methods often called AI brokers really work as meant.

AI brokers — autonomous methods that carry out multi-step duties like producing experiences or analyzing buyer knowledge — are gaining traction throughout industries. However their speedy adoption raises an important query: How can firms confirm these methods stay dependable after deployment? Galileo’s CEO, Vikram Chatterji, believes his firm has discovered the reply.

“Over the last six to eight months, we started to see some of our customers trying to adopt agentic systems,” mentioned Chatterji in an interview. “Now LLMs can be used as a smart router to pick and choose the right API calls towards actually completing a task. Going from just generating text to actually completing a task was a very big chasm that was unlocked.”

A diagram displaying how Galileo evaluates AI brokers at three key phases: software choice, error detection and job completion. (Credit score: Galileo)

AI brokers present promise, however enterprises demand accountability

Main enterprises like Cisco and Ema (the latter based by Coinbase’s former chief product officer) have already adopted Galileo’s platform. These firms use AI brokers to automate duties from buyer assist to monetary evaluation, and report vital productiveness positive factors.

“A sales representative who’s trying to do outreach and outbounds would otherwise use maybe a week of their time to do that, versus with some of these AI-enabled agents, they’re doing that within two days or less,” Chatterji defined, highlighting the return on funding for enterprises.

Galileo’s new framework evaluates software choice high quality, detects errors in software calls, and tracks total session success. It additionally displays important metrics for large-scale AI deployment, together with prices and latency.

Figure 2 Agent MetricsA dashboard displaying how Galileo evaluates AI brokers at three key phases: software choice, error detection and job completion. (Credit score: Galileo)

$68 million in funding fuels Galileo’s push into enterprise AI

The launch builds on Galileo’s current momentum. The corporate raised $45 million in collection B funding led by Scale Enterprise Companions final October, bringing its whole funding to $68 million. Trade analysts mission the marketplace for AI operations instruments may attain $4 billion by 2025.

The stakes are excessive as AI deployment accelerates. Research present even superior fashions like GPT-4 can hallucinate about 23% of the time throughout fundamental question-and-answer duties. Galileo’s instruments assist enterprises determine these points earlier than they influence operations.

“Before we launch this thing, we really, really need to know that this thing works,” Chatterji mentioned, describing buyer issues. “The bar is really high. So that’s where we gave them this tool chain, such that they could just use our metrics as the basis for these tests.”

Addressing AI hallucinations and enterprise-scale challenges

The corporate’s concentrate on dependable, production-ready options positions it properly in a market more and more involved with AI security. For technical leaders deploying enterprise AI, Galileo’s platform offers important guardrails for guaranteeing AI brokers carry out as meant whereas controlling prices.

As enterprises increase their use of AI brokers, efficiency monitoring instruments grow to be essential infrastructure. Galileo’s newest providing goals to assist companies deploy AI responsibly and successfully at scale.

“2025 will be the year of agents. It is going to be very prolific,” Chatterji famous. “However, what we’ve also seen is a lot of companies that are just launching these agents without good testing is leading to negative implications…The need for proper testing and evaluations is more than ever before.”

Every day insights on enterprise use instances with VB Every day

If you wish to impress your boss, VB Every day has you lined. We provide the inside scoop on what firms are doing with generative AI, from regulatory shifts to sensible deployments, so you possibly can share insights for max ROI.

An error occured.

vb daily phone

You Might Also Like

Why most enterprise AI coding pilots underperform (Trace: It's not the mannequin)

Google’s new framework helps AI brokers spend their compute and gear finances extra correctly

Ai2's new Olmo 3.1 extends reinforcement studying coaching for stronger reasoning benchmarks

Cohere’s Rerank 4 quadruples the context window over 3.5 to chop agent errors and enhance enterprise search accuracy

Nous Analysis simply launched Nomos 1, an open-source AI that ranks second on the notoriously brutal Putnam math examination

TAGGED:agentagenticCosterrorsEvaluationsfixGalileolaunches
Share This Article
Facebook Twitter Email Print

Follow US

Find US on Social Medias
FacebookLike
TwitterFollow
YoutubeSubscribe
TelegramFollow
Popular News
With out clear water, Pacific Islanders flip to sugary drinks—tackling this might cut back weight problems
Health

With out clear water, Pacific Islanders flip to sugary drinks—tackling this might cut back weight problems

Editorial Board December 21, 2024
Nets Pocket book: How Ziaire Williams improved his market worth
New strategy to agent reliability, AgentSpec, forces brokers to comply with guidelines
Nvidia revenues hit $39.3B, up 78% in FYQ4 — with no signal of slowdown (up to date)
Fife in Autumn: Go to Scotland’s Historical Capital and Coastal Villages

You Might Also Like

GPT-5.2 first impressions: a strong replace, particularly for enterprise duties and workflows
Technology

GPT-5.2 first impressions: a strong replace, particularly for enterprise duties and workflows

December 12, 2025
OpenAI's GPT-5.2 is right here: what enterprises must know
Technology

OpenAI's GPT-5.2 is right here: what enterprises must know

December 11, 2025
Marble enters the race to convey AI to tax work, armed with  million and a free analysis device
Technology

Marble enters the race to convey AI to tax work, armed with $9 million and a free analysis device

December 11, 2025
Making a glass field: How NetSuite is engineering belief into AI
Technology

Making a glass field: How NetSuite is engineering belief into AI

December 11, 2025

Categories

  • Health
  • Sports
  • Politics
  • Entertainment
  • Technology
  • Art
  • World

About US

New York Dawn is a proud and integral publication of the Enspirers News Group, embodying the values of journalistic integrity and excellence.
Company
  • About Us
  • Newsroom Policies & Standards
  • Diversity & Inclusion
  • Careers
  • Media & Community Relations
  • Accessibility Statement
Contact Us
  • Contact Us
  • Contact Customer Care
  • Advertise
  • Licensing & Syndication
  • Request a Correction
  • Contact the Newsroom
  • Send a News Tip
  • Report a Vulnerability
Term of Use
  • Digital Products Terms of Sale
  • Terms of Service
  • Privacy Policy
  • Cookie Settings
  • Submissions & Discussion Policy
  • RSS Terms of Service
  • Ad Choices
© 2024 New York Dawn. All Rights Reserved.
Welcome Back!

Sign in to your account

Lost your password?