Whereas enterprises face the challenges of deploying AI brokers in essential purposes, a brand new, extra pragmatic mannequin is rising that places people again in management as a strategic safeguard towards AI failure.
One such instance is Mixus, a platform that makes use of a “colleague-in-the-loop” strategy to make AI brokers dependable for mission-critical work.
This strategy is a response to the rising proof that totally autonomous brokers are a high-stakes gamble.
The excessive value of unchecked AI
The issue of AI hallucinations has develop into a tangible danger as firms discover AI purposes. In a current incident, the AI-powered code editor Cursor noticed its personal help bot invent a faux coverage limiting subscriptions, sparking a wave of public buyer cancellations.
Equally, the fintech firm Klarna famously reversed course on changing customer support brokers with AI after admitting the transfer resulted in decrease high quality. In a extra alarming case, New York Metropolis’s AI-powered enterprise chatbot suggested entrepreneurs to interact in unlawful practices, highlighting the catastrophic compliance dangers of unmonitored brokers.
These incidents are signs of a bigger functionality hole. Based on a Might 2025 Salesforce analysis paper, at present’s main brokers succeed solely 58% of the time on single-step duties and simply 35% of the time on multi-step ones, highlighting “a significant gap between current LLM capabilities and the multifaceted demands of real-world enterprise scenarios.”
The colleague-in-the-loop mannequin
To bridge this hole, a brand new strategy focuses on structured human oversight. “An AI agent should act at your direction and on your behalf,” Mixus co-founder Elliot Katz advised VentureBeat. “But without built-in organizational oversight, fully autonomous agents often create more problems than they solve.”
This philosophy underpins Mixus’s colleague-in-the-loop mannequin, which embeds human verification straight into automated workflows. For instance, a big retailer would possibly obtain weekly stories from hundreds of shops that comprise essential operational knowledge (e.g., gross sales volumes, labor hours, productiveness ratios, compensation requests from headquarters). Human analysts should spend hours manually reviewing the info and making choices primarily based on heuristics. With Mixus, the AI agent automates the heavy lifting, analyzing advanced patterns and flagging anomalies like unusually excessive wage requests or productiveness outliers.
For prime-stakes choices like cost authorizations or coverage violations — workflows outlined by a human consumer as “high-risk” — the agent pauses and requires human approval earlier than continuing. The division of labor between AI and people has been built-in into the agent creation course of.
“This approach means humans only get involved when their expertise actually adds value — typically the critical 5-10% of decisions that could have significant impact — while the remaining 90-95% of routine tasks flow through automatically,” Katz stated. “You get the speed of full automation for standard operations, but human oversight kicks in precisely when context, judgment, and accountability matter most.”
In a demo that the Mixus workforce confirmed to VentureBeat, creating an agent is an intuitive course of that may be achieved with plain-text directions. To construct a fact-checking agent for reporters, for instance, co-founder Shai Magzimof merely described the multi-step course of in pure language and instructed the platform to embed human verification steps with particular thresholds, reminiscent of when a declare is high-risk and can lead to reputational harm or authorized penalties.
The platform’s integration capabilities lengthen additional to fulfill particular enterprise wants. Mixus helps the Mannequin Context Protocol (MCP), which allows companies to attach brokers to their bespoke instruments and APIs, avoiding the necessity to reinvent the wheel for current inner programs. Mixed with integrations for different enterprise software program like Jira and Salesforce, this permits brokers to carry out advanced, cross-platform duties, reminiscent of checking on open engineering tickets and reporting the standing again to a supervisor on Slack.
Human oversight as a strategic multiplier
The enterprise AI house is presently present process a actuality verify as firms transfer from experimentation to manufacturing. The consensus amongst many business leaders is that people within the loop are a sensible necessity for brokers to carry out reliably.
AI Brokers will possible comply with a self driving trajectory, the place you want a human within the loop for a protracted tail of duties for some time. The massive distinction is we’ll get a rising variety of autonomous brokers alongside the best way, the place full self driving is an all or nothing proposition. https://t.co/5dR7cGS7jn
— Aaron Levie (@levie) June 20, 2025
Mixus’s collaborative mannequin adjustments the economics of scaling AI. Mixus predicts that by 2030, agent deployment could develop 1000x and every human overseer will develop into 50x extra environment friendly as AI brokers develop into extra dependable. However the whole want for human oversight will nonetheless develop.
“Each human overseer manages exponentially more AI work over time, but you still need more total oversight as AI deployment explodes across your organization,” Katz stated.

For enterprise leaders, this implies human expertise will evolve somewhat than disappear. As an alternative of being changed by AI, consultants will probably be promoted to roles the place they orchestrate fleets of AI brokers and deal with the high-stakes choices flagged for his or her overview.
On this framework, constructing a robust human oversight operate turns into a aggressive benefit, permitting firms to deploy AI extra aggressively and safely than their rivals.
“Companies that master this multiplication will dominate their industries, while those chasing full automation will struggle with reliability, compliance, and trust,” Katz stated.
Each day insights on enterprise use circumstances with VB Each day
If you wish to impress your boss, VB Each day has you lined. We provide the inside scoop on what firms are doing with generative AI, from regulatory shifts to sensible deployments, so you’ll be able to share insights for max ROI.
An error occured.


