Token Monster, a brand new AI chatbot platform, has launched its alpha preview, aiming to alter how customers work together with giant language fashions (LLMs).
Developed by Matt Shumer, co-founder and CEO of OthersideAI and its hit AI writing assistant Hyperwrite AI, Token Monster’s key promoting level is its capability to route person prompts to the most effective obtainable LLMs for the duty at hand, delivering enhanced outputs by leveraging the strengths of a number of fashions.
There are seven main LLMs presently obtainable by means of Token Monster. As soon as a person varieties one thing into the immediate entry field, Token Monster makes use of pre-prompts developed by means of iteration by Shumer himself to robotically analyze the person’s enter, determine which mixture of a number of obtainable fashions and linked instruments are greatest suited to reply it, after which present a mixed response leveraging the strengths of mentioned fashions. The obtainable LLMs embrace:
Anthropic Claude 3.5 Sonnet
Anthropic Claude 3.5 Opus
OpenAI GPT-4.1
OpenAI GPT-4o
Perplexity AI PPLX (for analysis)
OpenAI o3 (for reasoning)
Google Gemini 2.5 Professional
In contrast to different chatbot platforms, Token Monster robotically identifies which LLM is greatest for particular duties — in addition to which LLM-connected instruments could be useful equivalent to internet search or coding environments — and orchestrates a multi-model workflow.
“We’re just building the connectors to everything and then a system that decides what to use when,” mentioned Shumer.
For example, it’d use Claude for creativity, o3 for reasoning, and PPLX for analysis, amongst others. This method eliminates the necessity for customers to manually select the suitable mannequin for every immediate, simplifying the method for anybody who desires high-quality, tailor-made outcomes.
Characteristic highlights
The alpha preview, which is presently free to join at tokenmonster.ai, permits customers to add a spread of file varieties, together with Excel, PowerPoint, and Docs.
It additionally consists of options equivalent to webpage extraction, persistent dialog classes, and a “FAST mode” that auto-routes to the most effective mannequin with out person enter.
On the coronary heart of Token Monster is OpenRouter, a third-party service that acts as a gateway to a number of LLMs, and into which Shumer has invested a small sum, by his admission.
This structure lets Token Monster faucet into a spread of fashions from completely different suppliers with out having to construct separate integrations for each.
Pricing and availability
As of proper now, Token Monster doesn’t cost a flat month-to-month charge.
As a substitute, customers solely pay for the tokens they devour by means of OpenRouter, making it versatile for various ranges of utilization.
In keeping with Shumer, this mannequin was impressed by Cline, a software that allows high-spending customers to entry limitless AI energy, permitting them to attain higher outputs by merely utilizing extra compute assets.
Multi-step workflows produce richer LLM responses
Token Monster’s AI workflows lengthen past easy immediate routing.
In a single instance, the chatbot may begin with a analysis section utilizing internet search APIs, cross that knowledge to o3 for figuring out info gaps, then create a top level view with Gemini 2.5 Professional, draft textual content with Claude Opus, and refine it with Claude 3.5 Sonnet.
This multi-step orchestration is designed to offer richer, extra full solutions than a single LLM may have the ability to generate alone.
The platform additionally consists of the power to save lots of classes, with knowledge securely saved utilizing the open supply on-line database service Supabase. This ensures that customers can return to ongoing tasks with out dropping their work, whereas nonetheless giving them management over what knowledge is saved and what’s ephemeral.
A non-traditional CEO
In a notable experiment, Token Monster’s management has been handed over to Anthropic’s Claude mannequin.
Shumer introduced that he’s dedicated to following each resolution made by “CEO Claude,” calling it a check to see whether or not an AI can handle a enterprise successfully.
“Either we’ve revolutionized management forever or made a huge mistake,” he wrote on X.
Rising from the Reflection 70-B controversy
Token Monster’s launch comes lower than a 12 months after Shumer confronted controversy over his launch and supreme retraction of Reflection 70B, a fine-tuned versio of Meta’s Llama 3.1 that was initially touted as essentially the most extremely performant open supply mannequin on this planet, however which rapidly grew to become topic to criticism and accusations of fraud after third-party researchers have been unable to breed its said efficiency on third-party benchmark assessments.
Shumer apologized and mentioned the problems have been born out of errors made as a result of pace. The episode underscored the challenges and dangers of fast AI improvement and the significance of transparency in mannequin releases.
MCP integrations coming subsequent
Shumer mentioned his crew on Token Monster can also be exploring new capabilities, equivalent to integrating with Mannequin Context Protocol (MCP) servers that enable web sites and corporations to have LLMs make use of their information, instruments, and merchandise to attain higher-order duties than simply textual content or picture technology.
This may allow Token Monster to attach with a person’s inner knowledge and providers, opening prospects for it to deal with duties like managing buyer help tickets or interfacing with different enterprise programs.
Shumer emphasised that Token Monster continues to be very a lot in its early levels. Whereas it already helps a set of highly effective options, the platform stays an alpha product and is predicted to see fast iterations and updates as extra customers present suggestions. “We’re going to keep iterating and adding things,” he mentioned.
A promising experiment
For customers who need to reap the benefits of the mixed energy of a number of LLMs with out the trouble of mannequin switching, Token Monster may very well be an interesting selection. It’s designed to work for individuals who don’t need to spend hours tweaking prompts or testing completely different fashions themselves, as a substitute letting the system’s automated routing and multi-step workflows deal with the complexity.
As Token Monster’s capabilities develop, will probably be fascinating to see how customers and companies undertake it — and the way its experiment with AI-led administration pans out. For now, it’s a promising addition to the quickly increasing panorama of AI chatbots and digital assistants.
Each day insights on enterprise use instances with VB Each day
If you wish to impress your boss, VB Each day has you lined. We provide the inside scoop on what firms are doing with generative AI, from regulatory shifts to sensible deployments, so you possibly can share insights for max ROI.
An error occured.


