Whereas shopper consideration has targeted on the generative AI battles between OpenAI and Google, Anthropic has executed a disciplined enterprise technique centered on coding — doubtlessly essentially the most helpful enterprise AI use case. The outcomes have gotten more and more clear: Claude is positioning itself because the LLM that issues most for companies.
The proof? Anthropic’s Claude 3.7 Sonnet, launched simply two weeks in the past, set new benchmark data for coding efficiency. Concurrently, the corporate launched Claude Code, a command-line AI agent that helps builders construct purposes quicker. In the meantime, Cursor — an AI-powered code editor that defaults to Anthropic’s Claude mannequin — has surged to a reported $100 million in annual recurring income in simply 12 months.
Anthropic’s deliberate deal with coding comes as enterprises more and more acknowledge the facility of AI coding brokers, which allow each seasoned builders and non-coders to construct purposes with unprecedented velocity and effectivity. “Anthropic continues to come out on top,” mentioned Guillermo Rauch, CEO of Vercel, one other fast-growing firm that lets builders, together with non-coders, deploy front-end purposes. Final yr, Vercel switched its lead coding mannequin from OpenAI’s GPT to Anthropic’s Claude after evaluating the fashions’ efficiency on key coding duties.
Claude 3.7: Setting new benchmarks for AI coding
Launched February 24, Claude 3.7 Sonnet leads on almost all coding benchmarks. It scored a formidable 70.3% on the revered SWE-bench benchmark, which measures an agent’s software program improvement abilities, handily outperforming nearest rivals OpenAI’s o1 (48.9%) and DeepSeek-R1 (49.2%). It additionally outperforms rivals on agentic duties.
Supply: Anthropic. SWE-bench measures a mannequin’s potential to unravel real-world software program points.
Developer communities have shortly verified these ends in real-world testing. Reddit threads evaluating Claude 3.7 with Grok 3, the newly launched mannequin from Elon Musk’s xAI, persistently favor Anthropic’s mannequin for coding duties. “Based on what I’ve tested, Claude 3.7 seems to be the best for writing code (at least for me),” mentioned a high commenter.
Alongside the three.7 Sonnet launch, Anthropic launched Claude Code, an AI coding agent that works straight by means of the command line. This enhances the corporate’s October launch of Pc Use, which allows Claude to work together with a person’s pc, together with utilizing a browser to go looking the online, opening purposes, and inputting textual content.
Supply: Anthropic: TAU-bench is a framework that assessments AI brokers on complicated real-world duties with person and power interactions.
Most notable is what Anthropic hasn’t carried out. In contrast to rivals that rush to match one another feature-for-feature, the corporate hasn’t even bothered to combine internet search performance into its app — a primary characteristic most customers count on. This calculated omission indicators that Anthropic isn’t competing for common shoppers however is laser-focused on the enterprise market, the place coding capabilities ship a lot larger ROI than search.
Fingers-on with Claude’s coding capabilities
To check the real-world capabilities of those coding brokers, I experimented with constructing a database to retailer VentureBeat articles utilizing three totally different approaches: Claude 3.7 Sonnet by means of Anthropic’s app; Cursor’s coding agent; and Claude Code.
Utilizing Claude 3.7 straight by means of Anthropic’s app, I discovered the answer supplied outstanding steerage for a non-coder like myself. It really useful a number of choices, from very sturdy options utilizing issues like PostgreSQL database, to simpler, light-weight ones like utilizing Airtable. I selected the light-weight answer, and Claude methodically walked me by means of easy methods to pull articles from the VentureBeat API into Airtable utilizing Make.com for connections. The method took about two hours, together with some authentication challenges, however resulted in a practical system. You would say that as a substitute of doing the entire code for me, it confirmed me a grasp plan on easy methods to do it.
Cursor, which defaults to Claude’s fashions, is a full-fledged code editor and was extra wanting to automate the method. Nonetheless, it required permission at each step, making a considerably tedious workflow.
Claude Code provided yet one more strategy, operating straight within the terminal and utilizing SQLite to create an area database that pulled articles from our RSS feed. This answer was easier and extra dependable when it comes to getting me to my finish aim, however positively much less sturdy and feature-rich than the Airtable implementation. I’m now understanding the character of those tradeoffs, and know that the coding agent I choose actually will depend on the particular undertaking.
The important thing perception: At the same time as a non-developer, I used to be in a position to construct practical database purposes utilizing all three approaches — one thing that will have been unthinkable only a yr in the past. They usually all relied on Claude below the hood.
For a extra detailed evaluate of how to do that so-called “vibe coding,” the place you depend on brokers to code issues whereas not doing any coding your self, learn this nice piece by developer Simon Willison revealed yesterday. The method may be very buggy, and irritating at instances, however with the precise concessions to this, you’ll be able to go a good distance.
The technique: Why coding is Anthropic’s enterprise play
Anthropic’s singular deal with coding capabilities isn’t unintended. Based on projections reportedly leaked to The Data, Anthropic goals to succeed in $34.5 billion in income by 2027 — an 86-fold enhance from present ranges. Roughly 67% of this projected income would come from API enterprise, with enterprise coding purposes as the first driver. Whereas Anthropic hasn’t launched precise numbers for its income up to now, it mentioned its coding income surged 1,000% over the past quarter of 2024. Final week, Anthropic introduced it had raised $3.5 billion extra in funding at a $61.5 billion valuation.
This coding wager is supported by Anthropic’s personal Financial Index, which discovered that 37.2% of queries despatched to Claude had been within the “computer and mathematical” class, primarily masking software program engineering duties like code modification, debugging and community troubleshooting.
Anthropic seems to be marching to its personal beat — at a time when rivals are distracted, dashing to cowl each enterprise and shopper markets with characteristic parity. OpenAI’s lead is bolstered from its early shopper recognition and utilization, and it’s caught making an attempt to serve each common customers and companies with a number of fashions and performance. Google is chasing this pattern too, making an attempt to have one among every thing.
Anthropic’s comparatively disciplined technique extends to its product selections. As a substitute of chasing shopper market share, the corporate has prioritized enterprise options like GitHub integration, audit logs, customizable permissions and domain-specific safety controls. Six months in the past, it launched a large 500,000-token context window for builders, whereas Google restricted its 1-million-token window to personal testers. The result’s a complete coding-focused providing that enterprises are more and more adopting.
The corporate not too long ago launched options permitting non-coders to publish AI-created purposes inside their organizations, and simply final week upgraded its console with enhanced collaboration capabilities, together with shareable prompts and templates. This democratization displays a kind of Trojan Horse technique: First allow builders to construct highly effective foundations, then broaden entry to the broader enterprise workforce, together with up into the company suite.
The coding agent ecosystem: Cursor and past
Maybe essentially the most telling signal of Anthropic’s success is the explosive progress of Cursor, an AI code editor that reportedly has 360,000 customers, with greater than 40,000 of them paying clients, after simply 12 months — making it probably the quickest SaaS firm to succeed in that milestone.
Cursor’s success is inextricably linked to Claude. “You’ve got to think their number one customer is Cursor,” famous Sam Witteveen, cofounder of Purple Dragon, an impartial developer of AI brokers. “Most people on [Cursor] were using the Claude Sonnet model — the 3.5 models — already. And now it seems everyone’s just migrating over to 3.7.”
The connection between Anthropic and its ecosystem extends past particular person corporations like Cursor. In November, Anthropic launched its Mannequin Context Protocol (MCP) as an open commonplace, permitting builders to construct instruments that work together with Claude fashions. The usual is being broadly adopted by builders.
“By launching this as an open protocol, they’re kind of saying, ‘Hey, everyone, have at it,’” explained Witteveen. “You can develop whatever you want that fits this protocol. We’re going to help this protocol.”
This strategy creates a virtuous cycle: Builders construct instruments for Claude, which makes Claude extra helpful to enterprises, which drives extra adoption, which attracts extra builders.
The competitors: Microsoft, OpenAI, Google and open supply
Whereas Anthropic has discovered its focus, rivals are pursuing totally different methods with various outcomes.
Microsoft maintains important momentum by means of its GitHub Copilot, which has 1.3 million paid customers and has been adopted by greater than 77,000 organizations in roughly two years. Corporations like Honeywell, State Road, TD Financial institution Group and Levi’s are amongst its customers. This widespread adoption stems largely from Microsoft’s current enterprise relationships and its first-mover benefit, whereby it invested early into OpenAI and used that firm’s fashions to energy Copilot.
Nonetheless, even Microsoft has acknowledged Anthropic’s energy. In October, it allowed GitHub Copilot customers to decide on Anthropic’s fashions as a substitute for OpenAI. And OpenAI’s current fashions — o1 and the newer o3, which emphasize reasoning by means of prolonged pondering — haven’t demonstrated explicit strengths in coding or agentic duties.
Google has made its personal play by not too long ago making its Code Help free, however this transfer appears extra defensive than strategic.
The open supply motion is one other important drive on this panorama. Meta’s Llama fashions have gained substantial enterprise traction, with main corporations like AT&T, DoorDash and Goldman Sachs deploying Llama-based fashions for varied purposes. The open-source strategy gives enterprises larger management, customization choices and value advantages that closed fashions can’t match, as VentureBeat reported final yr.
Moderately than seeing this as a direct menace, Anthropic seems to be positioning itself as complementary to open supply. Enterprise clients can use Claude alongside open-source fashions relying on particular wants, a hybrid strategy that maximizes the strengths of every.
In reality, most enterprise corporations of scale I’ve talked with over the previous a number of months are explicitly multimodal, in that their AI workflows permit them to make use of no matter mannequin is greatest for a given case. Intuit was an early instance of an organization that had wager on OpenAI as a default for its tax return purposes, however then final yr switched to Claude as a result of it was superior in some circumstances. The ache of switching led Intuit to create an AI orchestration framework that allowed switching between fashions to be rather more seamless, as Nhung Ho, Intuit’s VP of AI, informed VentureBeat on the time.
Most different enterprise corporations have since adopted the same follow. They use no matter mannequin is greatest for the particular case, pulling in fashions with easy API calls. In some circumstances, an open-source mannequin like Llama may work properly, however in others — for instance, in calculations the place accuracy is essential — Claude is the selection, Intuit’s Ho defined at VentureBeat’s VB Remodel occasion final yr.
Over the previous couple of days, I’ve been attending the HumanX convention in Las Vegas, the place lots of of builders gathered to speak about AI. Claude comes up virtually at all times at any time when the subject of brokers or coding is raised. Over lunch yesterday, Julianne Averill, managing director at Danforth Advisors, which advises life science corporations, mentioned her firm had discovered Claude superior for a lot of such duties, together with constructing structured evaluation tables.
Vercel CEO Guillermo Rauch, one other attendee, mentioned his firm, which has surpassed $100 million in annual income, selected Claude final yr as its default mannequin to assist builders code after doing rigorous evaluations of all fashions. “3.7 is king,” Rauch informed VentureBeat. He agreed it’s essential to supply builders a selection of fashions, because the breakneck tempo of advances means there can’t be loyalty to a single mannequin. However whereas Vercel’s V0 product, which lets customers generate internet person interfaces (UIs) utilizing natural-language prompts, gives that selection, it has to choose a default mannequin to assist customers throughout their preliminary ideation and reasoning section. That mannequin is Claude Sonnet. “You need the architect model that is capable of reasoning and does the lion’s share of code generation,” he mentioned. “A significant chunk of our pipeline is powered by Anthropic Sonnet.” Adobe, Chick-Fil-A and Mattress Tub and Past are Vercel’s clients.
Nonetheless Rauch cautioned that fluidity within the LLM race stays, and the lead mannequin may change at any time. Vercel experimented with China’s DeepSeek, he mentioned, however discovered it fell simply in need of matching Claude’s Sonnet. Equally, he mentioned, Alibaba’s Qwen mannequin has gotten superb.
Enterprise implications: Making the shift to coding brokers
For enterprise decision-makers, this quickly evolving panorama presents each alternatives and challenges.
Safety stays a high concern, however a current impartial report discovered Claude 3.7 Sonnet to be essentially the most safe mannequin but — the one one examined that proved “jailbreak-proof.” This safety stance, mixed with Anthropic’s backing from each Google and Amazon (and integration into AWS Bedrock), positions it properly for enterprise adoption.
The rise of coding brokers isn’t simply altering how purposes are constructed — it’s democratizing the method. Based on GitHub, 92% of U.S.-based builders at enterprise corporations had been already utilizing AI-powered coding instruments at work 18 months in the past. That quantity has probably grown considerably since then.
“The challenge that people are having [because of] not being a coder is really that they don’t know a lot of the terminology. They don’t know best practices,” defined Witteveen. AI coding brokers more and more bridge this hole, permitting technical and non-technical group members to collaborate extra successfully.
For enterprise adoption, Witteveen recommends a balanced strategy: “It’s the balance between security and experimentation at the moment. Clearly, on the developer side, people are starting to build real-world apps with this stuff.”
For a deeper exploration of those points, try my current YouTube video dialog with Witteveen, the place we take a deep dive into the state of coding brokers and what they imply for enterprise AI technique.
Wanting forward: the way forward for enterprise coding
The rise of AI coding brokers indicators a elementary shift in enterprise software program improvement. When used successfully, these instruments don’t exchange builders however rework their roles, permitting them to deal with structure and innovation somewhat than implementation particulars.
Anthropic’s disciplined strategy in focusing particularly on coding capabilities whereas rivals chase a number of priorities seems to be paying dividends for the corporate. By the tip of 2025, we could look again on this era because the second when AI coding brokers turned important enterprise instruments — with Claude main the best way.
For technical decision-makers, the message is evident: Begin experimenting with these instruments now or danger falling behind rivals who’re already utilizing them to speed up improvement cycles dramatically. This second echoes the early days of the iPhone revolution, when corporations initially tried to dam “unsanctioned” gadgets from their company networks, solely to ultimately embrace BYOD insurance policies as worker demand turned overwhelming. Some corporations VentureBeat has talked with, like Honeywell, have not too long ago equally tried to close down “rogue” use of AI coding instruments not permitted by IT.
Talking Monday on the HumanX convention, James Reggio, the CTO of Brex, an organization that gives bank cards and different monetary companies to small and mid-sized enterprises, mentioned his firm initially additionally tried to implement a top-down strategy to AI mannequin choice, in an effort to succeed in perfection. However the firm confronted revolt amongst its developer staff, and shortly realized this was futile. It determined to permit customers to experiment freely. Sensible corporations are already organising safe sandbox environments to permit managed experimentation. Organizations that create clear guardrails whereas encouraging innovation will profit from each worker enthusiasm and insights about how these instruments can greatest serve their distinctive wants — positioning themselves forward of rivals who resist change. And Anthropic’s Claude, not less than for now, is a giant beneficiary of this motion.
Watch my video with developer Sam Witteveen right here for a full deep dive into the coding agent pattern:
Each day insights on enterprise use circumstances with VB Each day
If you wish to impress your boss, VB Each day has you lined. We provide the inside scoop on what corporations are doing with generative AI, from regulatory shifts to sensible deployments, so you’ll be able to share insights for max ROI.
An error occured.