AI Coding Startups Face Margin Crisis Amid High LLM Costs

Techcrunch

The AI coding assistant sector, despite its fervent hype and rapid growth, is grappling with a formidable challenge: the high cost of doing business is eroding profit margins, threatening the long-term viability of even the most promising startups. This underlying financial fragility was starkly illustrated by the recent saga of Windsurf, an AI coding startup that initially garnered significant investor interest but ultimately sought an exit.

In February, Windsurf was reportedly in advanced discussions to secure a substantial funding round, valuing the company at an impressive $2.85 billion—double its valuation from just six months prior. Yet, that deal never materialized. Instead, by April, news emerged that Windsurf intended to sell itself to OpenAI for a similar valuation, approximately $3 billion. While that acquisition famously fell through, it begged a crucial question: if the startup was growing so rapidly and attracting such high-profile venture capital interest, why would it consider selling at all?

Sources close to the industry reveal a sobering truth: for all their popularity, AI coding assistants can be massive money-losing ventures. Companies like Windsurf often operate with structures so expensive that their gross margins — the revenue remaining after direct costs of producing a good or service — are “very negative.” In essence, it costs more to deliver the product than the company can charge its customers.

This financial strain stems primarily from the exorbitant costs associated with utilizing large language models (LLMs). AI coding assistants are under immense pressure to consistently integrate the newest, most advanced, and consequently, most expensive LLMs. This is because model developers are constantly fine-tuning their latest iterations for superior performance in coding and related tasks, such as debugging, making older models quickly obsolete.

Compounding this challenge is the intense competition within the code-assist market. Entrenched players like GitHub Copilot and Anysphere’s Cursor already command vast user bases, making it difficult for newer entrants to differentiate themselves and capture market share without offering cutting-edge performance.

The most direct route for these startups to improve their margins involves developing their own proprietary LLMs, thereby eliminating the substantial fees paid to external suppliers such as Anthropic and OpenAI. As one insider put it, “It’s a very expensive business to run if you’re not going to be in the model game.” However, building an LLM is a monumental and costly undertaking. Windsurf’s co-founder and CEO, Varun Mohan, ultimately decided against this path for his company, a decision that proved strategically prescient given that the very model makers, including Anthropic with Claude Code and OpenAI with Codex, are now directly competing in the AI coding market. Windsurf’s strategic move to sell was, in part, an attempt to lock in a high return before its business could be undermined by its own suppliers.

The pressure Windsurf faced is not unique. Many in the industry believe similar margin challenges plague other prominent AI coding tools, including Anysphere’s Cursor, Lovable, and Replit. Nicholas Charriere, founder of the AI coding startup Mocha, bluntly stated that margins on “code gen” products are “either neutral or negative. They’re absolutely abysmal,” estimating that variable costs across the sector are remarkably similar, likely within a 10% to 15% range.

Anysphere, the company behind Cursor, offers a contrasting narrative. Despite facing similar cost pressures, its rapid growth has allowed it to decline acquisition offers, reportedly even from OpenAI, with the intent of remaining independent. In January, Anysphere announced its ambitious plan to build its own LLM, a move that could grant it greater control over expenses. The company even hired two leaders from Anthropic’s Claude Code team in July, though they returned to Anthropic just two weeks later.

Another long-term hope for these companies rests on the expectation that LLM costs will eventually decrease. Erik Nordlander, a general partner at Google Ventures, echoed this sentiment, suggesting, “The inference cost today, that’s the most expensive it’s ever going to be.” However, this projection isn’t entirely clear-cut. Contrary to expectations, the cost of some of the latest AI models has actually risen, as they demand more computational resources to handle increasingly complex, multi-step tasks.

The market remains dynamic. Just recently, OpenAI introduced GPT-5, a new flagship model with significantly lower fees than its competitor, Anthropic’s Claude Opus 4.1. Anysphere swiftly integrated GPT-5 as an option for Cursor users. Yet, Anysphere also recently adjusted its pricing structure, passing increased costs, particularly for Anthropic’s latest Claude model, onto its most active users. This move caught some Cursor subscribers by surprise, as they faced additional charges on top of their $20-per-month Pro plan, prompting an apology from Anysphere CEO Michael Truell for the unclear communication.

This situation highlights the precarious balance these companies must strike. While Cursor boasts impressive metrics, having reached $500 million in annual recurring revenue by June, its user base may not exhibit unwavering loyalty if a superior, more cost-effective tool emerges.

In the end, Windsurf’s decision to exit the market appears understandable. Following the collapse of the OpenAI deal, its founders and key employees joined Google, leading to a substantial $2.4 billion payout for key shareholders. The remaining business was subsequently sold to Cognition. While some, including prominent venture capitalists, criticized Mohan for leaving approximately 200 employees without roles at Google, sources familiar with the deal maintain that the acquisition ultimately maximized outcomes for all employees.

The challenges faced by AI coding tools like Replit, Lovable, and Bolt, all of which rely heavily on external model makers, underscore a broader concern. If this immensely popular and revenue-generating sector struggles to build sustainable businesses on top of existing LLM providers, it raises significant questions about the viability of other, more nascent industries similarly dependent on these foundational AI models.

[[]] AI coding startups hit a wall: high LLM costs are eating profits, forcing a brutal choice between survival and innovation.]]