Open vs closed source model economics

One-line summary: Open-weight models are ~3–5% behind the best closed models on quality but dramatically cheaper per unit of intelligence — because the user pays only for serving (power + compute), not for training. The "battle" between the two is, per andrew-feldman (May 2026), undecided; closed is "strictly better by a little bit," and the durable question is how big a premium that little bit commands.

The insight

The open-vs-closed split in AI is not the open-vs-closed split in traditional software, and conflating them is the core error this concept exists to flag.

"Free open source" doesn't transfer. joe-weisenthal's framing: in traditional software, open source is free; in AI there is "no real such thing as free open source AI software" because even a free-to-license model still costs chip depreciation + electricity to run. The relevant cost axis is serving cost, not license cost.
The quality gap is small; the cost gap is large. andrew-feldman (running an inference cloud that serves both) puts the closed-vs-open quality difference at "3, 4%, 5%" with closed "strictly better." But on cost per unit of intelligence, open is cheaper "by a lot" — because the user "what you're not paying for was the cost to train it." A ~1T-parameter open model (Kimi K2) runs on Cerebras "10 or 15 times faster than others" at just power + compute cost.
A levelized-cost-of-intelligence metric is missing. Joe proposes "cost per IQ point" / "levelized cost of intelligence" as the unit that would let buyers compare honestly — and notes the industry doesn't have it yet. Without it, the premium for closed quality is hard to price.
The market is bifurcating, not consolidating. Feldman expects no single winner — "I don't think there's going to be one," analogizing to x86 (Intel/AMD) + ARM + custom silicon coexisting. Closed frontier labs (OpenAI, Anthropic), open-model serving (Cursor, Cognition on open weights), and specialists all persist.
Quiet enterprise migration to open is already happening. tracy-alloway reports "a lot of big companies in the US ... very quietly shifting from some of the closed source models to the open source models like the Chinese ones, like Kimi" and Qwen.

This dovetails with the thread's llm-as-commodity-thesis (Ghodsi: models are interchangeable at the unit level; durable value is above the model layer) and with cuda-moat-erosion-at-inference (runtime portability removes lock-in at the inference layer). Where the commodity thesis says "models commoditize," this concept adds the open-vs-closed pricing structure underneath that commoditization: open weights commoditize fastest because their cost floor is just serving.

Evidence

andrew-feldman in 2026-05-21-odd-lots-why-cerebras-ceo-andrew-feldman-built-the-world-s: "you can jump up right now and run Kimi K2. It's a 1 trillion parameter model. It's an open source model on cerebras where 10 or 15 times faster than others. And what you're paying for is the cost of our power and some cost of the compute that took to calculate it. What you're not paying for was the cost to train it."
andrew-feldman in 2026-05-21-odd-lots-why-cerebras-ceo-andrew-feldman-built-the-world-s: "The open source models, there are no open source models that are as good as the closed source models. Think of it as 3, 4%, 5% different... What is clear is that the closed source is strictly better by a little bit, by how much varies and it's more expensive."
andrew-feldman in 2026-05-21-odd-lots-why-cerebras-ceo-andrew-feldman-built-the-world-s: "You have OpenAI with their coding software, you have Anthropic with their coding software. And you've got companies like Cursor and Cognition that are using open source. We power OpenAI and we power Cognition. You have a battle underway between closed source and open source. And I think that the winners of that battle is yet to be determined."
joe-weisenthal in 2026-05-21-odd-lots-why-cerebras-ceo-andrew-feldman-built-the-world-s: "there's no really such thing as like free AI software. Because even if it's like free, you still have to pay for the depreciation of the chips and you have to pay for the electricity to run them. So there is no real such thing as like free open source AI software."
joe-weisenthal in 2026-05-21-odd-lots-why-cerebras-ceo-andrew-feldman-built-the-world-s: "are the open source models cheaper on a per unit of intelligence basis? If we had some way of saying levelized cost of intelligence, which I don't know if the industry has yet, are open source models cheaper per IQ point?"
tracy-alloway in 2026-05-21-odd-lots-why-cerebras-ceo-andrew-feldman-built-the-world-s: "I've heard of a lot of big companies in the US who have been very quietly shifting from some of the closed source models to the open source models like the Chinese ones, like Kimi... And Qwen."
June 2026 update — the 3-5% gap closed on some benchmarks. alexander-wissner-gross in 2026-06-26-podcast-moonshots-the-10b-satellite-empire-putting-ai-in-orbit-why: "the gestalt with GLM 5.2 is that it takes roughly double the number of tokens to get to the same capability output as the best western frontier models, but at half the total price. So the Chinese are evidently figuring out how to more efficiently or at least more cheaply reason." This directly answers this page's own open question ("does the closed-quality premium widen or collapse") in the collapse direction — but by a cost route, not a capability-per-token route. See chinese-open-weight-frontier-parity.
salim-ismail in 2026-06-26-podcast-moonshots-the-10b-satellite-empire-putting-ai-in-orbit-why: "frontier intelligence cannot be monopolized anymore"
clark-tang in 2026-06-11-podcast-bg2-pod-the-spacex-ipo-fable-5-ai-capex-update-market, on why closed retained the value even as open took the volume: "the reason why closed source models have captured so much of the value is because the models actually get the intention and actually carry through the work"
gavin-baker in 2026-06-11-podcast-bg2-pod-the-spacex-ipo-fable-5-ai-capex-update-market, on the volume/value split: "Open source might be 80% of tokens." — and the inversion that follows: "It's actually really bullish for compute and hardware because if the frontier models are capturing less of the margin then you're going to spend more on compute." See open-source-share-shift-bullish-for-compute.

Design implications

The right comparison unit is serving cost per unit of intelligence, not license cost. Whoever can serve a near-frontier open model fastest/cheapest (the wafer-scale bet of cerebras) captures the open-serving market.
The closed premium is bounded by the (small) quality gap times each workload's sensitivity to that gap. Joe's prediction — companies getting "more skilled at allocating from different forms of inference" — implies the closed premium is a task-routing question, not a blanket one: premium closed model for the hard 5% of tasks, cheap open model for the rest.
Chinese open-weight models (Kimi, Qwen) are a live part of the US enterprise stack, which carries an export-control / data-governance dimension this source only gestures at.

Contradictions / tensions

The headline cost claims come from a CEO whose inference cloud monetizes serving open models — clear incentive to talk up open-model economics. Treat "cheaper by a lot" as directional.
Closed labs are not standing still: a 3–5% quality gap measured in May 2026 is a vintage snapshot (see the thread's capability-tracking discipline); the gap and the premium could widen or narrow with each frontier release.
"No single winner" is a forecast, not an observation — Feldman's x86/ARM analogy is plausible but the AI model market could still consolidate around 1–2 closed labs if the quality gap proves to compound rather than stay constant.
The May 2026 vintage snapshot has been overtaken. This page was written when "closed is strictly better by a little bit" was the state of play. By late June 2026, GLM 5.2 matched or exceeded top Western models on coding, long-range-agency and design benchmarks, and alexander-wissner-gross reports the "six to eight months behind" assumption starting to "creak." The quality framing may be the wrong axis entirely: GLM 5.2 is worse per token and better per dollar. Whether that counts as closing the gap depends on which denominator you price. Worth a /calibrate entry — the "closed is durably better" prior moved on evidence.
Distillation may explain the convergence rather than refute the gap. will-marshall: "They did distillation almost certainly on the best models." If parity is borrowed, the premium is a lead measured in trace-generation cycles, not a moat. See distillation-and-iterated-amplification.

Open questions

Does the closed-quality premium widen (gap compounds → closed pulls away) or collapse (open catches up → premium → zero) over the next several frontier-release cycles?
Will a standardized "levelized cost of intelligence" metric emerge, and who defines it? Without it, the premium stays hard to price.
How much of the quiet enterprise migration to Chinese open models survives export-control / data-governance scrutiny?

Open vs closed source model economics

Open vs closed source model economics

The insight

Evidence

Design implications

Contradictions / tensions

Open questions

Related