brain/
conceptartificial-intelligence

Open vs closed source model economics

Notes

Open vs closed source model economics

One-line summary: Open-weight models are ~3–5% behind the best closed models on quality but dramatically cheaper per unit of intelligence — because the user pays only for serving (power + compute), not for training. The "battle" between the two is, per andrew-feldman (May 2026), undecided; closed is "strictly better by a little bit," and the durable question is how big a premium that little bit commands.

The insight

The open-vs-closed split in AI is not the open-vs-closed split in traditional software, and conflating them is the core error this concept exists to flag.

  1. "Free open source" doesn't transfer. joe-weisenthal's framing: in traditional software, open source is free; in AI there is "no real such thing as free open source AI software" because even a free-to-license model still costs chip depreciation + electricity to run. The relevant cost axis is serving cost, not license cost.
  2. The quality gap is small; the cost gap is large. andrew-feldman (running an inference cloud that serves both) puts the closed-vs-open quality difference at "3, 4%, 5%" with closed "strictly better." But on cost per unit of intelligence, open is cheaper "by a lot" — because the user "what you're not paying for was the cost to train it." A ~1T-parameter open model (Kimi K2) runs on Cerebras "10 or 15 times faster than others" at just power + compute cost.
  3. A levelized-cost-of-intelligence metric is missing. Joe proposes "cost per IQ point" / "levelized cost of intelligence" as the unit that would let buyers compare honestly — and notes the industry doesn't have it yet. Without it, the premium for closed quality is hard to price.
  4. The market is bifurcating, not consolidating. Feldman expects no single winner — "I don't think there's going to be one," analogizing to x86 (Intel/AMD) + ARM + custom silicon coexisting. Closed frontier labs (OpenAI, Anthropic), open-model serving (Cursor, Cognition on open weights), and specialists all persist.
  5. Quiet enterprise migration to open is already happening. tracy-alloway reports "a lot of big companies in the US ... very quietly shifting from some of the closed source models to the open source models like the Chinese ones, like Kimi" and Qwen.

This dovetails with the thread's llm-as-commodity-thesis (Ghodsi: models are interchangeable at the unit level; durable value is above the model layer) and with cuda-moat-erosion-at-inference (runtime portability removes lock-in at the inference layer). Where the commodity thesis says "models commoditize," this concept adds the open-vs-closed pricing structure underneath that commoditization: open weights commoditize fastest because their cost floor is just serving.

Evidence

Design implications

  • The right comparison unit is serving cost per unit of intelligence, not license cost. Whoever can serve a near-frontier open model fastest/cheapest (the wafer-scale bet of cerebras) captures the open-serving market.
  • The closed premium is bounded by the (small) quality gap times each workload's sensitivity to that gap. Joe's prediction — companies getting "more skilled at allocating from different forms of inference" — implies the closed premium is a task-routing question, not a blanket one: premium closed model for the hard 5% of tasks, cheap open model for the rest.
  • Chinese open-weight models (Kimi, Qwen) are a live part of the US enterprise stack, which carries an export-control / data-governance dimension this source only gestures at.

Contradictions / tensions

  • The headline cost claims come from a CEO whose inference cloud monetizes serving open models — clear incentive to talk up open-model economics. Treat "cheaper by a lot" as directional.
  • Closed labs are not standing still: a 3–5% quality gap measured in May 2026 is a vintage snapshot (see the thread's capability-tracking discipline); the gap and the premium could widen or narrow with each frontier release.
  • "No single winner" is a forecast, not an observation — Feldman's x86/ARM analogy is plausible but the AI model market could still consolidate around 1–2 closed labs if the quality gap proves to compound rather than stay constant.

Open questions

  • Does the closed-quality premium widen (gap compounds → closed pulls away) or collapse (open catches up → premium → zero) over the next several frontier-release cycles?
  • Will a standardized "levelized cost of intelligence" metric emerge, and who defines it? Without it, the premium stays hard to price.
  • How much of the quiet enterprise migration to Chinese open models survives export-control / data-governance scrutiny?

Related

Referenced by