brain/
← all entities
entitygenericartificial-intelligence

Claude Code

Notes

Claude Code

One-line summary: Anthropic's coding agent — a CLI plus web, iOS, and Android surfaces that all share one backend model workflow.

What it is

An agentic coding tool built around Claude. Usable from a terminal, a web app, and mobile apps (Claude iOS / Android). The same sessions are discoverable across surfaces.

Why it matters to this thread

It is the concrete interface most of this thread's Claude/agent-workflow commentary is about. Understanding its surfaces and primitives (plan mode, auto-accept edits, CLAUDE.md, GitHub Action) is prerequisite for most concept pages here.

Key facts (from 2026-04-21-boris-claude-techniques)

  • Surfaces: terminal, web, iOS app, Android app. Boris uses all four; mobile handles "probably half" of his coding now.
  • Plan mode: first-class feature; user iterates on a plan before allowing execution.
  • Auto-accept edits: mode the user switches into once the plan is good; model then executes the plan end-to-end.
  • CLAUDE.md: per-repo plain text file the agent reads for context. No enforced format — "just a text file."
  • GitHub Action: installable via the /install-github-action slash command; lets teammates @claude in PR comments / issues so the agent can push changes back to the branch directly.
  • Customization philosophy: Anthropic deliberately built Claude Code to be customized/hacked per user — "no one correct way to use it."

Later-shipped features

/ultrareview — fleet-based pre-merge bug review (announced 2026-04-22)

From 2026-04-22-claude-ultra-review (the @ClaudeDevs announcement post and its reply thread):

  • What it is: A new slash command (research preview) that runs "a fleet of bug-hunting agents in the cloud." Findings land back in the CLI or Desktop automatically.
  • Recommended use: Before merging critical changes — "auth, data migrations, etc."
  • Access: Pro and Max users get 3 free reviews through 5/5 (May 5 2026). Activated via claude update.
  • Architectural note: Cloud-executed, not local. The user's CLI/Desktop is the findings-delivery surface, not the runtime.

See multi-agent-code-review for the broader pattern this operationalizes.

Reception, from the announcement reply thread (weigh carefully — reply-thread samples skew toward strong opinions):

  • Positive / pattern-legitimizing: @MindTheGapMTG: "We've been running multi-agent code review for months. Single-pass review catches syntax errors. Fleets catch the interactions between auth changes and data flows that actually break production." @Tahseen_Rahman: "Once code agents can write large changes quickly, review has to become parallelized and adversarial enough to keep up." @Surreal_Intel: frames the trajectory as "a permanent machine review layer sitting between 'done' and 'deployed'."
  • Cultural-norm shift claim (open question — unverified): @jatingargiitk argues the feature's existence "moves the bar for code review" — reviewers will start expecting a fleet-run on auth/migration PRs. One opinion, no evidence yet that this has happened at any team; flagged as an open question in multi-agent-code-review.
  • Negative: @thepatriotvlls claims first-day use: "burns through your usage," "QoR abysmal since you can't go back and forth," "incredibly buggy and constantly disconnected from my terminal instance." Single-commenter evidence from day of release.
  • Usage-limit theme: Multiple replies (@junijas, @Ultrabeyonder1, and implicit in @thepatriotvlls' "burns through your usage") concentrate criticism on subscription/token consumption — consistent with the broader claude-code pricing gap between advertised and real heavy-use cost already flagged above.
  • Adjacent-tool plug: @mndaniel78 promoted "Witness" — a read-only, local-login alternative positioned as a complement: "/review, no write/run access. Uses your Claude login." No independent corroboration of the tool; captured as a data point about the ecosystem, not a claim about Witness itself.

Open questions surfaced by the announcement:

  • Will the cultural-expectation claim materialize — do teams actually require fleet-runs on critical PRs? Needs ~1 quarter of observation.
  • Does /ultrareview meaningfully outperform single-agent review on real-world bug classes? (No benchmark cited; the product is a research preview.) Track in multi-agent-code-review.
  • What happens to access after the 5/5 free-reviews window ends? Pricing model not disclosed in the source.

Key model pairing

  • Boris recommends running Claude Code with claude-opus-4-5 (thinking enabled) as the default. His claim: the extra per-token cost is more than offset by fewer total tokens to completion.

Known limitations surfaced in-source

  • Session sync between surfaces is one-directional. Reply commenter @dmat13484493: "If only iOS Claude would sync with the Mac mini Claude. Only syncs in One Direction."

Market position and scale (from 2026-04-21-autoresearch-best-ai-coding-tools)

  • Revenue: Uvik reports Claude Code at $2.5B ARR and "over half of Anthropic's enterprise revenue." (Note: this exceeds cursor's reported $1B ARR — a striking claim sourced to a consultancy blog, not a primary disclosure. Flagged as an open question in ai-coding-tool-landscape-2026.)
  • Adoption (JetBrains April 2026 survey, 10K+ devs): 18% adoption / 57% awareness, with 6× growth from mid-2025; reaches 24% in US/Canada. Survey authors frame this as "product excellence now outweighs ecosystem lock-in."
  • Primary-tool share (Claude5.ai survey): Reported as 28% of primary-tool selections — in tension with JetBrains' 18% overall-adoption figure. Likely a primary-vs-any-use phrasing gap. See ai-coding-tool-landscape-2026.
  • Bottom-up adoption: Uvik's staff-augmentation practice observes: "Claude Code adoption has entered roughly 60–70% of teams in the past nine months — almost always through individual engineer advocacy rather than top-down rollout."

Pricing (from 2026-04-21-autoresearch-best-ai-coding-tools)

  • Advertised: $20/mo Pro; $100/mo Max (standalone or API).
  • Real heavy-use cost: $150–200/month per developer per MorphLLM's practitioner report — a 7.5–10× gap from the Pro tier, consistent with Amplifi Labs' general observation that "developers should budget for at least 50% more than the advertised base price if using agentic features daily."

Benchmark position (from 2026-04-21-autoresearch-best-ai-coding-tools)

See ai-coding-benchmarks for the full picture. Headline figures:

  • SWE-bench Verified (agent-level, not just model): ~80.9% — tops the field.
  • Terminal-Bench 2.0: ~65.4% — behind codex-cli (~77.3%).
  • Tops multi-file reasoning leaderboards; pairs with claude-opus-4-5 as its recommended model.

In the METR RCT context

ai-coding-productivity-paradox — note that METR's 2025 "19% slower" RCT used cursor Pro with Claude 3.5/3.7 Sonnet, not Claude Code. Evidence specific to Claude Code's productivity effect is field-report-quality (Uvik), not RCT-quality.

Sources

Related

Referenced by