TLDR;Opus 4.8 is the headline this week — new model, same price, stronger coding benchmarks, and a genuinely useful new feature for agentic loops (mid-conversation system messages). Claude Code also shipped dynamic workflows in research preview, letting you throw hundreds of parallel subagents at your biggest migrations — Salesforce reports a 231-day migration completed in 13 days, and Bun's 700k-LOC Rust rewrite shipped in 6. Several hard deadlines are coming fast: the Sponsored by CodeRabbit Your AI shouldn't grade its own homeworkClaude Code writes beautiful code. So does Codex. But here's the thing — they also think they write beautiful code. And when you ask an AI to review code it just wrote, you get the intellectual equivalent of a student grading their own exam. Shockingly, they always pass. CodeRabbit CLI plugs into Claude Code and Codex as an external reviewer — different AI agent, different architecture, 40+ static analyzers, and zero emotional attachment to the code it's looking at. The agent writes, CodeRabbit reviews, and the agent fixes. Loop until clean. You show up when there's actually something worth approving. One command. Autonomous generate-review-iterate cycles. The AI still does the work. It just doesn't get to decide if the work is good anymore. API & Model UpdatesIntroducing Claude Opus 4.8Released May 28 at the same price as Opus 4.7 ($5/$25 per million input/output tokens). SWE-bench Pro improves from 64.3 → 69.2; the model is notably more honest about uncertainty and catches its own bugs rather than declaring premature victory. Mid-Conversation System Messages — New in Opus 4.8You can now pass a Opus 4.8 Fast Mode — Research PreviewFast mode is now available for Opus 4.8 at 2× the standard rate for 2.5× the speed — roughly 3× cheaper than the previous Opus fast mode tier. Set Opus 4.8 Prompt Caching — Lower Minimum Token ThresholdThe minimum cacheable prompt length on Opus 4.8 drops to 1,024 tokens (down from Opus 4.7). Opus 4.8 also uses adaptive thinking, triggering reasoning only when a turn needs it — fewer wasted thinking tokens compared to running Opus 4.7 at the same effort level. Opus 4.8: Drop-in Replacement for Opus 4.7 — No Breaking ChangesSame context window, same tool surface, same pricing. For teams already running Opus 4.7 in production, migration is a config-only model string swap. Run ⚠️ claude-sonnet-4-20250514 and claude-opus-4-20250514 Retire June 15, 2026Both original Claude 4 model snapshots are deprecated and retire on the Claude API on June 15. Migrate to Claude Sonnet 4.6 and Claude Opus 4.7 (or 4.8) respectively before that date. ⚠️ Claude Agent SDK Billing Split — Effective June 15, 2026Programmatic Claude usage (Claude Agent SDK, ⚠️ SDK Package Rename: @anthropic-ai/claude-code → @anthropic-ai/claude-agent-sdkEffective June 15: the TypeScript package ⚠️ CLAUDE_CODE_OPUS_4_6_FAST_MODE_OVERRIDE Deprecated — Removed June 1This env var is deprecated in Claude Code v2.1.154 and will be removed on June 1, 2026. Switch to anthropic Python SDK v0.105.2Released May 29, includes Opus 4.8 spec updates. v0.105.0 shipped May 28. Install: Claude Code & CLIClaude Code v2.1.157 — Plugins Auto-load from .claude/skillsReleased May 29. Plugins in Claude Code v2.1.156 — Opus 4.8 Thinking Block FixReleased May 29. Hotfix for an issue where thinking blocks were modified when using Opus 4.8, causing API errors. Update immediately if you're running 4.8 in Claude Code. Claude Code v2.1.154 — Opus 4.8, Dynamic Workflows, Fast ModeReleased May 28. The flagship release this week: Opus 4.8 is now the default, dynamic workflows ship as a research preview, fast mode on Opus 4.8 is available at 2× cost/2.5× speed, and the lean system prompt is now default for all models except Haiku, Sonnet, and Opus 4.7 and earlier. Also: Dynamic Workflows — Research Preview in Claude CodeType "workflow" in a prompt to have Claude write an orchestration script on the fly and spin up tens to hundreds of coordinated subagents in parallel. Designed for migrations, large refactors, and performance optimization. Run Claude Code v2.1.153 — MCP Fixes, macOS Permissions, Git LFS SkipReleased May 28. Fixed stateful MCP servers without optional GET SSE stream reconnect-looping on Claude Code v2.1.152 — /code-review --fix, Hook Improvements, Auto Mode No Longer Requires Opt-inReleased May 27. Security-Guidance Plugin for Claude Code — Shipped May 26A new security plugin identifies and fixes vulnerabilities as you write code. Install via Claude Code Reliability Update — May 27Several quality-of-life fixes: fewer cryptic "tool result doesn't match tool use" errors with more readable messages; compaction now shows progress and no longer blocks on "prompt too long" bugs; more compaction speed improvements rolling out. Agent SDK & Managed Agentsclaude-agent-sdk-typescript v0.3.157Released May 29. Parity with Claude Code v2.1.157 (auto-load plugins from claude-agent-sdk-typescript v0.3.154 — stdio MCP FixReleased May 28. Fixed stdio MCP servers being incorrectly restarted on every reconcile pass due to config-equality false positives — a meaningful reliability fix for anyone using stdio MCP in SDK-driven agents. Update: claude-agent-sdk-typescript v0.3.152 — Hook EventsReleased May 27. claude-agent-sdk-python v0.2.87Released May 23. Bundles Claude CLI v2.1.150; CI switched from static API key to Workload Identity Federation using short-lived tokens. Install: Developer Tools & CommunitySalesforce: 231-Day Migration Completed in 13 Days with Claude CodePublished May 29. Salesforce published a detailed writeup on agentic Claude Code usage: one migration scoped at 231 days shipped in 13; one PR delivered 21 endpoints at 100% test coverage; total incidents dropped 5% even as PR volume increased. They built security guardrails directly into the agentic workflow. The core lesson: the teams seeing the biggest wins are redesigning their workflows entirely, not just speeding up existing ones. Bun Rewritten in Rust in 6 Days Using Claude Code Dynamic WorkflowsPublished May 28. Follow-up to last week's "no behavior diffs after several days" report: Jarred Sumner confirmed the full 700k-LOC Bun → Rust rewrite shipped in just 6 days, with dynamic workflows and adversarial code review central to the result. He's seen individual workflows run for 10 continuous hours. The codebase is now under active fuzzing (using swc's test suite as corpus) to address security scrutiny. How We Contain Claude Across ProductsPublished May 25. Anthropic engineering post on the isolation and containment architecture used when running Claude across different product surfaces — sandboxes, VMs, and egress controls to cap an agent's blast radius. Useful context for anyone designing multi-tenant Claude-powered applications. Coding Agents in the Social SciencesPublished May 27. Research on using Claude coding agents for social science research tasks — relevant if you're building Claude-powered research automation tools or academic tooling. Competitor Dev ToolsCursor 3.6 — Auto-review Run ModeReleased May 29. New "Auto-review" run mode applies to Shell, MCP, and Fetch tool calls: allowlisted calls run immediately, sandboxable calls run in the sandbox, everything else goes to a classifier subagent that decides whether to proceed or ask. Configure in Settings > Cursor Settings > Agents > Run Mode. GitHub Copilot: Claude Opus 4.8 GA — 15× Premium Multiplier Until June 1Published May 28. Opus 4.8 is now generally available in GitHub Copilot for Pro+, Business, and Enterprise. Note: it launches at a 15× premium request multiplier until Usage Based Billing launches on June 1 — plan your usage accordingly if you're on a seat-limited plan. Enterprise & BusinessClaude Marketplace — Five New Partners AddedPublished May 27. Augment Code, Bolt, CodeRabbit, Hebbia, and Legora are now in the Claude Marketplace. Enterprise customers can apply existing Anthropic spend commitments toward these Claude-powered products. See claude.com/platform/marketplace. Anthropic Opens Milan OfficePublished May 27. New office to support Italian enterprise customers, researchers, and developers. Relevant if you're building Claude-powered products in the EU and want regional support contacts. Research & SafetyChris Olah's Remarks on Pope Leo XIV's Encyclical "Magnifica Humanitas"Published May 25. Anthropic co-founder Chris Olah responds to the papal encyclical on AI and humanity. Background context on how Anthropic's interpretability work is being received in broader societal discourse. Action ItemsBy June 1, 2026:
By June 15, 2026:
Immediate (model migration):
Did this issue land?One tap tells me what to keep doing (or drop). It is the fastest way to shape next week. And I read every reply. Hit reply and tell me what you want more of, or less of, and it goes straight into the next issue. |
A weekly digest for Claude Code builders. Model updates, releases, and notable tools. Every Monday morning.
TLDR; Anthropic added Andrej Karpathy to its pre-training team and acquired Stainless, the SDK and MCP-server generator that powers nearly every official AI SDK (including OpenAI's). On the platform side, Managed Agents added self-hosted sandboxes (public beta) and private-network MCP tunnels (research preview). Claude Code shipped six releases (v2.1.144–v2.1.149) with a per-category /usage breakdown, a renamed-and-expanded /code-review command, auto mode on the Pro plan, GitHub-flavored...
TLDR; A massive week for builders: Claude Platform on AWS went GA on May 11, agent view landed in Claude Code, and three breaking changes hit the TypeScript Agent SDK (v0.3.142) that require migration. Claude Code weekly limits jumped another 50% (stacking with last week's 2× 5-hour bump), Opus 4.7 fast mode entered research preview, and a new programmatic credit pool ($20–$200/month depending on plan) lands June 15 — claim window opens June 8. Five daily Claude Code releases...
TLDR; A research- and infrastructure-heavy week. Anthropic doubled Claude Code's five-hour rate limits and credited a new SpaceX compute deal (Colossus 1: 300 MW, 220k+ NVIDIA GPUs) for making room. Claude Code shipped v2.1.128–v2.1.136 with plugin loading from URLs, cross-project Ctrl+R history, new worktree base-ref control, and auto-mode hard-deny rules — plus a viral /radio Easter egg. AWS MCP Server is generally available as part of the Agent Toolkit for AWS. The Agent SDKs shipped six...