Anthropic added Andrej Karpathy to its pre-training team and acquired Stainless, the SDK and MCP-server generator that powers nearly every official AI SDK (including OpenAI's). On the platform side, Managed Agents added self-hosted sandboxes (public beta) and private-network MCP tunnels (research preview). Claude Code shipped six releases (v2.1.144–v2.1.149) with a per-category /usage breakdown, a renamed and expanded /code-review command, auto mode on the Pro plan, GitHub-flavored markdown checkbox rendering, and several security fixes. Project Glasswing's first findings update reports that Claude Mythos Preview has identified more than 10,000 high- or critical-severity vulnerabilities in essential software, which means patch windows are compressing. If you run Claude Code, run claude update now.
Karpathy announced on X that he joined Anthropic this week, working on pre-training under team lead Nick Joseph. Per Anthropic, the new team is "focused on using Claude to accelerate pre-training research itself" — i.e. Claude helping train the next Claude. Karpathy's quote: "I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D."
Editor's read: Karpathy is a founding OpenAI researcher, former Tesla Autopilot lead, and the author of the de facto LLM curriculum (CS231n, Zero to Hero, nanoGPT). The hire matters less as a single name and more as a direction marker: Anthropic is betting the next moat is research velocity — running more experiments per dollar of compute — and Karpathy's last stint at OpenAI was on midtraining and synthetic data, the exact levers labs now pull when raw scaling stalls. Watch for changes in how fast Claude itself evolves over the next 12 months.
Stainless has generated every official Anthropic SDK (TypeScript, Python, Go, Java, and more) since the earliest days of the API, and also generates MCP servers from OpenAPI specs. Its customer list included Cloudflare, Google, OpenAI, Replicate, and Runway. Reported deal size is north of $300M.
Editor's read: Watch what gets shut down. Anthropic has reportedly closed Stainless's hosted SDK generator to new customers — existing-customer SDKs keep working, but the toolchain that produced half the industry's SDKs is now Anthropic-controlled. If your team relies on Stainless to generate SDKs or MCP servers, plan around the possibility that third-party access narrows over time. The acquisition page does not currently commit to a continued third-party offering.
When a request misses the cache, Claude Console now shows exactly which part of your prompt changed and how many tokens it cost you. Check it at platform.claude.com/usage/cache.
Anthropic renamed "extra usage" to "usage credits" across Claude — the rebrand reflects that credits now power features like fast mode directly, not just overflow beyond plan limits. Spending limits, auto-reload settings, and existing credits carry over as-is; /extra-usage is now /usage-credits (old name still works).
Claude Design doubled token limits on every plan. No action needed — existing projects automatically get the higher limits.
/usage breakdown, /code-review, security fixes (May 19–22, 2026)
Three things worth knowing across this week's batch:
/usage now breaks down tokens per category (skills, subagents, plugins, and per-MCP-server cost) in the CLI today, with Desktop coming next. It is the fastest way to find out which plugin is eating your context.
/simplify is renamed to /code-review and significantly expanded. It now reports correctness bugs at a chosen effort level (e.g., /code-review high) and accepts --comment to post findings as inline GitHub PR comments. Useful as a self-review step before opening a PR or as an autonomous reviewer for an agent loop.
Security fixes: cd functions (cd.., cd\, cd~, X:) could change the working directory undetected; the sandbox write allowlist in git worktrees covered the whole main repo root instead of only .git; permission analysis trusted stale PWD/OLDPWD/DIRSTACK across cd/pushd/popd; and bare variable assignments to non-allowlisted environment variables were auto-approved in v2.1.145.
Also worth a scan: pinned background sessions stay alive when idle and restart in-place to apply Code updates; claude agents --json lists live sessions for scripting (tmux-resurrect, status bars); OpenTelemetry spans now carry agent_id and parent_agent_id so background subagent traces nest correctly; markdown output renders GitHub-flavored markdown task list checkboxes (- [ ] / - [x]) instead of plain bullets; new allowAllClaudeAiMcps enterprise setting loads claude.ai cloud MCP connectors alongside managed-mcp.json.
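To see why the agent_id and parent_agent_id attributes mentioned above are enough to nest subagent traces, here is a minimal sketch that rebuilds a tree from flat span records. Only the two attribute names come from the release notes; the record shape, the sample agent names, and the nest_spans helper are hypothetical, and real exported spans will carry many more fields.

```python
from collections import defaultdict


def nest_spans(spans):
    """Group flat span records into a parent -> children tree by agent id."""
    children = defaultdict(list)
    for span in spans:
        # A span with no parent_agent_id is treated as a root (stored under None).
        children[span.get("parent_agent_id")].append(span["agent_id"])

    def build(agent_id):
        # Recursively attach each agent's children; assumes the data has no cycles.
        return {child: build(child) for child in children.get(agent_id, [])}

    return build(None)


# Hypothetical sample data for illustration only.
spans = [
    {"agent_id": "main", "parent_agent_id": None},
    {"agent_id": "reviewer", "parent_agent_id": "main"},
    {"agent_id": "tester", "parent_agent_id": "main"},
    {"agent_id": "linter", "parent_agent_id": "reviewer"},
]

tree = nest_spans(spans)
print(tree)
# {'main': {'reviewer': {'linter': {}}, 'tester': {}}}
```

Without the parent id, the four spans above would appear as unrelated siblings in a trace viewer.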
Heads-up: v2.1.148 was a Bash exit-code-127 hotfix — a regression from v2.1.147 had the Bash tool returning exit 127 on every command. If you skipped a day, you may have hit it.
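For context on what that regression looked like: POSIX shells return exit code 127 when a command cannot be found. The sketch below is a generic smoke check of that kind, assuming a POSIX sh is on the PATH. It exercises a local shell only; it does not call Claude Code's Bash tool, and the smoke_check name is made up for this example.

```python
import subprocess

COMMAND_NOT_FOUND = 127  # POSIX shells return 127 when a command cannot be found


def smoke_check(command="true"):
    """Run a trivial shell command and summarize how it exited."""
    result = subprocess.run(["sh", "-c", command], capture_output=True, text=True)
    if result.returncode == COMMAND_NOT_FOUND:
        return "broken: exit 127 (command not found)"
    if result.returncode != 0:
        return f"failed: exit {result.returncode}"
    return "ok"


print(smoke_check())          # a healthy shell runs `true` successfully
print(smoke_check("exit 3"))  # an ordinary non-zero exit is reported separately
```

If even a trivial command such as true comes back as 127, the problem is the execution environment rather than the command itself.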
Run claude update or update the Claude desktop app now.
Auto mode (Shift+Tab) is now available on the Pro plan in addition to existing tiers. Sonnet 4.6 is now supported in auto mode alongside Opus 4.7.
Anthropic published guidance on running Claude Code across multi-million-line monorepos, decades-old legacy systems, and distributed microservices. Worth a read if you're deploying at team scale.
New guidance on making computer use reliable in production: click accuracy, thinking effort levels, keeping long sessions within context, and recording demonstrations Claude can replay.
Two new security primitives for enterprise agent deployments. Self-hosted sandboxes keep the agent's execution environment in your own infrastructure (or with managed providers like Cloudflare, Daytona, Modal, or Vercel), so sensitive files and services never leave your perimeter. MCP tunnels let agents reach MCP servers inside your private network without exposing them to the public internet. Self-hosted sandboxes are available now on the Claude Platform; MCP tunnels are available by access request. Docs: platform.claude.com/docs/en/managed-agents/self-hosted-sandboxes.
You can now swap a live session's tools, MCP servers, or vault IDs without restarting the session. Large MCP tool outputs (>100k tokens) are automatically offloaded to Sandbox files, preventing context bloat on heavy tool-use workflows.
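The offloading behavior described above follows a common pattern: when a tool result exceeds a size threshold, write it to a file and hand the model a short pointer instead. The sketch below illustrates that pattern only. The 100k threshold is from the note above, while the four-characters-per-token estimate, the file name, and the function names are assumptions, not how Managed Agents implements it.

```python
import tempfile
from pathlib import Path

TOKEN_LIMIT = 100_000  # threshold named in the release note
CHARS_PER_TOKEN = 4    # rough heuristic (assumption), not Anthropic's tokenizer


def estimate_tokens(text):
    """Approximate the token count from the character count."""
    return len(text) // CHARS_PER_TOKEN


def offload_if_large(output, directory, limit=TOKEN_LIMIT):
    """Return the output unchanged, or a short pointer to a file holding it."""
    if estimate_tokens(output) <= limit:
        return output
    path = Path(directory) / "tool-output.txt"  # hypothetical file name
    path.write_text(output, encoding="utf-8")
    return f"[output offloaded to {path} (~{estimate_tokens(output)} tokens)]"


with tempfile.TemporaryDirectory() as tmp:
    print(offload_if_large("hello", tmp))                # small outputs pass through
    print(offload_if_large("x" * 500_000, tmp)[:21])     # large outputs become a pointer
```

The context then holds one short line instead of the full payload, and the agent can read the file selectively when it needs the details.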
@anthropic-ai/claude-agent-sdk@0.3.149 (TypeScript): fixes options.env dropping CLAUDE_AGENT_SDK_VERSION when a custom environment is supplied. Docs now correctly state that Options.env replaces the subprocess environment rather than merging with process.env, so audit your env handling if you pass a custom env object.
@anthropic-ai/claude-agent-sdk@0.3.144 (TypeScript): assistant messages and StopFailure hooks now report error: 'model_not_found' (was generic 'invalid_request') when the selected model is unavailable; new @anthropic-ai/claude-agent-sdk/extract export lets bun build --compile consumers embed and extract the native binary from a compiled executable.
claude-agent-sdk==0.2.86 (Python): bundles Claude CLI v2.1.149.
Claude Mythos Preview, Anthropic's invitation-only cybersecurity-focused frontier model deployed through Project Glasswing since April, has identified more than 10,000 high- or critical-severity vulnerabilities in essential software since launch. Expert validators agree with Claude's severity assessments 89% of the time, and within one severity level 98% of the time. The update includes working zero-day exploits on fully hardened systems. Full technical writeup: red.anthropic.com/2026/mythos-preview/.
Editor's read: Even with Anthropic withholding specifics until patches deploy, the headline number is the actionable signal: patch windows are compressing. If your release cadence treats dependency bumps as routine maintenance to be batched once a quarter, that posture is becoming a liability. The concrete things to change: turn on auto-update where you can, treat CVE-bearing dependency bumps as urgent rather than routine, and put your incident response on a runbook a junior engineer can execute at 3 a.m. The pace will keep accelerating, not slow down.
A widely-shared prompt pattern from the Claude Code team: instruct Claude to maintain a running implementation-notes.html file documenting decisions made outside the spec, tradeoffs, and unknowns. Useful for staying in the loop on long autonomous runs without having to read the transcript end-to-end.
/config → Output Style → Learning (May 18, 2026)
Learning mode walks you through what the agent is doing rather than just doing it, which is useful for teams who want to use Claude Code while staying hands-on enough to actually understand the changes.
Cursor Automations are now inside the Agents Window and support multi-repo configurations and no-repo (pure monitoring/workflow) automations. New Marketplace templates for Slack digests, product analytics, finance, and customer health workflows. Agent runs for newly created automations are 50% off for 7 days from release.
KPMG is integrating Claude across its core business and a workforce of more than 276,000 globally. Relevant for builders selling into professional services or audit/advisory — this signals the enterprise readiness and compliance posture that supports sales conversations in that vertical.
Anthropic launched a video series featuring founders building with Claude. First two episodes: Kay Zhu (co-founder/CTO of Genspark) and Scott Wu (Cognition/Devin). Watch at claude.com/problem-solvers.
Anthropic has been running dialogues with scholars, philosophers, clergy, and ethicists on questions AI raises, starting with how good character forms. No direct developer action required, but signals how Anthropic's values-based positioning will continue to shape model behavior and policy.
Follow-up to last week's announcement: Jarred Sumner reports running Claude Code on the Rust port of Bun for several days with no behavior differences and a passing test suite across all platforms — plus a ~10% bundler speed improvement. Claude Code is actively working through ~100 open PRs to rewrite Zig implementations as Rust, including the XML parser and memory leak fixes.
Immediate:
Run claude update (or update the Claude desktop app) to get Code v2.1.149, which includes PowerShell permission bypass fixes and the Bash exit-code-127 regression fix from v2.1.148.
If you pass a custom options.env, audit your code: Options.env replaces (not merges with) process.env. This is now documented in v0.3.149 but was previously undocumented behavior.
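For the Options.env audit, the practical difference between replacing and merging can be shown with plain dictionaries. This is a language-neutral sketch in Python, not the SDK's API: merged_env, the DEMO_CUSTOM variable, and the stand-in base environment are all hypothetical. In TypeScript the equivalent idea is spreading process.env into the object before adding your own keys.

```python
import os


def merged_env(custom, base=None):
    """Return the base environment with the custom values layered on top."""
    base = os.environ if base is None else base
    return {**base, **custom}


custom = {"DEMO_CUSTOM": "x"}                       # hypothetical variable
base = {"PATH": "/usr/bin", "HOME": "/home/dev"}    # stand-in for the parent environment

replace_only = dict(custom)          # what a replacing API passes to the subprocess
merged = merged_env(custom, base)    # what you get when you merge explicitly

print(sorted(replace_only))  # ['DEMO_CUSTOM']
print(sorted(merged))        # ['DEMO_CUSTOM', 'HOME', 'PATH']
```

If your subprocess depends on inherited variables such as PATH or credentials, a replace-only environment silently drops them, which is the case worth checking for.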