Error!
Unable to generate download right now.
We can't find the internet
Attempting to reconnect
Something went wrong!
Hang in there while we get back on track
🔥 TOP SIGNAL
Cloudflare’s new Code Mode MCP server pushes a crisp direction for MCP: keep the tool surface area tiny (just search + execute) while shifting “Code Mode” to the server and cutting context token overhead dramatically (claimed 99.9% fewer input tokens vs an equivalent native MCP implementation) . Practitioners immediately endorsed the architecture: Kent C. Dodds called the client→server shift “brilliant” for large MCP surfaces , and Armin Ronacher bluntly framed it as “how MCP should work” .
🛠️ TOOLS & MODELS
Cloudflare — Code Mode MCP
-
MCP server exposes only two tools:
searchandexecute. - Claims 99.9% fewer input tokens for context vs equivalent native MCP implementation , using server-side code mode + dynamic worker loader.
- Reading: https://blog.cloudflare.com/code-mode-mcp/
-
MCP server exposes only two tools:
Claude Code v2.1.50 — built-in git worktree isolation (parallel agents without clobbering)
- Built-in git worktree support lands in Claude Code (now CLI + previously Desktop) so agents can run in parallel in the same repo, each in its own worktree .
-
CLI flags:
claude --worktreefor isolation; optionally name worktrees;--tmuxto launch in its own tmux session . - Subagents can also use worktrees for parallel batched changes/migrations (CLI/Desktop/IDE/web/mobile) .
-
Custom agent frontmatter: add
isolation: worktree. - Non-git SCM support (Mercurial/Perforce/SVN) via “worktree hooks” .
- Links: https://git-scm.com/docs/git-worktree • https://claude.com/product/claude-code
Claude Code Desktop — “background CI + PR handling” + app previews
- Desktop can now preview running apps, review code, and handle CI failures + PRs in the background.
- Team says they’ve been dogfooding internally before shipping .
Claude Code Security — limited research preview
- Scans codebases for vulnerabilities and suggests targeted patches for human review, aiming to catch issues traditional tools miss .
- PM claim: powered by Claude Opus 4.6, it found 500+ vulnerabilities in production open-source code (including bugs “hidden for decades”) .
- Rolling out slowly as a research preview for Team + Enterprise customers .
- Links: https://www.anthropic.com/news/claude-code-security • waitlist https://claude.com/contact-sales/security
Model speed + harness notes (useful, but don’t confuse speed with “works”)
- OpenAI: GPT-5.3-Codex-Spark is ~30% faster, now serving 1200+ tokens/sec.
- DHH tested Taalas at https://chatjimmy.ai/ and saw a “simple wiki system” generated in 0.062s at 15,000 tok/sec—but in quick testing it couldn’t produce a functional single-file snake game (no tools/feedback) .
Gemini-in-agents reality check (practitioner view)
- Theo: Cursor’s underrated advantage is that it “tamed Gemini,” calling it the only harness that keeps Google models productive/on-task .
- Theo also complained (re: Gemini 3 Pro) that it “screws up tool calls” despite being “as smart as Opus 4.6” .
💡 WORKFLOWS & TRICKS
Run multiple Claude Code agents in parallel without stepping on each other (worktree pattern)
-
Start isolated sessions:
claude --worktree(optionally name it, or let Claude name it) . -
Optional: add
--tmuxso each session gets its own tmux pane/window . - Desktop alternative: enable worktree mode in the Claude Desktop app Code tab .
- For big migrations: explicitly ask Claude to have subagents use worktrees for parallel work .
-
Make it the default for a custom agent: add
isolation: worktreeto agent frontmatter .
-
Start isolated sessions:
Multi-session hygiene
-
If you’re “multi-clauding”, name each terminal session:
/rename [label].
-
If you’re “multi-clauding”, name each terminal session:
Treat prompt caching like an uptime metric (agent ops)
- Claude Code’s harness is built around prompt caching to reuse computation across roundtrips and cut latency/cost .
- They track prompt cache hit rate with alerts and even declare SEVs if it drops too low .
A concrete “agent does the glue work” integration story (Claude Code + Claude Artifacts)
- Simon Willison integrated multiple external content types into his blog; he says integration projects are exactly what coding agents “really excel at,” and he got most of it done “in a single morning” while multitasking .
- Practical move: he gave Claude Code a link to a raw Markdown README and it generated a brittle-but-acceptable regex parser (acceptable since he controls both source + destination) .
- Claude also handled “tedious UI integration” across page types + his faceted search engine integration .
- Prototyping flow: prompt Claude to analyze the repo models/views , then generate an artifact mockup using repo templates/CSS , then hand off to Claude Code for web to implement .
Repo spelunking shortcut (no local clone)
- Simon Willison tip: regular Claude chat can now clone GitHub repos, letting you ask questions about any public repo or use it as an artifact starting point .
“Skills, not config files” for agent frameworks (Claws/NanoClaw pattern)
-
Karpathy highlighted a configurability approach where integrations are done via skills (example:
/add-telegramtells the AI agent how to modify code to integrate Telegram), versus piling up config files . - He’s also wary of running OpenClaw with private data/keys due to reports of exposed instances, RCE, supply chain poisoning, and malicious/compromised skills in registries .
-
Karpathy highlighted a configurability approach where integrations are done via skills (example:
Codebase learning: prefer interactive maps + Q&A over static interpretation
- swyx recommends using deepwiki codemaps to explore codebases via on-demand Q&A, instead of reading someone else’s narrative interpretation .
Security footgun to assume will happen
-
ThePrimeagen: even with instructions not to read
.env, “somehow… (codex 5.3) finds a way” .
-
ThePrimeagen: even with instructions not to read
Open source etiquette (avoid becoming the next spam wave)
- ThePrimeTime’s maintainer view: drive-by AI PRs are often “utter garbage,” and even good robot PRs can be unwanted because added code is ongoing liability without accountability .
- Simple rule: talk to maintainers before you submit unsolicited PRs—“don’t do that” .
👤 PEOPLE TO WATCH
- Boris Cherny (Anthropic / Claude Code) — shipped: worktree isolation for parallel agents + CLI ergonomics; also pushing Desktop “background CI/PR” iteration loops .
- Simon Willison — best-in-class “agent in a real codebase” writeups + tactical tips like repo cloning in regular Claude chat .
- Kent C. Dodds — practical agent adoption in public: says merging Cursor cloud-agent PRs is starting to feel routine and is actively delegating site work (e.g., admin UI for semantic search) to agents .
- Andrej Karpathy — high-signal framing on “Claws” + clear-eyed security skepticism and a genuinely new “skills-as-config” idea .
- Theo (t3.gg) — sharp harness-level takes (Cursor keeping Gemini on-task) and concrete “one-shot” agent success stories .
- Thariq Shihipar (Claude Code) — real production ops detail: cache hit rate monitoring as SEV-worthy for long-running agent products .
🎬 WATCH & LISTEN
1) Theo — one-shot auth across a monorepo (≈2:40–2:57)
Hook: A clean example of when agents shine—cross-cutting change applied correctly across multiple targets in one pass (web + mobile + backend).
2) Shawn “swyx” Wang — the “magic words” problem (≈70:51–72:16)
Hook: The agent got stuck on LinkedIn bot-blocking; the unlock was systems knowledge (“spoof UA”)—a good reminder that prompting leverage often comes from understanding how computers/services actually work.
3) Forward Future Live — OpenClaw “connects the dots” across workflows (≈7:44–8:20)
Hook: A concrete description of the emergent value in long-running agent systems: automatically linking entities across your CRM + knowledge base without explicit instructions each time.
4) ThePrimeTime — why maintainers don’t want your drive-by AI PRs (≈2:50–3:34)
Hook: Even if the change seems “helpful,” the maintainer inherits the ongoing cost—this is the social layer agent users need to internalize fast.
📊 PROJECTS & REPOS
- Cloudflare Code Mode MCP — architecture + writeup: https://blog.cloudflare.com/code-mode-mcp/
- mitsuhiko/google-workspace-mcp — “based on the same idea” as Code Mode MCP: https://github.com/mitsuhiko/google-workspace-mcp
- NanoClaw — small “Clawdbot” implementation getting attention (Show HN): repo https://github.com/gavrielc/nanoclaw • HN thread https://news.ycombinator.com/item?id=46850205
- simonw/simonwillisonblog “beats” PRs — implementation trail you can study: Beats #592 and Museums importer #595
Editorial take: Today’s edge isn’t “more agent brain” so much as better harness design—minimize integration surfaces (search/execute) , make parallelism safe (worktree isolation) , and operationalize context economics (prompt-cache hit rate as SEV-worthy) .