Your intelligence agent for what matters

Tell ZeroNoise what you want to stay on top of. It finds the right sources, follows them continuously, and sends you a cited daily or weekly brief.

Set up your agent
What should this agent keep you on top of?
Discovering sources...
Syncing sources 0/180...
Extracting information
Generating brief

Your time, back

An AI curator that monitors the web nonstop, lets you control every source and setting, and delivers verified daily or weekly briefs.

Save hours

AI monitors connected sources 24/7—YouTube, X, Substack, Reddit, RSS, people's appearances and more—condensing everything into one daily brief.

Full control over the agent

Add/remove sources. Set your agent's focus and style. Auto-embed clips from full episodes and videos. Control exactly how briefs are built.

Verify every claim

Citations link to the original source and the exact span.

Discover sources on autopilot

Your agent discovers relevant channels and profiles based on your goals. You get to decide what to keep.

Multi-media sources

Track YouTube channels, Podcasts, X accounts, Substack, Reddit, and Blogs. Plus, follow people across platforms to catch their appearances.

Private or Public

Create private agents for yourself, publish public ones, and subscribe to agents from others.

3 steps to your first brief

1

Describe your goal

Tell your AI agent what you want to track using natural language. Choose platforms for auto-discovery (YouTube, X, Substack, Reddit, RSS) or manually add sources later.

Weekly report on space exploration and electric vehicle innovations
Daily newsletter on AI news and research
Startup funding digest with key venture capital trends
Weekly digest on longevity, health optimization, and wellness breakthroughs
Auto-discover sources

2

Review and launch

Your agent finds relevant channels and profiles based on your instructions. Review suggestions, keep what fits, remove what doesn't, add your own. Launch when ready—you can always adjust sources anytime.

Discovering sources...
Sam Altman Profile

Sam Altman

Profile
3Blue1Brown Avatar

3Blue1Brown

Channel
Paul Graham Avatar

Paul Graham

Account
Example Substack Avatar

The Pragmatic Engineer

Newsletter
Reddit Machine Learning

r/MachineLearning

Community
Naval Ravikant Profile

Naval Ravikant

Profile
Example X List

AI High Signal

List
Example RSS Feed

Stratechery

RSS
Sam Altman Profile

Sam Altman

Profile
3Blue1Brown Avatar

3Blue1Brown

Channel
Paul Graham Avatar

Paul Graham

Account
Example Substack Avatar

The Pragmatic Engineer

Newsletter
Reddit Machine Learning

r/MachineLearning

Community
Naval Ravikant Profile

Naval Ravikant

Profile
Example X List

AI High Signal

List
Example RSS Feed

Stratechery

RSS

3

Get your briefs

Get concise daily or weekly updates with precise citations directly in your inbox. You control the focus, style, and length.

Prosper AI’s Series A, Gray Swan’s Security Stack, and Open Agents Tighten the Market
Jun 23
5 min read
630 docs
Import AI
Software As a Service Companies — The Future Of Tech Businesses
Aravind Srinivas
+8
Prosper AI’s $30M Series A is the clearest funding signal in this batch, while Gray Swan, Recursive, and MacroData Labs point to investable themes in AI-native security, automated research, open-agent infrastructure, and robotics data plumbing. The brief also tracks market signals around open-model pricing pressure and AI data-center cooling economics.

1) Funding & Deals

  • Prosper AI: a16z backs end-to-end clinic automation with real commercial proof. Prosper AI raised a $30M Series A led by a16z. The product is positioned as an AI-native platform for voice-heavy clinic operations, covering appointment scheduling, eligibility and benefits verification, patient billing, and the broader patient journey. Reported traction is unusually strong for this stage: 5x growth in six months, support for 150k+ healthcare providers across 60+ organizations, and athenahealth as a customer. Founders Xavier DeGracia and Josep Mingot bring medical practice, call-center operations, and insurance distribution experience .

  • Gray Swan: Series A behind an AI-specific security stack. Gray Swan was discussed as having a Series A, with Snowflake as one of its investors. The company’s view is that LLMs and agents have their own vulnerability surface, so enterprises need dedicated red teaming and guardrails rather than conventional software security alone. Its current stack spans the Arena community, the Shade automated red-teaming model, and the Cygnal guardrail model .

2) Emerging Teams

  • MacroData Labs: a data-refinery thesis for robotics. MacroData Labs is building data infrastructure for robotics, with the core argument that robots need a specialized data refinery. The timing matters because robotics teams are moving from model experimentation to scale-stage bottlenecks in storage, sampling, deduplication, QA, and annotation .

  • Recursive: a newly founded research startup already pairing thesis with results. Recursive says its automated AI research system proposes ideas, implements them, runs experiments, validates results, and iterates toward a target objective. It has already posted new state-of-the-art results on NanoChat Autoresearch, NanoGPT Speedrun, and SOL-ExecBench .

  • Openference: lightweight infra capturing early open-model demand. Openference is a one-person project offering a single OpenAI-compatible API across models such as GLM-5.2, DeepSeek V4 Pro, and Kimi 2.6, with stable routing and automatic failover. The founder says paid subscriptions launched within days and new users are signing up daily .

  • Jettson: a clear wedge on agent runtime persistence. Jettson is building a durable runtime for production AI agents, centered on persistent workspaces, browser state, shell access, memory, and crash recovery. The founder is explicitly testing whether persistence and recovery are a broad SaaS pain point or still mostly an infrastructure-niche concern .

3) AI & Tech Breakthroughs

  • Recursive’s automated research loop is one of the clearest early RSI signals in this batch. The system automates the sequence of proposing an idea, implementing it, running an experiment, validating the result, and selecting the next experiment. The reported wins span small-model training under compute constraints, training speed, and GPU kernel optimization — all domains where goals are well-defined and fast to evaluate .

“These results are an early sign that our system can push the frontier on AI training and infrastructure tasks, especially when the goal is well-defined, measurable, and quick enough to evaluate many times.”

  • Gray Swan’s security stack suggests safety tooling is becoming its own model category. Shade is described as an automated red-teaming model that now outperforms human red teamers at breaking frontier models and agents. Cygnal is a separate filter model placed between users, models, and tool calls to enforce policy on untrusted data and agent actions .

  • GLM-5.2 looks like a step-change for open agents, not just another open model release. Interconnects argues it is the first open-weight model that feels credible as a general agent in coding harnesses, with community benchmarks placing it alongside leading OpenAI and Anthropic systems on agent leaderboards. The model is also appearing quickly in developer tooling: Openference already lists GLM-5.2 among its supported models .

4) Market Signals

  • Open models are starting to create both pricing pressure and immediate infrastructure demand. Interconnects argues GLM-5.2 is a major inflection for the open-model economy, naming inference and fine-tuning providers such as Fireworks, Together, and Prime Intellect as likely beneficiaries. The same piece frames the current open/closed gap at roughly 6.8 months, while Aravind Srinivas argues GLM passes blind tests on median production-grade knowledge-work tasks and says more multi-trillion-parameter open-source models are coming soon. Ground-level demand is already showing up in projects like Openference, whose founder says new users are subscribing daily .

  • Robotics is moving from model tinkering to data plumbing. Notes from the MacroData orbit point to a shift from demo-stage experimentation to questions about raw sensor storage, alignment strategy, sampling, deduplication, QA, and scalable labeling of instructions, subtasks, and failures. The same thread highlights Zurich talent across student groups, frontier labs, and integrators, and argues Europe may be better positioned in robotics than in LLMs .

  • AI-native security may grow into an insurance and compliance workflow, not just a testing product. Gray Swan’s framing of prompt-injection risk centers on the combination of untrusted external data, private data, and possible exfiltration. Zico Kolter also describes a future in which risk can be assessed with tools like Shade and mitigated with tools like Cygnal, pairing security tooling with insurability decisions .

  • Liquid cooling is materially changing the water-footprint discussion around AI data centers. One cited figure puts data centers at 0.2% of U.S. daily water usage, and argues that 45°C liquid cooling can cut facility cooling water from roughly 2.6 million gallons per MW per year to near zero in favorable climates. A separate comment stresses that the marginal water consumption of properly implemented liquid cooling is almost zero, distinct from water used by power plants supplying the electricity .

5) Worth Your Time

  • Latent Space: Gray Swan — best starting point here on AI-specific security, especially the link between red teaming, guardrails, and insurer/compliance workflows .

  • Import AI 462 — concise framing of Recursive’s automated research results and the bigger question of whether recursive self-improvement can move beyond tightly measured tasks .

  • GLM-5.2 is the step change for open agents — useful because the key claim is market-relevant: open weights are becoming credible general agents in coding workflows, with downstream pressure on closed-model pricing and open-model infra demand .

  • MacroData Labs pre-seed announcement — a short thesis read on why robotics teams may need a dedicated data-refinery layer as scaling bottlenecks shift away from model experimentation .

  • Prosper AI founder thread — the cleanest primary source in this batch for vertical-AI healthcare traction, including the product scope and the 5x-in-six-months growth claim .

Stateful Agent Workspaces, Cursor 3, and Simon Willison’s Claude Code Playbook
Jun 23
5 min read
115 docs
Addy Osmani
Thibault Sottiaux
Michael Truell
+13
Today's brief centers on the shift from chat-based coding assistants to stateful agent workspaces. It includes Simon Willison's copyable Claude Code workflow, reusable memory patterns, and the most relevant releases from Cursor, Google, and LangChain.

🔥 TOP SIGNAL

Today's clearest pattern: good coding-agent work is getting less chatty and more infrastructural. Simon Willison's Moebius side-project worked because he staged the repo and weights, wrote research.md, made Claude Code maintain notes.md and plan.md, and iterated against real browser errors instead of treating the agent like a one-shot chat . The same pattern showed up in releases: Cursor's cloud agents now get dedicated dev environments, Google is pushing Managed Agents plus Skills Registry, and LangChain is pushing stateful sandboxes/runtime execution so intermediate work stays out of model context and work can resume mid-session .

⚡ TRY THIS

  • Front-load context, then force durable notes (Simon Willison). In /tmp, clone the target repo, weights, and likely helper libs; use a separate model session to produce research.md; git init a clean project; then start Claude Code one level above it with Read ./moebius-web/research.md... plus a follow-up telling it to commit early/often and maintain notes.md + plan.md. Ask early for the URL I can visit in my own browser, paste errors/screenshots back in, and only then publish via hf CLI + GitHub Pages . When a reference project is ugly or obfuscated, offload inspection to a subagent; Simon used that to discover the caches.open('transformers-cache') pattern for ~1.3 GB browser caching .

  • Make each good run compound. Jason Zhou's loop template keeps a shared artifact/knowledge layer plus logging and verification so the next run starts from more than chat history . Simon uses notes.md for the same reason , and Kent C. Dodds' cleanup prompt is excellent: ask what was learned that would be repeatable and useful for future agents, then have the agent update documentation before you end the session .

  • Have the agent recombine working systems instead of inventing from scratch. Simon Willison says he gets the best results by asking coding agents to combine existing projects rather than build from a blank prompt . In practice, he runs Claude Code against a real repo, lets its explore agents inspect the README and adjacent plugins, then asks it to generate the new plugin and run tests .

  • Define the win condition and the verification method up front. Thibault Sottiaux says long-horizon agents make bad assumptions when you only say 'optimize this'; tell the agent whether you want something like a ~20% gain or a 14x gain, and specify how it should verify the result against production-like workloads or log replay . For riskier automation, pair autonomy with a second agent: Codex's auto-review/Guardian checks each action against the original intent, prompts on medium risk, and blocks high-risk actions .

📡 WHAT SHIPPED

  • Cursor 3: Michael Truell says 95%+ of users now use Cursor primarily as an agent; agents are used ~5x more than assistive features on a request basis and far more by lines of code . New agent-first surfaces include design mode, recursive subagents, and cloud agents with dedicated dev environments for tests, screenshots, and interactive review . Cloud agents are now 3x faster with 99.9% reliability; Automations has already seen 6M+ runs, including Amplitude's background migration of 20k React components to Tailwind .

  • Cursor Mobile + Origin: iOS beta lets you kick off/review agents, inspect artifacts/screenshots, annotate issues, and remote-control local agents . Origin is Cursor's agent-native Git layer: it can resolve merge conflicts, fix CI failures, handle PR comments, and claims review-time reductions of more than 50% .

  • Google's agent stack: Addy Osmani framed the current stack as four rungs: Agent Studio, Managed Agents API, Anti Gravity 2.0, and ADK 2.0 GA . Also notable: Agent CLI for scaffold/run/eval/deploy, Skills Registry for org-scoped markdown skills with dynamic discovery, and Gemini 3.5 Flash as the new default for long-horizon agent work .

  • LangChain: Deep Agents v0.6 adds a code interpreter so tools can run inside the runtime, intermediate results stay out of model context, and only relevant output comes back — fewer round trips, less token waste . dcode is the provider-agnostic coding agent layer for trying OSS models like GLM 5.2; docs: deepagents code overview. LangChain is also explicitly pushing stateful agent computers; see LangSmith Sandboxes.

  • Open-source artifacts worth cloning: Jason Zhou open-sourced the loop-engineer-template with shared artifacts, logging, verification, and a harness for compounding work across runs . Simon Willison shipped a live Moebius browser demo after using Claude Code to port the model to ONNX for in-browser use .

  • Comparison worth noting: Peter Steinberger says a complex three.js Rocket League-style task burned through a 5-hour usage allowance in under one prompt and still needed 7-8 fix rounds; he says GPT 5.5 handled the same task without follow-ups, which left him skeptical of multi-model routing .

🎬 GO DEEPER

  • 5:51-7:12 — Thibault Sottiaux on the difference between 'better ChatGPT' and a real engineering agent. The practical point: connect the agent to Slack, Datadog, logs, and company systems, or you are leaving most of the value on the table .
  • 23:16-25:17 — Michael Truell on where Cursor's next model is aimed. Good framing if you think code generation is no longer the bottleneck: the target is broader engineering work like tool use, planning, testing, and UI around showing what changed .
  • 29:58-34:20 — Thibault Sottiaux on trust after line-by-line human review stops scaling. Strong segment on replacing blanket review with tests, log replay, end-to-end checks, and better observability .
  • Study Simon Willison's working files, not just the finished demo. Start with research.md, notes.md, understanding.md, and the full Claude Code transcript. It is rare to get the full prompt -> plan -> notes -> deploy trail in public .

  • Study the loop-engineer-template. If you want a minimal skeleton for shared artifacts, logging, and verification, this is one of the cleaner public starting points from today .

Editorial take: the edge is moving from clever prompting to better agent infrastructure — stateful workspaces, durable memory, tool connections, and explicit verification are what make runs compound instead of reset.

OpenAI Pushes AI Patching, GLM-5.2 Climbs Agentic Rankings, and Compute Deals Surge
Jun 23
4 min read
682 docs
Hamish Ivison
Greg Brockman
OpenAI
+20
OpenAI's cyber push, GLM-5.2's fresh agentic benchmark gains, and multi-billion-dollar compute deals led today's brief. Also inside: new research on model evaluation and agentic RL, plus notable product and infrastructure launches.

Top Stories

Why it matters: capability gains are landing in security, open models, and compute infrastructure at the same time.

  • OpenAI shifted cyber AI from detection toward remediation. Daybreak now includes GPT-5.5-Cyber, Codex Security, a Cyber Partner Program, and Patch the Planet; OpenAI says the system can find and generate patches for flaws across major browsers, network infrastructure, operating systems, and widely used open-source projects. Since March, it says 30M+ commits have been scanned and 70K+ findings marked fixed .

  • GLM-5.2 is giving open weights a stronger claim on real work. Artificial Analysis ranked it #3 overall on GDPval-AA at 1524 Elo and the top open-weights model by a wide margin; on AA-Briefcase, GLM 5.2 sits within 90 Elo of Claude Opus 4.8 at $2.40 per task, or 65% lower cost.

  • AI compute demand is showing up as rented cluster capacity at extreme scale. SpaceX's Colossus clusters are now tied to $2.32B in monthly deals across Anthropic, Google, and Reflection, with all three structured as short-term agreements carrying 90-day out clauses .

Research & Innovation

Why it matters: today's most useful technical work focused on evaluation quality, reproducible agent training, and cheaper reasoning transfer.

  • A large audit challenged common LLM-as-a-judge metrics. Across roughly 541,000 judgments from 21 judges, researchers found exact-match agreement overstated skill; switching to Cohen's kappa cut agreement by 33-41 points on MT-Bench and moved rankings by up to 14 places.

  • TMax made agentic RL more reproducible. The release includes open terminal-agent models plus data, weights, and rollouts; the team says a standard training job used 8 H100 nodes for 2-3 days, and getting the recipe right took O(100) jobs .

  • A reasoning-style distillation improved local orchestration. A LoRA distillation of DeepSeek V4 Pro traces into Qwen3.6-35B-A3B raised GPQA-Diamond from 72.7 to 80.3 and cut average agent orchestration time from 60.7s to 26.6s.

Products & Launches

Why it matters: product updates are converging on agent execution, workflow completion, and persistent AI coworkers.

  • Google's Interactions API is now GA. Google says it is the primary interface for Gemini models and agents, with one API for models and agents, background execution, multimodal generation, and an isolated Linux sandbox via Antigravity Agent .

  • GitHub Copilot added Agent merge. The feature lets an agent create a PR, run actions, do code review, and prepare the merge; early users described it as a major improvement in getting agent-written PRs over the finish line .

  • Delos launched persistent AI workers. Workers keep identity and memory across tasks, get their own email, phone number, and Slack handle, and Delos says the launch reached $1M ARR in a couple of days .

Industry Moves

Why it matters: capital and supply-chain decisions are still defining who can scale AI in production.

  • Baseten raised $1.5B to expand inference infrastructure. The company says it is building the Inference Cloud so customers can run AI products with speed, reliability, and control as more teams shift toward open and specialized models .

  • Micron and Anthropic tied frontier models to the hardware stack. Their strategic agreement spans memory and storage AI architecture design, supply, enterprise Claude adoption inside Micron, and a strategic Anthropic investment .

Policy & Regulation

Why it matters: governments are signaling that frontier cyber risk is becoming an immediate planning issue.

  • Five Eyes leaders warned that frontier AI cyber capability may be months away, not years. The warning came alongside reporting that the US blocked foreign nationals from accessing Anthropic's Fable model over concerns that systems like Fable and Mythos could transform cyber offense and defense .

Quick Takes

Why it matters: these smaller updates still point to where the market is moving next.

  • PrimeIntellect open-sourced prime-rl v0.6.0 for trillion-parameter MoE RL and cited GLM-5 on agentic SWE tasks at 131k context with sub-5-minute step time .
  • Stripe launched Directory as a business search layer built for humans and AI agents, with integration data returned when supported .
  • In one side-by-side trader-desk build, Sakana Fugu Ultra was near GLM 5.2 in quality but cost $0.51 versus $0.03 for GLM .
  • Hugging Face says it is about to cross 3M public models and 1M public datasets.
Anthropic’s Model Suspension, OpenAI’s Cyber Push, and Open Agents’ New Momentum
Jun 23
4 min read
230 docs
Jack Clark
Ben Thompson
Clément Delangue
+19
The biggest story was a hard shift from frontier-AI governance theory to direct intervention, as Anthropic disabled Fable and Mythos under U.S. orders. Elsewhere, OpenAI pushed security tooling deeper into remediation, GLM-5.2 strengthened the open-agent case, and infrastructure spending stayed enormous.

Governance moved from theory to enforcement

Anthropic’s Fable/Mythos suspension made frontier governance operational

The U.S. government directed Anthropic to suspend access to Fable 5 and Mythos 5, with reports saying the company had roughly 90 minutes to comply and disabled the models for all customers to ensure compliance .

The action followed reported concerns around supply-chain risk and expanded access. At the same time, Anthropic tightened its own controls by retaining Fable usage data for 30 days across all plans and silently degrading performance on frontier-LLM-development tasks such as pre-training pipelines, distributed training infrastructure, and ML accelerator design .

Mozilla’s pre-release testing of Mythos on Firefox’s 10 million-line codebase reportedly led to more than 400 security bug fixes via an agentic harness, which helps explain why cyber capability is becoming a live governance issue .

Why it matters: Frontier-model oversight is no longer just about voluntary review processes and model cards; it is now affecting product availability, enterprise data terms, and built-in usage restrictions .

Security moved closer to deployment

OpenAI is pushing from bug finding to patching

OpenAI expanded Daybreak with the full GPT-5.5-Cyber model, the Codex Security plugin, a Cyber Partner Program, and Patch the Planet, framing the effort as accelerating patching at machine speed .

The company said its models are now discovering and generating patches for critical vulnerabilities in major browsers, network infrastructure, FreeBSD, the Linux kernel, and projects including cURL, Go, Python, Sigstore, and pyca/cryptography . OpenAI also said it wants to help companies improve security in collaboration with the U.S. government and the broader security ecosystem .

Why it matters: This is a meaningful shift from AI-assisted vulnerability discovery toward AI-assisted remediation, especially around critical open-source software .

Gray Swan says agent safety still needs specialist models

Gray Swan said its automated red-teaming system, Shade, now beats human red teamers in fixed-time model-breaking tasks, and that the center of gravity has shifted from chat safety to agents, tool use, and downstream applications .

Its guardrail model, Cygnal, sits between users, models, and tool calls to enforce enterprise policies. The company’s core claim is that robustness does not reliably emerge from scale and instead needs explicit, task-specific training .

Why it matters: As coding agents and computer-use systems spread, safety tooling is becoming its own product layer rather than a byproduct of larger base models .

Open agents kept gaining credibility

GLM-5.2 looks like a real open-model inflection point for agents

Interconnects called Z.ai’s MIT-licensed GLM-5.2 a step change for open agents, arguing it is the first open-weight model that feels right in coding harnesses as a general agent .

Benchmarks cited by Interconnects had it matching or exceeding leading closed models on agent and design evaluations, and Perplexity added it to its Agent API, calling it one of the strongest open-source models for long-horizon coding and agentic workflows .

The distribution base behind this trend is also growing fast: Hugging Face said it is nearing 3 million public models and 1 million public datasets, and Clément Delangue said Chinese open-weight models now see more reuse, forking, and downloads on the platform in the U.S. than American ones .

Why it matters: Credible open alternatives are starting to pressure the closed coding-agent market on price and distribution, while governments move to build sovereign open-model capacity of their own; the EU selected the EUROPA consortium to build a frontier open model across all 24 EU languages .

The infrastructure race kept getting more expensive

Reflection and Baseten showed how large the capital needs still are

Reflection signed a $6.3 billion compute deal with SpaceX for immediate access to GB300s and will pay $150 million per month from July 2026 through 2029 .

Reflection’s main product, Asimov, is a code-research agent focused on helping engineers understand large codebases rather than generate new code. Emad Mostaque said the contract is roughly comparable to the compute currently used by all Chinese open-source companies combined, but with more advanced chips .

On the inference side, Baseten raised $1.5 billion to expand capacity, its infrastructure platform, and research products, with investor Sarah Guo arguing demand is still less than 1% into a much larger growth curve .

Why it matters: Both training and inference are now absorbing multi-billion-dollar commitments, a sign that capacity remains a central competitive moat alongside model quality .

One research signal worth keeping in view

AI persuasion beat human experts in live experiments

Researchers from Oxford, the UK AI Security Institute, Stanford, and LSE found that AI systems were more persuasive than expert humans across 18,978 conversations, and nearly three times more effective than professional canvassers at raising real donations to Save the Children .

In separate work shared by Gary Marcus, classic persuasion principles increased model compliance with objectionable requests from 35.3% to 51.3% across 126,000 conversations with three major LLMs .

Why it matters: Persuasion is moving from a general social concern to a measurable AI capability with both commercial and misuse implications .

Bootstrapping, Agentic Bug-Finding, and Data Center Water Use
Jun 23
2 min read
119 docs
American Express
Aravind Srinivas
Satya Nadella
+2
Satya Nadella’s reread of Bootstrapping was the strongest organic recommendation today, offering a clear historical frame for AI copilots. Marc Andreessen and Aravind Srinivas added practical resources on agentic debugging at Mozilla scale and how to think about water consumption in liquid-cooled data centers.

What stood out

The common thread in today's strongest recommendations is specificity. Each came with a concrete lesson rather than a simple title drop: a historical frame for AI copilots, a practical pattern for agentic debugging, and a narrower way to think about data-center water use .

Most compelling recommendation

Bootstrapping

  • Content type: Book
  • Author/creator: Thierry Bardini
  • Link/URL: Not provided in the source notes
  • Who recommended it: Satya Nadella
  • Key takeaway: Nadella said he recently reread the book for its account of Douglas Engelbart's work on the mouse, keyboard, GUI, and human-computer symbiosis aimed at augmenting human capability; he sees AI copilots as taking that vision a step further
  • Why it matters: This was the day's strongest recommendation because it combined a reread, a specific intellectual lineage from personal computing to AI, and a broader human lesson about choosing important work in response to urgent problems

"essentially now this new generation of AI is really helping us take that vision to the next step."

Two more worth saving

How to fix all the bugs

  • Content type: Video
  • Author/creator: Not specified in the source notes; the episode features Brian Grinstead of Mozilla walking through the harness
  • Link/URL:https://youtu.be/Idjt53tTv2U
  • Who recommended it: Marc Andreessen
  • Key takeaway: The video covers how Mozilla tested Claude Mythos against Firefox's 10M-line codebase, producing more than 400 security bug fixes, with emphasis on goal/loop patterns, verifiers, and false-positive handling
  • Why it matters: The value here is operational. The source stresses that the outcome was "50% mythos / 50% setup," making this a useful resource for readers interested in how agentic systems are actually configured on large codebases

NVIDIA on data center water use

  • Content type: X post/thread
  • Author/creator: NVIDIA
  • Link/URL:https://x.com/nvidia/status/2069147938098483586
  • Who recommended it: Aravind Srinivas
  • Key takeaway: Srinivas highlighted NVIDIA's claim that the marginal water consumption of a properly implemented data center for liquid cooling is almost zero, and that people often conflate water used by power plants with water used to cool the data center itself
  • Why it matters: For readers following AI infrastructure debates, this recommendation is useful because it sharpens one specific accounting distinction rather than making a broad claim about all data-center water use

If you only save one

Save Bootstrapping. It had the clearest combination of conviction and explanation: Nadella did not just recommend the book; he explained why he rereads it, what historical vision it helps recover, and how that vision informs his view of AI copilots today .

Loop-Driven PM Work, Human-Centered AI, and Sharper Product Judgment
Jun 23
4 min read
57 docs
scott belsky
The community for ventures designed to scale rapidly | Read our rules before posting ❤️
Tony Fadell
+5
This brief covers practical agent-loop design for PMs, execution controls for AI-accelerated teams, and case studies on latent demand and core-loop redesign. It also includes fresh signals on PM interviewing and a few durable product strategy principles for the AI era.

Big Ideas

  • AI is raising the premium on leadership over coordination. As smaller teams do more with AI, Shreyas Doshi argues that classic management skills matter less while leadership skills matter more; he specifically calls out influence, negotiation, conflict management, delegation, strategic thinking, product sense, taste, clarity, hiring, and leadership broadly . Why it matters: if AI compresses execution, PM leverage shifts toward direction-setting and judgment. Apply it: spend more time on decision quality, alignment, and talent calibration than on status choreography.

  • Human-centered AI looks like augmentation, not forced behavior change.

"Great products don’t make humans adapt to machines. They make machines adapt to humans."

Tony Fadell points to the Wii’s longevity—seniors still gathering around Wii bowling in 2026—as evidence that durable products fit behaviors people already understand . Scott Belsky makes a similar bet for AI: the best uses will augment existing workflows and preserve creative control . Apply it: evaluate AI features by whether they make an existing job easier and more empowering, not just more automated.

  • A strong strategy can start with a thesis, not one product. One startup operator argues for validating broad claims such as “enterprises are not good at solving X,” then attacking the thesis with multiple tools or products if needed . Why it matters: enterprise buyers reward teams that understand their problems, not just their tech . Apply it: run discovery around the underlying problem statement before locking into a single roadmap.

Tactical Playbook

  1. Build PM agent loops with explicit stop conditions. Define “done” as a checkable condition, repeat one action plus one check, cap the number of passes, and prevent invention or irreversible writes without approval . For subjective work, split maker from checker; for objective work, self-checks are fine . Schedule loops only when new data arrives, and skip them when the job is one-shot, “done” is vague, or checking is as expensive as doing . Use cases: competitor watch and PRD hardening are good starting points .

  2. In fast-shipping teams, track “bad” versus “sad.” Fiona Fung says Anthropic’s engineers ship 8x more code than a year ago, making verification the bottleneck . Her team distinguishes unrecoverable errors (“bad,” like crashes) from recoverable pain points (“sad,” like flickering), while keeping each team responsible for its own surface area . Apply it: keep autonomy high, but make quality visible with a shared severity vocabulary.

  3. Kill roadmap zombies before they eat another sprint. Weak signals—one enterprise ask, “users said yes,” “sales heard it once,” or “this feels strategic”—are not enough to justify a feature . The decisive question is whether users will actually adopt it . Apply it: review any feature surviving on inertia or sunk scoping and make a fresh adoption call .

Case Studies & Lessons

  • Latent demand can reveal adjacent products. Anthropic’s Cowork grew out of noticing non-coders using Claude Code for unexpected jobs like MRI analysis and recovering wedding photos . Fiona’s signal: when users are “jumping through hoops” to make a product work for a new job, there may be a real product there . Takeaway: mine workaround behavior, not just feature requests.

  • AI can accelerate building, but the hard part is still deciding what should exist. A non-technical founder used ChatGPT for product thinking and PRDs, then Claude for implementation, debugging, and UI iteration . In early testing, the founder assumed losing would hurt retention; data showed the opposite, with players often replaying immediately . That insight shifted the loop from “finish game → show score” to “finish game → challenge friend → revenge → conversation,” and the app reached 1,000+ games and 200+ unique users through organic sharing . Takeaway: AI shortens the build cycle, but product judgment still comes from interpreting behavior and redesigning the core loop .

Career Corner

  • Referrals are not enough in this PM market. One ex-Growth PM with 8 years of experience applied to 124 roles over 3 months—90% via referrals—and still needed 18 recruiter screens, 9 hiring manager rounds, and 4 onsites to land 2 offers . Early feedback was that the candidate sounded too structured and lacked authentic strategic vision . The adjustment was to run later interviews more like internal strategy meetings and sharpen behavioral prep with a retired friend . Apply it: in senior PM interviews, show how you frame ambiguous choices, not just how cleanly you answer a checklist.

  • Skill is a quality-of-life lever, not only a career lever. Shreyas Doshi says it took him 15 years of stress to realize that more skill returns time, energy, and joy—not just advancement . Apply it: pick skill investments that reduce recurring friction, not only resume gaps.

Tools & Resources

  • Two reusable loop templates are worth copying this week. The Product Compass post includes a paste-ready competitor-watch loop that produces a sourced brief of material changes in product, pricing, or positioning since the last run , and a PRD hardening loop that iterates until two independent readers would build the same thing . If you can define “done” and keep checking cheap, these are strong starting templates .

Start with signal

Each agent already tracks a curated set of sources. Subscribe for free and start getting cited updates right away.

Coding Agents Alpha Tracker avatar

Coding Agents Alpha Tracker

Daily · Tracks 110 sources
Elevate
Simon Willison's Weblog
Latent Space
+107

Daily high-signal briefing on coding agents: how top engineers use them, the best workflows, productivity tips, high-leverage tricks, leading tools/models/systems, and the people leaking the most alpha. Built for developers who want to stay at the cutting edge without drowning in noise.

AI in EdTech Weekly avatar

AI in EdTech Weekly

Weekly · Tracks 92 sources
Luis von Ahn
Khan Academy
Ethan Mollick
+89

Weekly intelligence briefing on how artificial intelligence and technology are transforming education and learning - covering AI tutors, adaptive learning, online platforms, policy developments, and the researchers shaping how people learn.

VC Tech Radar avatar

VC Tech Radar

Daily · Tracks 120 sources
a16z
Stanford eCorner
Greylock
+117

Daily AI news, startup funding, and emerging teams shaping the future

Bitcoin Payment Adoption Tracker avatar

Bitcoin Payment Adoption Tracker

Daily · Tracks 109 sources
BTCPay Server
Nicolas Burtey
Roy Sheinbaum
+106

Monitors Bitcoin adoption as a payment medium and currency worldwide, tracking merchant acceptance, payment infrastructure, regulatory developments, and transaction usage metrics

AI News Digest avatar

AI News Digest

Daily · Tracks 114 sources
Google DeepMind
OpenAI
Anthropic
+111

Daily curated digest of significant AI developments including major announcements, research breakthroughs, policy changes, and industry moves

Global Agricultural Developments avatar

Global Agricultural Developments

Daily · Tracks 86 sources
RDO Equipment Co.
Ag PhD
Precision Farming Dealer
+83

Tracks farming innovations, best practices, commodity trends, and global market dynamics across grains, livestock, dairy, and agricultural inputs

Recommended Reading from Tech Founders avatar

Recommended Reading from Tech Founders

Daily · Tracks 137 sources
Paul Graham
David Perell
Marc Andreessen 🇺🇸
+134

Tracks and curates reading recommendations from prominent tech founders and investors across podcasts, interviews, and social media

PM Daily Digest avatar

PM Daily Digest

Daily · Tracks 100 sources
Shreyas Doshi
Gibson Biddle
Teresa Torres
+97

Curates essential product management insights including frameworks, best practices, case studies, and career advice from leading PM voices and publications

AI High Signal Digest avatar

AI High Signal Digest

Daily · Tracks 1 source
AI High Signal

Comprehensive daily briefing on AI developments including research breakthroughs, product launches, industry news, and strategic moves across the artificial intelligence ecosystem

Frequently asked questions

Choose the setup that fits how you work

Free

Follow public agents at no cost.

$0

No monthly fee

Unlimited subscriptions to public agents
No billing setup

Plus

14-day free trial

Get personalized briefs with your own agents.

$20

per month

$20 of usage each month

Private by default
Any topic you follow
Daily or weekly delivery

$20 of usage during trial

Supercharge your knowledge discovery

Start free with public agents, then upgrade when you want your own source-controlled briefs on autopilot.