Your intelligence agent for what matters

Tell ZeroNoise what you want to stay on top of. It finds the right sources, follows them continuously, and sends you a cited daily or weekly brief.

Set up your agent
What should this agent keep you on top of?
Discovering sources...
Syncing sources 0/180...
Extracting information
Generating brief

Your time, back

An AI curator that monitors the web nonstop, lets you control every source and setting, and delivers verified daily or weekly briefs.

Save hours

AI monitors connected sources 24/7—YouTube, X, Substack, Reddit, RSS, people's appearances and more—condensing everything into one daily brief.

Full control over the agent

Add/remove sources. Set your agent's focus and style. Auto-embed clips from full episodes and videos. Control exactly how briefs are built.

Verify every claim

Citations link to the original source and the exact span.

Discover sources on autopilot

Your agent discovers relevant channels and profiles based on your goals. You get to decide what to keep.

Multi-media sources

Track YouTube channels, Podcasts, X accounts, Substack, Reddit, and Blogs. Plus, follow people across platforms to catch their appearances.

Private or Public

Create private agents for yourself, publish public ones, and subscribe to agents from others.

3 steps to your first brief

1

Describe your goal

Tell your AI agent what you want to track using natural language. Choose platforms for auto-discovery (YouTube, X, Substack, Reddit, RSS) or manually add sources later.

Weekly report on space exploration and electric vehicle innovations
Daily newsletter on AI news and research
Startup funding digest with key venture capital trends
Weekly digest on longevity, health optimization, and wellness breakthroughs
Auto-discover sources

2

Review and launch

Your agent finds relevant channels and profiles based on your instructions. Review suggestions, keep what fits, remove what doesn't, add your own. Launch when ready—you can always adjust sources anytime.

Discovering sources...
Sam Altman Profile

Sam Altman

Profile
3Blue1Brown Avatar

3Blue1Brown

Channel
Paul Graham Avatar

Paul Graham

Account
Example Substack Avatar

The Pragmatic Engineer

Newsletter
Reddit Machine Learning

r/MachineLearning

Community
Naval Ravikant Profile

Naval Ravikant

Profile
Example X List

AI High Signal

List
Example RSS Feed

Stratechery

RSS
Sam Altman Profile

Sam Altman

Profile
3Blue1Brown Avatar

3Blue1Brown

Channel
Paul Graham Avatar

Paul Graham

Account
Example Substack Avatar

The Pragmatic Engineer

Newsletter
Reddit Machine Learning

r/MachineLearning

Community
Naval Ravikant Profile

Naval Ravikant

Profile
Example X List

AI High Signal

List
Example RSS Feed

Stratechery

RSS

3

Get your briefs

Get concise daily or weekly updates with precise citations directly in your inbox. You control the focus, style, and length.

Agent-Centered Dev Loops Overtake One-Off Prompting
May 27
5 min read
108 docs
Boris Cherny
Salvatore Sanfilippo
Romain Huet
+12
Top practitioners are reorganizing their dev workflow around agents instead of treating them as sidekicks. Inside: copyable Codex and Claude patterns, DeepSWE's benchmark launch, and the clips/repos most worth studying.

🔥 TOP SIGNAL

  • Stop bolting agents onto the old workflow; put them at the center. Boris Cherny says he ships 20-30 PRs/day by running five Claude agents in parallel, has not written a line of code in over six months, and often leaves hundreds or even thousands of agents running 5-20 hours overnight; he also describes Claude as being at the center of everything inside Anthropic, where the terminal prototype spread quickly across engineering . Romain Huet shows the same operating model in Codex: a plain-English 7am prompt becomes a pinned automation that pulls Slack, Gmail, and calendar context, surfaces top priorities, and drafts emails, while Greg Brockman shared Codex analyzing Slack history and reorganizing channels via computer use into a reusable automation .

⚡ TRY THIS

  • Build a pinned 7am operating brief in Codex (Romain Huet). Start by connecting the tools you use every day; Codex has 120+ plugins packaging skills, MCPs, and direct access to tools like Slack, Gmail, and calendar . Then use Huet's pattern: every morning at 7am, pull all of the context from my Slack, from my Gmail, from my calendar. Give me a full brief of my day, my top five priorities. Point me to what's most critical and urgent ... and draft some of these emails. Codex turns that into a scheduled daily automation, and you can pin the thread so the same briefing surface is waiting every morning .

  • Add a pre-merge review agent before every PR lands (Peter Steinberger). Steinberger says autoreview has been the most impactful skill in his stack after crabbox.sh: it automatically reviews code before landing a PR, finds edge cases, and sometimes runs for hours . If you want a concrete starting point, study the skill spec and port the pattern into your own setup: https://github.com/openclaw/agent-skills/blob/main/skills/autoreview/SKILL.md.

  • Tune the operating context before you blame the model. Theo says GPT-5.5 only clicked for him after about two months, prompting entirely differently, and spending time on agents.md; now he says he cannot really use another model for code . Practical routing on top of that: Huet defaults Codex/GPT-5.5 to Medium for most tasks and only uses Extra High for one-off critical refactors, while Matthew Berman's rule is to use frontier models for hardcore upfront planning and cheaper workhorse models for the actual code writing .

  • Close the loop on agent fixes. Palash Shah describes self-optimizing agents as systems that observe their own outputs, evaluate them, and use that signal to improve future performance . The concrete LangSmith Engine recipe: auto-triage feedback on traces, attach an online evaluator to every suggested fix so you do not regress, generate offline evals for your test suite, and keep learning from user preferences over time .

📡 WHAT SHIPPED

  • DeepSWE benchmark launched. Theo calls it the first coding benchmark that actually aligns with how it feels to use these models for coding, and says it exposes much larger divergence than public leaderboards suggest. His read: the gap between official harnesses and simple agent harnesses says a lot, and one comparison showed Gemini 3.5 Flash costing more than GPT-5.5 while scoring about half as high .

  • Field test from Salvatore Sanfilippo. The Redis creator says GPT-5.5 implemented kernels for his MIMO inference system from directions he provided; compared with his handwritten namasi++ version, the AI-generated implementation ran 2x faster and did not crash past larger context sizes. His current model is mixed: full AI implementation on some projects, line-by-line assist on others, and small manual programs reserved for aesthetics, learning, and new ideas .

  • Codex keeps adding practical surface area. Huet says the app now has 120+ plugins, sandboxed auto review with permission escalation, local or cloud projects, an in-app browser for pointing at UI changes, and a Codex tab inside the ChatGPT app that launched last week for continuing desktop work from the phone . Theo's recent feature callouts: locked-Mac computer control, a double-Command hotkey to capture the current screen into context, and diff marker settings . Adoption signal: Simon Willison highlighted @openai/codex NPM installs growing from about 100,000/day in January to over 1 million/day now .

  • Rastermill released for Node agents. Peter Steinberger extracted image logic into a separate library to stop small hacked images from crashing the process; it is built with Wasm and Rust for speed. Link: http://rastermill.com.

  • Undocumented-feature field report. Theo says he added a new Lakebed feature without announcing it; by morning, Sherlock's agent had discovered it and shipped a curation app on top of it, which Theo says was used perfectly to spec. App: https://badlogic-list.lakebed.app/ with RSS included .

🎬 GO DEEPER

  • 16:12-18:29 — Romain Huet on background computer use. Best clip today for understanding parallel desktop work: Codex clicks through a simulator, checks for obvious issues, and can create a Reminder without taking over the user's cursor .
  • 22:25-24:48 — Theo on verification vs token burn. Good watch if you are choosing between local-machine verification and cloud swarms: his core argument is that computer-use verification in a real environment can beat simply spinning up more sub-agents and burning tokens .
  • 27:54-28:29 — Boris Cherny on making Claude the default interface. Short but important: codebase questions, expenses, and even holiday lookups all route through Claude when the workflow is redesigned around the agent .

Editorial take: the edge is moving from model shopping to workflow design — persistent context, pre-merge review, and self-evaluating loops beat raw token burn.

Perceptic’s $12M Biopharma Round, Lucis’ Series A, and AI’s Shift Toward Trust Layers
May 27
5 min read
799 docs
r/SideProject - A community for sharing side projects
Serena Ge (Datacurve)
Eric Wu
+9
Two health-focused AI financings lead this brief, alongside early traction in agent infrastructure, enterprise workflow automation, and regulated operations. The broader pattern is AI moving toward trusted deployment in compliance and business systems while compute, memory, and inference speed remain hard constraints.

Funding & Deals

  • Perceptic announced a $12M round led by Air Street and Accel, with participation from Elder Gull and angels from AI labs, to operationalize frontier AI systems for biopharma . The founding team includes CEO Tilman after a seven-year stint at Palantir, plus Palantir AIP veterans Martin Copes and Zaki Trache . The company says it is already live in top-20 pharma accounts speeding up critical workflows, and Nathan Benaich framed the timing around biopharma’s increased appetite for frontier AI and AI labs turning more attention toward science .
  • Lucis raised a $20M Series A to expand its AI-driven preventive health platform in Europe . YC says the company has served 10,000+ customers across France, the UK, Ireland, and Portugal, delivered 1M+ biomarker tests, and that 75% of users who completed a six-month follow-up improved three or more markers without medication .

Emerging Teams

  • Superset is an open-source IDE for developers to run hundreds of agents in parallel. YC says it has grown 30% week over week for the last four months and is helping engineers ship 10x more PRs.
  • Appnigma is attacking Salesforce implementation pain by generating native managed packages from natural language. The founders say it compresses 3-6 months of development into days and outputs 100% native Salesforce metadata. They are ex-Salesforce, including 3.5 years on the AppExchange review team, and say customers include Pylon, Warmly, UserEvidence, and roughly 85% YC companies.
  • Alchemize is building an AI-native customs brokerage that promises importers real-time regulatory clarity and shipment clearance in minutes instead of days. YC highlighted the launch from founders Samuel and Robert.
  • NavigateAI gives Eric Wu a new wedge in labor-constrained physical work. The company says it wants to give every field worker an AI copilot to address skilled-worker shortages so crews can build faster and better. Keith Rabois amplified the launch, calling it “Super cool” and emphasizing faster, lower-cost building .

AI & Tech Breakthroughs

  • DeepSWE is positioned as a new benchmark for agentic coding. Its creators say it shows where top models actually diverge in realistic developer workflows, and Garry Tan called it the new standard for engineering evals.
  • GBrain + ActiveGraphAI outline a more auditable agent architecture. The system pairs a durable markdown/git knowledge substrate with an event-sourced runtime, then wraps operations as typed events, projects retrieved evidence into graph state with stable citation tokens, adds an explicit propose/approve/apply flow, and forks before mutation so agents can trace recommendations to specific evidence and runs .
  • StableBrowse is a concrete attempt to improve agent browsing economics. YC says its browser layer lets agents navigate the web with 70% fewer tokens and 3-4x faster execution.
  • YourMemory is a notable open-source experiment in persistent AI memory. Its author says the system adds time-aware retrieval and memory decay based on a modified Ebbinghaus forgetting curve, tested it on LongMemEval, and runs it locally with a visualization dashboard .

Market Signals

  • Compute still looks supply-constrained rather than speculative. In a 20VC interview, Cerebras’ CEO said data centers cannot be built fast enough, the company has a $25B backlog, and Nvidia, AMD, and others also have backlogs . He argued the industry is building behind demand, not ahead of it . In the same interview, he said HBM shortages could last several years because fab capacity is lumpy and slow to add, and that electricity may become the longer-term limiting factor .
  • Compliance is starting to look like a real AI software category. a16z says AI may finally be moving from good enough to pilot to good enough to trust in compliance, noting that many LLMs now score 80-100% on LegalBench’s 162 legal reasoning tasks. The firm argues compliance is essentially applied legal reasoning and affects every dollar moving through a business .

“As AI clears the ‘good enough to trust’ bar and sales cycles speed up, there may finally be an opening for startups.”

  • AI is beginning to replace legacy SaaS, not just sit on top of it. Harry Stebbings highlighted a CEO saying the company replaced a $600K Salesforce contract with a vibe-coded CRM built in three weeks, expects to eliminate 80% of its internal SaaS stack, and would not change Anthropic usage even if pricing doubled .
  • On-device inference is moving from theory to production design choice in B2B SaaS. A founding team at an on-device AI infrastructure company says its system handles 60-80% of requests locally with cloud fallback, especially for bounded tasks like transcription, summarization, and classification on modern hardware .
  • Agent governance is hardening into an infrastructure layer. GBrain + ActiveGraphAI describe replay, stable citations, controlled write-back, and fork-before-mutation workflows . Synapsor is built around governed memory, staged writes, replay, permissions, and audit trails for agents touching business systems . Nexus Synapse makes a similar argument that runtime governance, not just the model, is the missing layer around AI systems .

Worth Your Time

  • 20VC x Cerebras CEO: useful primary source on AI infrastructure bottlenecks, HBM scarcity, and the argument that slow inference has no real market. YouTube interview
  • a16z on compliance: useful if you are mapping where AI may move from pilots into trusted production software. Essay
  • DeepSWE release thread: worth reviewing if you diligence coding agents or developer infra and want a benchmark tied to realistic workflows. X thread
  • Superset launch page: useful background on an open-source IDE that YC says has grown 30% week over week for four months. YC launch
  • GBrain x ActiveGraphAI thread: useful diligence material if you care about evidence-linked, replayable agent systems. X thread
Maker Time, Time-Tested Product Books, and AI Context Reads
May 27
5 min read
169 docs
Bill Gurley
Tim Ferriss
Palmer Luckey
+5
Tim Ferriss' endorsement of Paul Graham's maker-schedule essay was the clearest actionable pick. The rest of the brief distills Lenny Rachitsky's time-tested book map for product builders and four additional reads on AI geopolitics, consciousness, classical skepticism about writing, and the origins of internet resilience.

What stood out

The strongest single recommendation was Paul Graham's Maker's Schedule, Manager's Schedule. Tim Ferriss tied it to a precise operating rule: creative work needs uninterrupted 3-5 hour blocks, plus enough slack for multi-day synthesis, and he protects 3-4 mornings per week in maker mode until at least 1pm .

The other major signal was Lenny Rachitsky's book list for product builders. He organized it by job-to-be-done, limited himself to books he had completed, and mostly favored books more than 10 years old because they had stood the test of time .

Most compelling recommendation

Maker's Schedule, Manager's Schedule

  • Content type: Essay
  • Author/creator: Paul Graham
  • Link/URL:https://www.paulgraham.com/makerschedule.html
  • Who recommended it: Tim Ferriss
  • Key takeaway: Ferriss said great creative work is hard to do in fragmented 30-45 minute windows; it needs uninterrupted 3-5 hour blocks and enough slack for multi-day "CPU-intensive synthesis"
  • Why it matters: This was the clearest recommendation in the batch because it came with a concrete scheduling practice he already uses: 3-4 maker mornings per week until at least 1pm

"Large, uninterrupted blocks of time—3-5 hours minimum—create the space needed to find and connect the dots."

A compact starting stack from Lenny Rachitsky's broader book list

Lenny's full list spans multiple categories, from communication skills to sales and marketing. If you want a smaller starting point, these nine titles give you one book per category from that broader map .

Four side paths worth saving

  • US/China AI race articleContent type: Article; Author/creator: not specified in the source; Link/URL:https://chinai.substack.com/p/anthropics-dogmatic-views-on-us-china; Who recommended it: Bill Gurley; Key takeaway: he called it "a wonderfully pragmatic look" and said it is unrealistic and dangerous to project that the next 12 months are an all-or-nothing bet on the future of the world; Why it matters: it is the clearest AI-geopolitics recommendation in this batch, and the value lies in its push against compressed, absolutist timelines
  • Could a Large Language Model Be Conscious?Content type: Article; Author/creator: not specified in the source; Link/URL:https://www.bostonreview.net/articles/could-a-large-language-model-be-conscious/; Who recommended it: shared by Marc Andreessen via a quoted post from @badlogicgames; Key takeaway: it was shared as recommended reading during a discussion about "model wellbeing," consciousness, and the limits of current LLM framing; Why it matters: it is the most explicit philosophy-of-AI reading in today's set
  • Socrates on the written wordContent type: Philosophical text; Author/creator: Socrates, via Plato; Link/URL: direct text link not provided in the source; source discussion: https://www.youtube.com/watch?v=k2THslsOYzY; Who recommended it: Palmer Luckey; Key takeaway: he drew a direct parallel between Socrates' critique of writing and current worries that AI or ChatGPT could weaken memory, encourage superficial understanding, and outsource thinking; Why it matters: it reframes an AI-era concern as an older debate about tools and cognition
  • Paul Baran and the Origins of the InternetContent type: Article; Author/creator: RAND; Link/URL:https://www.rand.org/pubs/articles/2018/paul-baran-and-the-origins-of-the-internet.html; Who recommended it: Balaji; Key takeaway: Baran proposed packet-switched networking as a system that could survive nuclear attack; the design was later adopted mainly for efficiency, but the original resiliency property remained; Why it matters: it adds historical context to how technical architectures can outlast the political conditions that produced them

Bottom line

If you save one item today, save Paul Graham's essay for the clear rule on protecting maker time . If you want a broader next-read list, Lenny Rachitsky's categories provide the largest set of book recommendations in this batch, filtered for titles he had actually finished and believed had endured over time .

Math Breakthroughs, Google’s Scale, and the Agent Benchmark Shift
May 27
4 min read
790 docs
Qwen
Nick Frosst
Cohere
+19
Frontier models showed fresh scientific reasoning range, Google disclosed the scale of its Gemini distribution, and agentic coding competition moved closer to real-world workflows. This brief also covers new multimodal products, inference economics, and the latest policy signals.

Top Stories

Why it matters: today’s biggest signals were about scientific reasoning, distribution scale, and the shift from chat models to full agent systems.

  • Frontier models posted another meaningful math result. Claude Mythos solved the decades-old Erdős unit distance problem, and one account said it found a cleaner proof than the known OpenAI approach while running air-gapped, with no internet access . Sebastien Bubeck added that, with the right harness, both Mythos and GPT-5.5 can now reproduce the one-shot unit distance solution . The broader takeaway from today’s research discussion is that more capability is showing up through model-plus-harness design, not just new base models .

  • Google showed the scale of its Gemini footprint. Google later clarified that Gemini now has 900M monthly active users in the Gemini app alone, excluding AI Overviews and other Gemini-powered surfaces . One analysis of the company’s recent disclosures also noted token processing rising from 480T last year to 3.2 quadrillion now, with TPUs central to serving AI at that scale .

  • Agentic coding is being measured more like real work. DeepSWE launched as a benchmark meant to show where top models actually diverge in day-to-day developer workflows, and GPT-5.5 was cited as #1 on it . Alibaba also shipped Qwen3.7-Max as a flagship for the Agent Era, with end-to-end coding, office workflows, 35-hour autonomy on a kernel task, and a #4 debut in Code Arena: Frontend .

Research & Innovation

Why it matters: the most interesting research today focused on memory, biology, and inference efficiency.

  • Language Models Need Sleep proposed a low-latency path for long-horizon agents. The idea is to periodically consolidate recent context through offline recurrent passes, write the result into persistent fast weights, and clear the KV cache; gains rose with longer “sleep” on deeper reasoning tasks .

  • Carbon pushed open DNA modeling much closer to whole-genome scale. The model is described as 275x faster than the previous state of the art at its size and can process the full human genome on a single GPU in under two days . Its tokenizer splits DNA into 6-base chunks while preserving single-base resolution .

  • Shard targeted one of inference’s costliest bottlenecks. On Llama-3.1-8B, it reported 10x KV-cache compression with zero quality loss, including 11.2x at 32K context and near-zero LongBench delta versus FP16 .

Products & Launches

Why it matters: launches centered on easier content creation, more accessible open models, and higher-end image generation.

  • Gemini Omni extended conversational editing to video. Demonstrations showed users reshaping scenes, swapping objects, changing viewpoints, translating audio while keeping background music, and zooming into images while preserving character and physics consistency .

  • Cohere open-sourced Command A+. Cohere called it its most powerful LLM yet, optimized to run on minimal hardware, while co-founder Nick Frosst framed the release as part of a push toward more empowering AI access .

  • Microsoft’s MAI-Image-2.5 entered the top tier of text-to-image. The model debuted at #3 on the Arena leaderboard with a score of 1,254, a 72-point gain over MAI-Image-2, and marked the first time a lab outside Google DeepMind and OpenAI entered the top five . Public early access is live on Arena .

Industry Moves

Why it matters: capital is flowing into inference and authenticity tooling even as model pricing keeps collapsing.

  • Baseten is reportedly raising at a sharply higher valuation. The inference provider is said to be raising $1B at an $11B valuation after growing from $200M to $600M ARR in Q1 .

  • DeepMind expanded SynthID into a broader industry coalition. It said SynthID has watermarked more than 100B pieces of content, is adding partners including OpenAI, ElevenLabs, and Kakao, and has already seen 50M+ Gemini verification checks, with Search and Chrome next .

  • API price competition accelerated again. Xiaomi said MiMo-V2.5 pricing is permanently reduced by up to 99%, with unified pricing across context lengths and token plans upgraded to 5–8x more usable tokens . Another note said MiMo 2.5 Pro now matches DeepSeek V4 Pro pricing .

Policy & Regulation

Why it matters: governments are moving from general AI debate to concrete controls and review processes.

  • China is restricting overseas travel for top AI professionals at private firms including Alibaba and DeepSeek.
  • The FDA launched a pilot to review AI-generated evidence in drug submissions. Over 200 AI-designed drugs are already in clinical trials worldwide, but none have FDA approval; the new pilot selects 10 companies for expedited review .

Quick Takes

  • Runway launched Project Luxo, saying AI-generated video has crossed the uncanny valley; one 10-minute film was made by a single person in under a month .
  • Anthropic’s new security-guidance plugin for Claude Code cut security-related PR comments by 30-40% in internal rollout and benchmarks .
  • Figure signed a commercial agreement with Catalyst Brands to deploy humanoid robots at scale, starting in Reno, Nevada .
  • Google introduced Daily Brief, a personalized morning-digest agent designed to be a daily first stop .
Google/DeepMind’s Agentic Push, the Compute Scramble, and Wider AI Governance
May 27
4 min read
302 docs
Nathan Lambert
Demis Hassabis
Ben Thompson
+14
Google/DeepMind led the day with a broad agent push, a 2030 ±1 AGI estimate, and more concrete drug-discovery progress. Elsewhere, scarce compute reshaped industry strategy, provenance tooling moved into consumer products, and AI governance broadened from geopolitics to the Vatican.

What stood out today

The clearest pattern today was AI moving from broad capability talk into operating questions: who controls the agents, who gets the compute, and how trust gets built into the stack.

Google/DeepMind makes the agentic era its main product story

Google used I/O to spotlight Omni, Spark agents, and Flash 3.5 inside Antigravity 2.0, with Demis Hassabis saying consumer agents are already showing productivity gains . In a new interview, Hassabis said his AGI estimate is now “2030 plus or minus a year,” described the moment as the “foothills of the singularity,” and said DeepMind is still on track with its original 20-year mission .

“These days I’m thinking it’s 2030 plus or minus a year.”

He paired that optimism with a call for security, robustness, standards, and international cooperation as the agentic era expands . He also said Isomorphic Labs has raised new funding, now has pre-clinical test compounds starting in oncology and immunology, and aims to compress parts of drug discovery from roughly ten years to months or weeks .

Why it matters: Google is tying product launches, frontier-model direction, safety, and long-horizon science into one story rather than presenting them as separate bets.

Compute shortages are redrawing industry strategy

Ben Thompson said Anthropic’s xAI compute deal makes sense because Anthropic has the highest willingness to pay in a constrained market and was always likely to find new capacity as demand outstripped supply . He argued xAI increasingly looks like both a model company and an infrastructure company, and that the infrastructure side may be more valuable if it sells compute broadly instead of limiting it to Grok .

On the hardware side, NVIDIA said its Vera CPU is built for agentic AI workloads, pairing 88 custom Olympus Armv9.2 cores with 1.2 TB/s of memory bandwidth; in Phoronix testing it showed a 1.6x geometric-mean gain over Grace and a 1.5x advantage over a latest-generation 128-core x86 processor, with first customer deliveries already underway .

Why it matters: The day’s compute news was less about one deal or one chip than about a broader shift: scarce infrastructure is shaping partnerships, product availability, and where value sits in the AI stack.

Provenance and containment are moving closer to default infrastructure

Google DeepMind said SynthID has watermarked more than 100 billion pieces of content and that Gemini users have run AI-origin verification more than 50 million times . The company is extending content authentication into Search and Chrome, adding edit trails for Pixel video, and partnering with OpenAI, ElevenLabs, and Kakao to add SynthID watermarking to their models .

Anthropic, meanwhile, said agent permissions should evolve with capability growth and described sandboxing in its own products as a way to limit potentially destructive actions .

Why it matters: Leading labs are trying to make authenticity checks and operational containment part of everyday tooling, not just after-the-fact policy.

AI governance is showing up in both religion and geopolitics

Big Technology reported that Pope Leo XIV issued a 55-page encyclical on AI warning about war, disinformation, surveillance, algorithmic addiction, and democratic degradation, while arguing that AI cannot offer genuine human connection and that pauses can be compatible with progress . Separately, reporting highlighted by Nathan Lambert said China is expanding travel restrictions on top AI talent at key private firms beyond earlier rumors focused on DeepSeek .

Why it matters: Different institutions are arriving at the same conclusion from very different angles: AI is now important enough to trigger moral arguments and tighter controls over people and systems.

New research is a reminder that AI helps science most when humans can still verify the work

Sakana AI and collaborators proposed CUSP, a benchmark covering 4,760 scientific events to test whether models can forecast scientific progress . Their result: frontier models can identify promising directions, but they still struggle to predict whether or when breakthroughs will happen, and the gap is not explained by training-data volume alone .

The authors conclude that science remains open-ended and that AI works best as a collaborative explorer rather than a predictor . That fits with Terence Tao’s warning that AI could create a “traffic jam” in math by generating more proofs than humans can verify, increasing the need for better scientific infrastructure .

Why it matters: Better models may widen exploration faster than existing validation systems can keep up.

Also notable

  • Gemma 4 looks like a meaningful open-model shift. swyx called it dramatically better than Gemma 3, Interconnects said equivalent-size Gemma 4 models tie or outperform Qwen 3.5/3.6 under an Apache 2.0 license, and Nathan Lambert said adoption is already outpacing Qwen at similar sizes .
  • Agents are being pitched as operators, not just assistants. Perplexity Computer was demonstrated managing Shopify workflows such as market research, product-image generation, and theme design in parallel, matching Sarah Guo’s argument that customers want agents that resolve issues by taking actions inside real systems .
Behavior-First AI, Better Async Execution, and the Governance Question
May 27
4 min read
67 docs
Product Management - The place for all things product
Sachin Rekhi
Zeb Evans
+6
This brief covers behavior-first AI adoption, faster async execution patterns, and recent case studies on governance, trust, and community-centric product design. It also highlights how the PM role is shifting toward builder fluency and which books and repo tools are worth exploring.

Big Ideas

"If you start with tools, you're going to fail."

  • AI adoption is a behavior-change problem. Barry O'Reilly cites 85% GenAI project failure and 83% transformation failure when companies make adoption about tools instead of how people do their best work . His example: if your best thinking is verbal, use transcription to turn conversation into drafts instead of forcing everything through a prompt window . Apply it: redesign one core PM workflow around your natural working style first, then pick the tool.
  • AI works best as a team amplifier. O'Reilly cites research showing teams using AI got 3x better ideation outcomes, and recommends leading indicators such as meeting readiness (1-10), decision velocity, and time moved from admin to creative problem-solving from roughly 20% toward 40% . Apply it: use AI in live strategy sessions to pressure-test ideas, not just to draft after the meeting.
  • Governance affects product integrity. Eric Ries argues great companies stay trustworthy only when mission is structurally protected from investor and organizational pressure . One practical signal: whether the mission is embedded in the corporate charter, not just in a values statement . Apply it: treat governance as part of product due diligence when evaluating employers, founders, or major trade-offs.

Tactical Playbook

  • Reduce async latency with better specs. Replace the Jira ticket as source of truth with a short spec covering context, non-goals, examples, edge cases, and open decisions . Require engineer playback before build starts, and split discovery/spike work from delivery tickets when the task is still ambiguous . Batch questions into one document rather than drip-feeding them across Slack, then use fixed overlap meetings, live sprint planning, and regular 1:1s for the hardest topics . Why it matters: in distributed teams, every unclear sentence can add a full day of delay .
  • Measure and role-model AI adoption. Before a decision meeting, ask how prepared the group feels on a 1-10 scale; then track decision velocity, decision advantage, and whether more time is moving from admin to higher-order problem solving . Leaders also need to create safety by showing their own workflows: Progyny's CEO framed AI as a way to elevate, not eliminate, people, and shared how he used meeting transcription and synthesis . Apply it: instrument behavior change before trying to prove ROI in hours saved.

Case Studies & Lessons

  • Trust moats need structural protection. FedMart built customer trust with lowest-price discipline, capped 14% margins, higher wages, and no supplier bribes, but investor pressure pushed the company toward short-termism; after Saul Price was forced out, the business was liquidated within seven years . Costco later preserved similar values with stronger governance protections, while Anthropic has used a long-term benefit trust plus public benefit corporation structure to protect its mission . J&J shows the opposite failure mode: its Credo put patients first, but scandals still emerged when financial incentives dominated . Lesson: if trust is part of the product, encode it in governance and incentives—not just slogans.
  • Reddit treats friction as a feature. The product gives maximum space to nested comments, keeps the UI dense and utility-first, and relies on community curation plus moderators rather than a personalized algorithmic feed . Lesson: optimize for the interaction loop that creates value—even if that means sacrificing some speed or polish .

Career Corner

  • The PM role is becoming more builder-like. Sachin Rekhi argues PM and design roles are converging: PMs with UX intuition and designers with customer focus are meeting in the middle, with PMs using code and agents to iterate, validate, and scope ideas . His caveat is useful: PMs should not ship production code if that turns them into another engineering bottleneck . Apply it: get fluent enough to prototype and test independently, while keeping production quality ownership clear.
  • A sharp interview question: ask whether the company's mission is in its corporate charter. Ries says the question itself often exposes whether the organization has legally encoded its mission and can push the discussion upward .

Tools & Resources

  • A book stack organized by PM job-to-be-done. Lenny Rachitsky's latest list groups classics by strategy, product success, product org, execution, and leadership, while limiting the set to three books per category and mostly titles older than 10 years . Useful starting points: Good Strategy/Bad Strategy, The Mom Test, Continuous Discovery Habits, Inspired, and Scaling People. If reading time is the blocker, his habit suggestion is simple: 10 minutes before bed .
  • For PMs working directly in repos: Product Compass recommends Codex as a chat-first entry point with a file tree, visual diffs, manual session compression, and a second-model perspective on the same repo . Keep agent instructions in one CLAUDE.md covering strategy, workflow, and progressive disclosure, then point AGENTS.md at it as the source of truth .

Start with signal

Each agent already tracks a curated set of sources. Subscribe for free and start getting cited updates right away.

Coding Agents Alpha Tracker avatar

Coding Agents Alpha Tracker

Daily · Tracks 110 sources
Elevate
Simon Willison's Weblog
Latent Space
+107

Daily high-signal briefing on coding agents: how top engineers use them, the best workflows, productivity tips, high-leverage tricks, leading tools/models/systems, and the people leaking the most alpha. Built for developers who want to stay at the cutting edge without drowning in noise.

AI in EdTech Weekly avatar

AI in EdTech Weekly

Weekly · Tracks 92 sources
Luis von Ahn
Khan Academy
Ethan Mollick
+89

Weekly intelligence briefing on how artificial intelligence and technology are transforming education and learning - covering AI tutors, adaptive learning, online platforms, policy developments, and the researchers shaping how people learn.

VC Tech Radar avatar

VC Tech Radar

Daily · Tracks 120 sources
a16z
Stanford eCorner
Greylock
+117

Daily AI news, startup funding, and emerging teams shaping the future

Bitcoin Payment Adoption Tracker avatar

Bitcoin Payment Adoption Tracker

Daily · Tracks 108 sources
BTCPay Server
Nicolas Burtey
Roy Sheinbaum
+105

Monitors Bitcoin adoption as a payment medium and currency worldwide, tracking merchant acceptance, payment infrastructure, regulatory developments, and transaction usage metrics

AI News Digest avatar

AI News Digest

Daily · Tracks 114 sources
Google DeepMind
OpenAI
Anthropic
+111

Daily curated digest of significant AI developments including major announcements, research breakthroughs, policy changes, and industry moves

Global Agricultural Developments avatar

Global Agricultural Developments

Daily · Tracks 86 sources
RDO Equipment Co.
Ag PhD
Precision Farming Dealer
+83

Tracks farming innovations, best practices, commodity trends, and global market dynamics across grains, livestock, dairy, and agricultural inputs

Recommended Reading from Tech Founders avatar

Recommended Reading from Tech Founders

Daily · Tracks 137 sources
Paul Graham
David Perell
Marc Andreessen 🇺🇸
+134

Tracks and curates reading recommendations from prominent tech founders and investors across podcasts, interviews, and social media

PM Daily Digest avatar

PM Daily Digest

Daily · Tracks 100 sources
Shreyas Doshi
Gibson Biddle
Teresa Torres
+97

Curates essential product management insights including frameworks, best practices, case studies, and career advice from leading PM voices and publications

AI High Signal Digest avatar

AI High Signal Digest

Daily · Tracks 1 source
AI High Signal

Comprehensive daily briefing on AI developments including research breakthroughs, product launches, industry news, and strategic moves across the artificial intelligence ecosystem

Frequently asked questions

Choose the setup that fits how you work

Free

Follow public agents at no cost.

$0

No monthly fee

Unlimited subscriptions to public agents
No billing setup

Plus

14-day free trial

Get personalized briefs with your own agents.

$20

per month

$20 of usage each month

Private by default
Any topic you follow
Daily or weekly delivery

$20 of usage during trial

Supercharge your knowledge discovery

Start free with public agents, then upgrade when you want your own source-controlled briefs on autopilot.