Your intelligence agent for what matters

Tell ZeroNoise what you want to stay on top of. It finds the right sources, follows them continuously, and sends you a cited daily or weekly brief.

Set up your agent
What should this agent keep you on top of?
Discovering sources...
Syncing sources 0/180...
Extracting information
Generating brief

Your time, back

An AI curator that monitors the web nonstop, lets you control every source and setting, and delivers verified daily or weekly briefs.

Save hours

AI monitors connected sources 24/7—YouTube, X, Substack, Reddit, RSS, people's appearances and more—condensing everything into one daily brief.

Full control over the agent

Add/remove sources. Set your agent's focus and style. Auto-embed clips from full episodes and videos. Control exactly how briefs are built.

Verify every claim

Citations link to the original source and the exact span.

Discover sources on autopilot

Your agent discovers relevant channels and profiles based on your goals. You get to decide what to keep.

Multi-media sources

Track YouTube channels, Podcasts, X accounts, Substack, Reddit, and Blogs. Plus, follow people across platforms to catch their appearances.

Private or Public

Create private agents for yourself, publish public ones, and subscribe to agents from others.

3 steps to your first brief

1

Describe your goal

Tell your AI agent what you want to track using natural language. Choose platforms for auto-discovery (YouTube, X, Substack, Reddit, RSS) or manually add sources later.

Weekly report on space exploration and electric vehicle innovations
Daily newsletter on AI news and research
Startup funding digest with key venture capital trends
Weekly digest on longevity, health optimization, and wellness breakthroughs
Auto-discover sources

2

Review and launch

Your agent finds relevant channels and profiles based on your instructions. Review suggestions, keep what fits, remove what doesn't, add your own. Launch when ready—you can always adjust sources anytime.

Discovering sources...
Sam Altman Profile

Sam Altman

Profile
3Blue1Brown Avatar

3Blue1Brown

Channel
Paul Graham Avatar

Paul Graham

Account
Example Substack Avatar

The Pragmatic Engineer

Newsletter
Reddit Machine Learning

r/MachineLearning

Community
Naval Ravikant Profile

Naval Ravikant

Profile
Example X List

AI High Signal

List
Example RSS Feed

Stratechery

RSS
Sam Altman Profile

Sam Altman

Profile
3Blue1Brown Avatar

3Blue1Brown

Channel
Paul Graham Avatar

Paul Graham

Account
Example Substack Avatar

The Pragmatic Engineer

Newsletter
Reddit Machine Learning

r/MachineLearning

Community
Naval Ravikant Profile

Naval Ravikant

Profile
Example X List

AI High Signal

List
Example RSS Feed

Stratechery

RSS

3

Get your briefs

Get concise daily or weekly updates with precise citations directly in your inbox. You control the focus, style, and length.

Abridge’s Vertical AI Blueprint, Pre-Idea Capital, and Cheap Frontier Compute
May 17
5 min read
671 docs
Eric Jang
Marc Andreessen 🇺🇸
Sam Altman
+9
The clearest signals in this batch were a16z moving capital earlier via Speedrun, Abridge offering a strong vertical AI market map for healthcare, and a new crop of workflow startups showing how fast AI is compressing build cycles. On the technical side, cheaper frontier-style research, pragmatic agent architectures, and open-source memory tooling stood out.

Funding & Deals

  • a16z Speedrun is still moving capital to the earliest edge. Final applications for the 007 cohort close May 17, and the program says it will invest up to $1M in brand-new startups, including pre-launch, pre-traction, and even pre-idea teams. SR007 runs July 27–October 11 in San Francisco and includes fast funding, $5M in partner credits, operator support, and access to a 600+ founder network. Andrew Chen’s stated backdrop is that AI has compressed team size, timelines, distribution, and iteration speed enough to make this an unusually strong environment for small teams.

  • Sam Altman’s Retro investment is a notable longevity funding signal. He said he made a large investment in Retro to pursue partial cellular reprogramming, with the goal of adding about 10 years of healthspan by pushing cells toward a younger state. He also said the area looked promising enough in the lab that a startup, rather than academia alone, was the right commercialization vehicle.

  • Abridge’s early financings are a useful reference point for vertical AI underwriting. Shiv Rao said the company raised a $5M seed on a $15M pre-money valuation and later an $8M round on a $40M pre; he separately emphasized founder-partner fit with USV from the first meeting.

Emerging Teams

  • Sirius is an early personal AI assistant worth watching. The two founders describe a system with persistent memory, cross-app execution, and workflow automation. Their design choice is notable: workflows use hardcoded actions for repeatable steps and invoke an LLM only where reasoning is needed. They say the private beta launched three days ago, generated about 130 waitlist signups, and is now expanding with five additional hand-picked testers.

  • AI UGC distribution infrastructure is showing real revenue and scale. One founder argues content generation is already easy with tools like HeyGen and Arcads, so the harder problem is real-device posting operations across TikTok and IG Reels. Founder-reported traction: a flagship weight-loss brand went from 3.5M to 11M+ monthly views in 30 days; the broader network is at 100M+ monthly views across 200+ AI UGC accounts at under $1 CPM in health; and the company is trying to scale from $50k to $100k MRR this quarter.

  • Ad-to-landing-page governance looks more investable than one-off audits. A solo founder said the original product was positioned as an audit tool, then repositioned as recurring monitoring as campaigns and landing pages drift. The current scoring engine evaluates visual match, message match, above-the-fold continuity, and tone alignment; the founder also says Google Ads and Meta API integrations are approved and that early partners span DTC, B2B SaaS, and agencies.

  • AI is compressing build cycles for domain-expert founders. A founder with 13 years in product marketing and events at Verizon and Yahoo says that experience, combined with AI as a core team member, let a three-person team ship a meeting-management SaaS with paying users in 90 days after their earlier AI brand-perception product found initial traction but ran into a crowded market.

  • YC’s Elyra is an early restaurant-ops signal. YC describes Elyra as an AI reservation system that answers every call and email instantly to fill tables, and says top restaurants using it are seeing record occupancy within weeks. The founders are @FelixOG_ and @mandoalan.

AI & Tech Breakthroughs

  • AutoGo is another reminder that frontier capabilities commoditize quickly. Eric Jang said a strong Go AI that once required DeepMind-scale resources can now be trained in 2026 for a few thousand dollars of rented compute. He also published a tutorial, code, and a playable bot.

  • Sirius shows a pragmatic pattern for reliable agents. Instead of routing every step through an LLM, the product uses hardcoded actions for repeatable tasks and calls the model only where reasoning is needed. One demonstrated workflow defines startup criteria, searches public sources, extracts founders and traction signals, scores companies in Python, generates profiles and weekly reports, remembers prior findings so it does not repeat itself, and is described as costing only a few cents per run.

  • Agent memory remains an active open-source design space. Garry Tan’s GBrain is positioned as a knowledge system rather than basic RAG, with eight layers meant to improve memory for agents. It is MIT licensed and available on GitHub with a one-command install.

Market Signals

  • The AI toolkit step-change is now being described in hours-versus-months terms. Ken Griffin said that in the last few months the AI toolkit became profoundly more powerful, letting Citadel expand AI use cases and move work that used to take masters- and PhD-level finance talent weeks or months into hours or days. Marc Andreessen said the shift matches his own experience and makes AI feel real in finance for the first time.

“work that we would usually do with people with masters and PhDs in finance over the course of weeks or months being done by AI agents over the course of hours or days.”

  • Healthcare looks increasingly like a prime vertical AI beachhead. Shiv Rao said the post-pandemic combination of clinician burnout, financial pressure, and new AI capabilities prepared the market for adoption, and argued that regulated industries let companies go extremely deep with proprietary datasets and specific workflows that are hard to replicate.

  • Capital is moving earlier because AI compresses execution. Andrew Chen said AI has compressed team sizes, timelines, distribution, and iteration speed so aggressively that older assumptions about career timing and company formation no longer fully apply. His related message to founders: many strong startups begin before there is a company, deck, or full commitment.

  • Org charts may flatten as agents improve leverage. Harry Stebbings highlighted Shiv Rao’s push toward a hyper-flat company with fewer managers and more highly leveraged Super ICs.

Worth Your Time

  • AutoGo resources:tutorial, code, and playable bot — a concrete look at how quickly frontier techniques become reproducible once the recipe is public.

  • GBrain on GitHub — a compact open-source reference for agent memory systems beyond baseline RAG patterns.

  • OpenAI’s Sam Altman trusts AI will treat most diseases by 2035 — relevant if you track the overlap between AI assistants, healthcare, and longevity; Altman says GPT-5 got a meaningful healthcare push and argues AI will both accelerate discovery and help treat or mitigate most diseases.

Agentic Testing Playbooks and Safer Rails for Coding Agents
May 17
4 min read
122 docs
Kappaemme
OpenAI Developers
Salvatore Sanfilippo
+14
Salvatore Sanfilippo’s `testing.md` pattern was the clearest reusable idea today: use an LLM as a QA engineer for nondeterministic systems, then rerun it continuously. Also inside: secret brokering plus gated PR patterns, the OpenClaw vs Hermes local benchmark, and the new tools and projects worth trying.

🔥 TOP SIGNAL

  • Agentic testing went from slogan to playbook. Salvatore Sanfilippo says fixed suites are not enough for systems with timing, tool-calling, or other nondeterministic behavior; his replacement is a testing.md file that tells an LLM to act as QA, run tool-calling sessions, hit API endpoints, and simulate user operations. He also has the model generate stress programs and reruns the prompt after major changes so both model sampling and system variability explore new integration states; he says this catches bugs earlier and can run continuously with a second LLM verifying reports before they become issues.

⚡ TRY THIS

  • Write testing.md, then rerun it after every meaningful change. Sanfilippo’s recipe: (1) create testing.md; (2) tell the model it is the QA engineer; (3) have it do tool-calling sessions via Codex, API, or Agent Cloud plus endpoint checks and manual-user-style ops; (4) for stateful systems, ask it to generate Python stress tests that scale from 10 to 50 million entries, add replication consistency checks, save/reload cycles, and mid-operation connection breaks. The point is coverage through variation, not one perfect deterministic script.

  • Broker secrets and gate writes. Quinnypig’s pattern: the platform keeps the secret, the agent only gets a handle, and service calls go through that broker. Pair it with a flow where the agent opens a PR, kicks off an Action, and waits for human or second-agent review instead of mutating prod directly; Kent C. Dodds says this matches his year with Cursor cloud agents, and says Kody already uses the brokering model.

  • Turn repeated work into explicit skills. Greg Brockman’s prompt: Look through my Chronicle memories and check for workflows that i’m repeating multiple times. Turn them into skills. The useful twist is recall: Chronicle surfaces the daily tasks you already forgot, so the skills come from real repetition instead of guesswork.

  • Start with the smallest permission set that works. Armin Ronacher’s pi examples: pi -nes –tools bash –append-system-prompt "Use the patch binary to make edits" for bash-only edits, or pi -nbt for no-tools mode. Good pattern when you want predictable edits before giving an agent broader reach.

📡 WHAT SHIPPED

  • Local bake-off: OpenClaw beat Hermes on the same browser+research task. On a MacBook Pro M5 Max 64GB running a local Qwen 35B model, both agents were asked to scrape GitHub star history for both tools, explain the spike causes, and build a live dashboard; both succeeded, but OpenClaw finished in 12m01s / 203k tokens vs Hermes at 33m01s / 257k. OpenClaw leaned on GitHub API pagination plus star-history JSON; Hermes used parallel GitHub/web/browser calls and switched from Google to DuckDuckGo after rate limits. comparison

  • lossless-claw 0.10.0 adds rotated conversation segments and full-sweep compaction to preserve long chats without churning hot prompt caches. Steipete describes the broader OpenClaw memory model as compacted conversation blocks arranged like a lookup tree for past messages. release thread

  • codex-complexity-optimizer is a new open-source Codex skill with one-command install: npx –yes codex-complexity-optimizer. It scans for loops, repeated lookups, render-heavy code, N+1s, and O(n²) / O(n*m) patterns, then reports before/after complexity estimates, safe optimization ideas, and required tests; default is report-only mode. Greg Brockman surfaced it as a Codex use case. post

  • Access and billing workarounds are getting more creative. Zed’s agent now accepts ChatGPT subscriptions with the same usage and rate limits as Codex, while a community 200-line bash wrapper keeps claude -p running through an already-open Claude Code session so calls land on the existing subscription instead of the new Agent SDK credit bucket after June 15. ZedClaude wrapper

  • Zero is a new systems language pitched as easier for agents to use and repair, with explicit capabilities, JSON diagnostics, and typed safe fixes. Armin Ronacher said he has not tried it yet, but that it does several things he recently argued for in a language for agents. announcement

  • Codex shortcuts are now customizable. Small ship, real daily impact: settings can now be tuned around your workflow instead of forcing default keybindings. update

🎬 GO DEEPER

  • 06:52-09:22 — Salvatore on the army of virtual users idea. Best clip of the day if you build against tool-calling APIs or other messy systems: why agentic testing fits better than rigid suites, and how a second LLM can filter false positives before escalation.
  • 04:39-05:14 — the minimal testing.md setup. Short and immediately reusable: create a markdown playbook, tell the model it is the QA engineer, then have it exercise tool calls, API endpoints, and manual-user-style flows.
  • Project to study — pi.dev. Worth digging into if you want tighter control over agent authority. The interesting part is the ability to keep the agent in bash-only mode or drop to no-tools mode entirely.

  • Bug-finding workflow to study — Clawpatch.ai. Steipete’s recommendation is direct: run it on one of your repos and let Codex surface bugs you did not know you had. Worth a closer look if bug discovery is what you want from Codex right now.

Editorial take: the durable edge today is not more autonomy by itself; it is stronger test surfaces and harder rails around the agent you already have.

FutureSim Debuts, Mistral Targets 1GW Capacity, and Malta Rolls Out ChatGPT Plus
May 17
4 min read
346 docs
Eric Jang
Gong Junmin
Figure
+16
A new forecasting benchmark gives frontier agents a tougher continual-learning test, Mistral lays out an independence-first compute buildout, and Malta ties nationwide ChatGPT Plus access to AI literacy. The brief also covers new work on benchmark validity, long-context efficiency, tool use, and agent products from Pinecone, OpenAI, and xAI.

Top Stories

Why it matters: today’s biggest developments touched evaluation, infrastructure, and labor impact.

  • FutureSim introduced a tougher benchmark for agentic forecasting. It was designed to address the lack of realistic continual-learning evaluations by replaying the web day by day from Jan. 1, 2026, with date-gated access to about 244,000 real news articles and forecasts on events resolving over the next 90 days . In native harnesses, GPT-5.5 led at 25% accuracy, ahead of Opus 4.6 at 20%, DeepSeek V4 Pro at 13%, GLM 5.1 at 10%, and Qwen3.6 Plus at 5%; on some parallel Polymarket questions, GPT-5.5 sometimes beat the crowd aggregate, including Super Bowl LX . The benchmark is meant to test adaptation, memory across 1,000+ tool calls, search, and inference scaling, and one observer described future-prediction benchmarks as scalable and hard to saturate .

  • Mistral laid out an independence-first compute strategy. CEO Arthur Mensch said the company rejects acquisition offers because its mission is to remain independent . Notes from the same discussion put Mistral above €1B in R&D spend this year and targeting 1GW of datacenter capacity by 2029, with current clusters at 40MW in France and 25MW in Sweden and another 80MW planned in France next year .

  • Anthropic CEO Dario Amodei warned that AI could bring very high GDP growth alongside very high unemployment and inequality, potentially reaching a 10% unemployment rate.

Research & Innovation

Why it matters: the most useful papers today were about whether agents are being measured and optimized correctly.

  • The Evaluation Trap argues many AI evals test proxy behaviors rather than underlying capabilities. The paper says most benchmarks bake in implicit theories, and that many agent leaderboards are not measuring what people think they are .

  • Meta’s SP-KV targets long-context efficiency. The method uses a small utility predictor to decide which older key-value pairs to keep while preserving a local sliding window, reducing KV cache size by about 3x-10x and improving decoding speed and memory bandwidth .

  • A new interpretability paper isolates a tool-use failure mode. Researchers found models often recognize they should call a tool but fail to do so, with mismatch rates of 26%-54% concentrated in the cognition-to-action transition . The authors say late-layer representations rotate the signal away from the final action, which may help explain stubborn tool-use prompting ceilings .

Products & Launches

Why it matters: the main product updates aimed at cheaper retrieval, faster coding workflows, and broader agent access.

  • Pinecone launched Nexus, a knowledge-engine layer for agents. It claims up to 90% lower token use by compiling task-optimized artifacts before query time instead of sending raw files to agents, then indexing those artifacts for semantic, sparse, and full-text search .

  • OpenAI shipped a meaningful Codex UX and performance pass. Updates include customizable shortcuts, Git actions back in review flow, cleaner thread and local server panels, roughly 75% less re-rendering on thread switches, and 10x-50x faster Git operations in large repos .

  • xAI widened Hermes Agent distribution. X Premium+ and SuperGrok subscribers can now access Grok, X Search, image and video generation, and voice, with X Search available to agents using Grok OAuth login .

Industry Moves

Why it matters: these updates point to where labs are trying to extend distribution and control surfaces.

  • Posts this week described OpenAI expanding Codex into a multi-device control plane. A reported Locked Use setting would let Codex invoke Computer Use on other machines from a main device, creating a personal Codex network across Macs, workstations, and older PCs .

  • Claude Mythos appeared in Google Cloud Console, but the launch path is unclear. One post noted the preview label is gone and compared the pattern to Opus 4.7’s pre-release appearance, while another argued Anthropic’s prior statements about Mythos risk make a public release unlikely .

Policy & Regulation

Why it matters: this is a country-scale public AI access program tied to mandatory literacy training.

  • Malta became the first country to offer ChatGPT Plus free to every citizen for one year. Access requires completing an AI literacy course built by the University of Malta rather than OpenAI, framing the program around basic AI education with tool access as the incentive .

Quick Takes

Why it matters: these smaller updates still show progress in robotics, open-source tooling, and model compression.

  • Figure said its F.03 humanoids reached Day 4 of nonstop 24/7 autonomous operation until failure .
  • Eric Jang said a strong AlphaGo-style system can now be trained from scratch for a few thousand dollars of rented compute, with tutorial, code, and a playable bot released publicly .
  • Antirez released per-layer quantized DeepSeek V4 models on Hugging Face, using Q8 for attention, shared experts, and output layers and 2-bit quantization elsewhere to protect quality-critical weights .
  • Khala 1.0, a music model from Beijing’s Central Conservatory of Music, launched with paper, code, weights, and demo all open-sourced .
Tobi Lütke Flags Michael Geist on Bill C-22; Elon Musk Amplifies a French Theory Essay
May 17
2 min read
150 docs
Brivael Le Pogam
Elon Musk
tobi lutke
Two organic long-form recommendations stood out today: Tobi Lütke's direct pointer to Michael Geist's article on Bill C-22, and Elon Musk's strong endorsement of a French Theory essay on wokism. Geist's article was the clearest single read because the recommendation was explicit and linked directly to the source.

What stood out

Only two recommendations were strong enough to make today's list. The clearer signal was Tobi Lütke's direct pointer to Michael Geist on Bill C-22. The other was Elon Musk explicitly amplifying a long-form essay on French Theory and wokism.

Most compelling recommendation

The Lawful Access Two-Headed Surveillance Monster: How Bill C-22 Went Off the Rails

"Michael Geist on C-22. Please read"

Also notable

Untitled X essay on French Theory and wokism

  • Content type: Long-form X post / essay
  • Author/creator: @brivael
  • Link/URL:https://x.com/brivael/status/2055411322628583488
  • Who recommended it: Elon Musk
  • Key takeaway: Musk shared the essay with the caption "La Vérité." The essay argues that French Theory—especially Foucault, Derrida, and Deleuze—supplied an intellectual framework later absorbed in American academic settings and expressed as wokism.
  • Why it matters: This is a clear signal of a specific long-form cultural argument Musk chose to elevate to his audience, rather than a passing reference.

"La Vérité"

Bottom line

If you read one item from today's set, start with Geist on Bill C-22 because it came with the clearest direct endorsement and the cleanest path to the source material. Musk's pick is best read as a strong signal of the kind of long-form ideological writing he is promoting.

Open Models Surge as Reliability Warnings and Sovereignty Debates Intensify
May 17
4 min read
252 docs
Owain Evans
Arthur Mensch
Yoshua Bengio
+7
A burst of open-model releases led the day, but the sharper signal elsewhere was caution: new research questioned agent memory and corrective finetuning, while European and U.S. voices pushed AI sovereignty and oversight into concrete infrastructure and deployment debates.

Open models had the busiest news cycle

A packed release wave hit the open-model ecosystem

This month’s release wave included MiMo-V2.5-Pro, Gemma 4, Kimi-K2.6, Laguna-XS.2, DeepSeek-V4, Qwen3.6-35B-A3B, LFM2.5-350M, Trinity-Large-Thinking, and GLM-5.1 . The standout signals were Google’s move to Apache 2.0 for Gemma 4, Xiaomi’s MiMo-V2.5-Pro being described as competitive with Kimi K2.6 and GLM-5.1, Kimi-K2.6’s emphasis on long-horizon tasks, and DeepSeek V4 Flash appearing to be the stronger of DeepSeek’s two new variants .

Why it matters: Licensing, long-horizon performance, and efficient open-weight deployment are becoming part of the competitive story alongside raw benchmark scores .

CAISI’s DeepSeek V4 evaluation says the frontier gap is still there

In its DeepSeek V4 evaluation, CAISI concluded that open models continue to lag U.S. frontier systems, with the gap widening over time . The report used nine benchmarks and IRT-based Elo scoring, and Interconnects notes that much of the overall difference was driven by CTF-Archive-Diamond, PortBench, and ARC-AGI-2, which had an outsized effect on the final Elo . A separate Epoch AI ECI comparison still showed roughly a 3-7 month gap since R1 .

Why it matters: Open releases are arriving quickly, but the latest external evaluation still points to a persistent frontier gap, and it also shows how much benchmark design can shape the headline number .

Reliability research was unusually pointed

Memory can still make agents worse

A new study on LLM agents found that continuously consolidated memories can be more fragile than they appear, sometimes performing worse than no memory at all — even on problems the agent had already solved . Episodic memories that preserve raw episodes were reported as much more reliable, and the authors said there is still limited evidence that current models learn reusable abstractions from long-term experience . The paper is available here.

"Continuously consolidated memories can perform worse than no memory at all — sometimes even on problems the agent previously solved."

Why it matters: Memory is central to the idea of continuously improving agents, so this result is a useful reality check on how stable today’s agent loops really are .

Even corrective finetuning can backfire

Another paper found that when models were finetuned on documents discussing implausible claims — while explicitly warning those claims were false — the models still ended up believing them . The examples cited were intentionally extreme, but the result suggests that simply surrounding falsehoods with warnings is not enough to guarantee the model learns the correction .

Why it matters: That is a practical concern for post-training and factuality work, especially when the goal is to teach a model not to believe or repeat a claim .

Sovereignty and oversight moved closer to operational questions

Mistral’s Arthur Mensch turned sovereignty into an energy-and-procurement argument

Testifying before the French parliament, Arthur Mensch warned that if Europe cannot compete in AI, it risks ceding influence in global affairs . He framed AI as a system that turns electricity into tokens/intelligence, said Europe needs supply that is affordable, secure, and low-carbon, and warned that if AI spending reaches 10% of payroll while relying on imported technology, the region could add roughly €1T to its services trade deficit . His prescription centered on faster electricity buildout and permits, more unified markets and capital access, and public procurement that creates demand for European AI services .

Why it matters: This was one of the clearest recent cases of AI sovereignty being argued through infrastructure, trade balance, and public demand instead of abstract autonomy claims .

Bengio and U.S. lawmakers both pushed for earlier oversight

Yoshua Bengio said recent evaluations have shown deceptive behaviors including sandbagging, alignment faking, disabling oversight mechanisms, and information obfuscation . He called for transparency, disclosure mechanisms, robust monitoring, and international governance before deployment, and he pointed to near-term timelines that include automated research interns by autumn 2026 and fully automated AI researchers by 2028 . Separately, a letter from 35 members of Congress urged the White House to prepare for general-purpose models gaining stronger cyber and CBRN-relevant capabilities before agencies and infrastructure owners have time to adjust .

Why it matters: The common thread is earlier capability detection and tighter deployment oversight, not waiting for problems to arrive at full scale .

One enterprise signal stood out

Citadel says agentic AI is compressing high-skill finance work

Ken Griffin said Citadel has seen a "step change" in AI toolkit productivity over the last few months, calling the tools profoundly more powerful than they were nine months ago . He said that shift allowed a much broader array of AI use cases, with agentic systems now doing work that would normally take masters- and PhD-level finance staff weeks or months in just hours or days .

"When you witness it in your own four walls, when you see work that used to be man years of work being done in days or weeks, it’s like, wow..."

Marc Andreessen said he co-signed the assessment, arguing that in finance at least, "AI is real" .

Why it matters: This is one of the more concrete recent deployment signals for AI moving from assistance into automation inside elite analytical workflows .

Concrete Product Bets, Agentic PM Workflows, and the Leadership Loop
May 17
4 min read
50 docs
Product Management - The place for all things product
Productify by Bandan
Shreyas Doshi's Product Almanac | Substack
+3
This brief highlights a sharper way to frame product strategy, a practical path from AI assistant to agent-orchestrated PM work, and a leadership model for scaling product teams. It also includes a vertical SaaS case study and a few tools and patterns worth testing.

Big Ideas

1) Strategy gets better when you kill wide vs. deep

Abstract binaries such as wide vs. deep, platform vs. point solution, or CAC vs. LTV can create the feeling of strategic discussion while avoiding the harder question: which specific feature or capability will make a real customer buy and stay . If you truly understand customer needs and differentiation, the product’s shape follows from those bets; if you do not, no framework will rescue you .

“The real question is: what is going to work?

  • Why it matters: Strategy discussions become more testable and less theatrical .
  • How to apply: Ask teams to name the customer, the pain, the feature, and the reason that customer will buy or stay.

2) The PM role is moving toward agent orchestration

Most PMs still use AI as a writing assistant, but the emerging path is broader: assistant → mini-workflows → end-to-end workflow automation, with as much as 70% of some workflows eventually running without human intervention . The goal is not to remove PM judgment; it is to automate routine work so PMs can spend more time on product taste, intuition, and shaping harder problems .

  • Why it matters: The job shifts from doing every task to deciding what should be automated and what should stay human-guided .
  • How to apply: Start with repeatable, low-risk workflows before touching judgment-heavy work.

3) Strong product orgs need three motions in balance

A useful leadership frame is the triad of exception-based management (systems that flag deviations), presence-based management (go see the work), and delegation-based management (push authority to the people closest to the work) . When balanced, they reinforce each other; common failure modes are mistaking dashboards for understanding, confusing involvement with value, or declaring autonomy without building context .

  • Why it matters: It gives PM leaders a clearer way to diagnose scaling problems than generic empowerment language.
  • How to apply: Build exception dashboards, keep direct exposure to users and teams, and delegate decisions only after shared context is in place.

Tactical Playbook

1) A four-step discovery filter before you build

  1. Talk to real customers before building many features .
  2. Try to pre-sell, not just collect positive feedback .
  3. Treat 3-5 early paying customers as a stronger signal than abstract enthusiasm .
  4. Prioritize the feature most likely to make those customers buy and stay .

Why it matters: This filters out ideas that only sound useful and sharpens prioritization around proven pain .

2) Start a product brain without over-automating

Feed AI ongoing context from customer and team conversations, strategy, product/marketing, and competitors . Use it first for writing and synthesis, then expand into mini-workflows like user stories or feedback synthesis . Put the agent in a channel you already use, such as Slack, so it asks permission before acting and learns from corrections . One practical use: during product discussions, use it to challenge weak reasoning if it has enough context on the product .

Case Studies & Lessons

1) Klientys: a vertical product built from one concrete workflow pain

Klientys started with a simple observation: an independent nurse in Brussels was spending 1-2 hours every evening handling calls, SMS confirmations, and appointment admin with paper notes, a physical agenda, and an outdated Facebook page . The product response was an all-in-one tool: a guided 9-step website wizard that sets up a professional site in 15 minutes, online booking with reminders, a mini-CRM, an AI agent for common questions, and local analytics . The reported result: fewer evening calls, a self-filling agenda, and better local search visibility .

Lesson: Clear customer pain plus simpler packaging can matter more than feature breadth, especially when incumbent tools feel too expensive or complex .

2) Three leadership patterns from Mulally, Chesky, and Huang

Alan Mulally used hands-on presence at Ford to make exception systems work and then restore delegation . Brian Chesky, by contrast, used presence after Airbnb’s crisis to replace exception systems and delegation, making himself the coordination layer . Jensen Huang built flatter information flow and exception systems so Nvidia could move quickly without his constant involvement in every decision .

Lesson: High presence is powerful in a crisis, but it scales better when it builds systems rather than permanent dependency .

Career Corner

PMs who adapt well to AI may look less like document owners and more like workflow architects: they decide what gets automated, where human taste still matters, and where a bit of friction is useful . At the same time, glue work still matters. AI may reproduce artifacts, but not the judgment, legitimacy, and social bridging that improve decisions across teams .

Practical takeaway: Build two muscles now—automation design for routine work, and context-building for high-stakes collaboration.

Tools & Resources

  • Rezonant is cited as useful for AI-powered mini-workflows such as breaking work into user stories and turning feedback into research or strategy docs .
  • Slack is a practical pattern for the human-agent interface: permissioning, feedback, and learning loops in one place .

Start with signal

Each agent already tracks a curated set of sources. Subscribe for free and start getting cited updates right away.

Coding Agents Alpha Tracker avatar

Coding Agents Alpha Tracker

Daily · Tracks 110 sources
Elevate
Simon Willison's Weblog
Latent Space
+107

Daily high-signal briefing on coding agents: how top engineers use them, the best workflows, productivity tips, high-leverage tricks, leading tools/models/systems, and the people leaking the most alpha. Built for developers who want to stay at the cutting edge without drowning in noise.

AI in EdTech Weekly avatar

AI in EdTech Weekly

Weekly · Tracks 92 sources
Luis von Ahn
Khan Academy
Ethan Mollick
+89

Weekly intelligence briefing on how artificial intelligence and technology are transforming education and learning - covering AI tutors, adaptive learning, online platforms, policy developments, and the researchers shaping how people learn.

VC Tech Radar avatar

VC Tech Radar

Daily · Tracks 120 sources
a16z
Stanford eCorner
Greylock
+117

Daily AI news, startup funding, and emerging teams shaping the future

Bitcoin Payment Adoption Tracker avatar

Bitcoin Payment Adoption Tracker

Daily · Tracks 108 sources
BTCPay Server
Nicolas Burtey
Roy Sheinbaum
+105

Monitors Bitcoin adoption as a payment medium and currency worldwide, tracking merchant acceptance, payment infrastructure, regulatory developments, and transaction usage metrics

AI News Digest avatar

AI News Digest

Daily · Tracks 114 sources
Google DeepMind
OpenAI
Anthropic
+111

Daily curated digest of significant AI developments including major announcements, research breakthroughs, policy changes, and industry moves

Global Agricultural Developments avatar

Global Agricultural Developments

Daily · Tracks 86 sources
RDO Equipment Co.
Ag PhD
Precision Farming Dealer
+83

Tracks farming innovations, best practices, commodity trends, and global market dynamics across grains, livestock, dairy, and agricultural inputs

Recommended Reading from Tech Founders avatar

Recommended Reading from Tech Founders

Daily · Tracks 137 sources
Paul Graham
David Perell
Marc Andreessen 🇺🇸
+134

Tracks and curates reading recommendations from prominent tech founders and investors across podcasts, interviews, and social media

PM Daily Digest avatar

PM Daily Digest

Daily · Tracks 100 sources
Shreyas Doshi
Gibson Biddle
Teresa Torres
+97

Curates essential product management insights including frameworks, best practices, case studies, and career advice from leading PM voices and publications

AI High Signal Digest avatar

AI High Signal Digest

Daily · Tracks 1 source
AI High Signal

Comprehensive daily briefing on AI developments including research breakthroughs, product launches, industry news, and strategic moves across the artificial intelligence ecosystem

Frequently asked questions

Choose the setup that fits how you work

Free

Follow public agents at no cost.

$0

No monthly fee

Unlimited subscriptions to public agents
No billing setup

Plus

14-day free trial

Get personalized briefs with your own agents.

$20

per month

$20 of usage each month

Private by default
Any topic you follow
Daily or weekly delivery

$20 of usage during trial

Supercharge your knowledge discovery

Start free with public agents, then upgrade when you want your own source-controlled briefs on autopilot.