We can't find the internet
Attempting to reconnect
Something went wrong!
Hang in there while we get back on track
Your intelligence agent for what matters
Tell ZeroNoise what you want to stay on top of. It finds the right sources, follows them continuously, and sends you a cited daily or weekly brief.
Your time, back
An AI curator that monitors the web nonstop, lets you control every source and setting, and delivers verified daily or weekly briefs.
Save hours
AI monitors connected sources 24/7—YouTube, X, Substack, Reddit, RSS, people's appearances and more—condensing everything into one daily brief.
Full control over the agent
Add/remove sources. Set your agent's focus and style. Auto-embed clips from full episodes and videos. Control exactly how briefs are built.
Verify every claim
Citations link to the original source and the exact span.
Discover sources on autopilot
Your agent discovers relevant channels and profiles based on your goals. You get to decide what to keep.
Multi-media sources
Track YouTube channels, Podcasts, X accounts, Substack, Reddit, and Blogs. Plus, follow people across platforms to catch their appearances.
Private or Public
Create private agents for yourself, publish public ones, and subscribe to agents from others.
3 steps to your first brief
Describe your goal
Tell your AI agent what you want to track using natural language. Choose platforms for auto-discovery (YouTube, X, Substack, Reddit, RSS) or manually add sources later.
Review and launch
Your agent finds relevant channels and profiles based on your instructions. Review suggestions, keep what fits, remove what doesn't, add your own. Launch when ready—you can always adjust sources anytime.
Sam Altman
3Blue1Brown
Paul Graham
The Pragmatic Engineer
r/MachineLearning
Naval Ravikant
AI High Signal
Stratechery
Sam Altman
3Blue1Brown
Paul Graham
The Pragmatic Engineer
r/MachineLearning
Naval Ravikant
AI High Signal
Stratechery
Get your briefs
Get concise daily or weekly updates with precise citations directly in your inbox. You control the focus, style, and length.
Artificial Analysis
Krea
Claude
Top Stories
Why it matters: the clearest signals today were about AI moving into persistent team workflows, richer voice interactions, and more urgent cyber planning.
Anthropic launched Claude Tag, turning Claude into a Slack teammate. Claude can join selected channels with access to chosen tools, data, and codebases for async task delegation . Anthropic says the Claude Code team has used it internally all year and that Claude now writes 65% of its product code; Tag is in beta for Claude Enterprise and Team plans . Karpathy called this the “3rd major redesign of LLM UIUX,” centered on persistent, asynchronous agents with org-wide context .
Speech AI moved closer to full conversational context. AssemblyAI launched Universal-3.5 Pro Realtime, which uses the agent side of a conversation as context for transcription . The company says one team cut error rates on critical utterances from 26% to 9% with that feature . At the same time, Artificial Analysis launched a Speech to Speech Index, with GPT-Realtime-2 leading overall and Grok Voice Think Fast 1.0 leading the agentic-performance subscore .
Cyber agencies shortened the AI risk timeline. The Five Eyes alliance warned organizations they have months, not years, to protect systems from accelerating cyber threats driven by frontier AI .
Research & Innovation
Why it matters: the most useful technical work today focused on making agents more realistic, systems code more measurable, and inference more efficient.
Qwen-AgentWorld introduced language world models for agentic simulation. The release includes 35B-A3B and 397B-A17B models described as the first language world models able to simulate agentic environments across seven domains, with “Decouple” and “Unify” strategies for applying them to agents .
ParallelKernelBench showed how far LLMs still are from reliable multi-GPU kernel generation. The benchmark covers 87 real problems from codebases including Megatron-LM, DeepSpeed, TensorRT-LLM, and NeMo-RL . Best zero-shot performance reached 28/87 correct, while an agentic compile-test-profile-revise loop improved Gemini 3 Pro from 24 to 35/87.
DFlash pushed speculative decoding forward on Blackwell GPUs. NVIDIA says the open-source block diffusion drafter can raise inference throughput by up to 15x while maintaining responsiveness . vLLM reported 4.4x–5.8x gains on Gemma-4 31B, with drop-in support via vLLM, SGLang, and TensorRT-LLM .
Products & Launches
Why it matters: open releases kept landing in practical categories teams can use now, from image generation to OCR to scientific tooling.
Krea 2 open weights shipped in two forms: Krea 2 Raw for fine-tuning and Krea 2 Turbo as a faster distilled model with broad aesthetic range . Krea also published the code, weights, and technical report, while Ostris added day-0 LoRA support and reported strong early fine-tuning results on a hard “omniface” concept .
Baidu open-sourced Unlimited OCR for long-document transcription. The model has 3B total parameters with 500M active, said it sets new SOTA on OmniDocBench v1.5/v1.6, and can transcribe 40+ pages in one forward pass using Reference Sliding Window Attention .
NVIDIA launched the BioNeMo Agent Toolkit for workflows such as protein structure prediction, docking, generative chemistry, and genomics, with Baseten making all 10 BioNeMo NIMs available on day one .
Industry Moves
Why it matters: platforms and labs are widening their moats through developer surfaces, research consolidation, and open-model infrastructure.
OpenAI highlighted the scale of its recent developer-platform expansion. The company says it shipped 30+ API models, features, and upgraded tools in the last six months, including GPT-5.5, GPT-Realtime-2, GPT-Image-2, new agent-building blocks, the OpenAI CLI, and Bedrock availability .
The UK consolidated five AI labs into the new BOLD Lab. BOLD says it is focused on beyond-backprop methods, human-centric learning, and embodied learning, with £30M in backing from UKRI and EPSRC .
Together AI pointed to a new scale marker for open-model production use. The company said 400T tokens now reflects real workload adoption, driven by frontier-quality open models, better token economics, and more control over inference .
Policy & Regulation
Why it matters: oversight is shifting from general debate toward concrete review and preparedness mechanisms.
- Reporting shared on X says the Trump administration is pressing Meta to join voluntary government model review, while OpenAI, Anthropic, Google, xAI, and Microsoft have already agreed .
Quick Takes
Why it matters: these smaller updates still point to where deployment and tooling are heading next.
- OpenHands open-sourced a verification stack that cut time-to-merge by 58% on its own repo and sped production PR merges 2.4x without lowering quality .
- Spellbook Labs reviewed 60,000 pages of SEC-filed contracts with AI and says 60% contained mistakes such as missing definitions or broken references .
- OpenAI DevDay 2026 applications are open for September 29 in San Francisco, with DevDay Exchanges planned for eight additional cities .
- Hugging Face says public robotics datasets grew from 1,000 in early 2025 to 60,000 today, with correctly configured streaming reaching about 1,326 MB/s.
Artificial Intelligence (AI)
Yann LeCun
1) Funding & Deals
- Probook: a16z said it led Probook’s $34M Series A, while Probook separately announced $40M in funding from a16z and Sequoia . The company’s thesis is to build an AI operating system for home services around dispatch first, then expand into intake, data scrubbing, messaging, and outbound, rather than forcing operators to stitch together point solutions . The founders’ backgrounds line up closely with the problem: George and Ben both grew up in the trades, and George says he spent a summer inside a $40M HVAC shop before building the product . The operating metrics cited by a16z are strong for this stage: Summers Plumbing booked 2,542 jobs in its first month with zero human intervention, Anthony PHCE ran 20% more revenue per job on a 50% leaner team, and Del-Air moved from 10 to 22 techs per dispatcher.
2) Emerging Teams
Momentic: the company launched an AI QA agent that pulls product context from Linear tickets, Notion PRDs, and PRs. In the past few weeks, its agents reportedly analyzed 70k+ test failures, created 600 tests, and reached a 73% PR merge rate. The customer list cited includes Notion, Xero, Webflow, Retool, Runway, and Bilt. Dalton Caldwell called it a "big new launch" for teams building software with AI tools .
Linzumi: YC is pushing a coordination layer for AI coding teams: shared chat threads for entire teams plus dozens of AI coding agents, with the product positioned to keep the fleet coordinated and unblocked . Garry Tan described it as "Codex but actually multiplayer" and said it is "magical for teams". Founder Sean Grove previously worked at OpenAI on reducing sycophancy in ChatGPT. The company is also using its Wafer partnership to offer high-speed access to GLM 5.2.
TubeTube: two founders turned an internal AI video pipeline into a SaaS product after using it to grow a YouTube channel to nearly 100k subscribers and roughly 6M monthly views. The product takes a script or one-line idea through voiceover, music, scene images, animation, and final cut in about 5-10 minutes, lets users choose models at each step, and uses pay-per-use credits . It is currently in a private beta looking for 5-10 testers.
VoxFlow: Robin, a 19-year-old student entrepreneur in the Netherlands, is building an AI receptionist for SMBs that handles inbound calls in natural Dutch, books appointments into Google Calendar or Outlook, and escalates urgent cases to humans . The product is live with first customers . The founder’s early lesson is practical: sales convert better when framed around missing fewer calls, and the product is not a fit for businesses with very low call volume .
3) AI & Tech Breakthroughs
- LeCun’s new company is pursuing world-model planning rather than pure autoregression. Yann LeCun said his new company is building systems that use a world model to predict the effect of imagined actions and search for action sequences that accomplish a task, which he called "objective driven AI". He also argued that current autoregressive models do not work well for real-world video prediction because predicting every detail, or a full distribution over all possible video features, is mathematically intractable . In place of that, he described an architecture meant as a replacement for GPT/ChatGPT by predicting abstract representations rather than every detail .
"If an AI system has such a model, it can imagine what the effect of the sequence of action would be on the world... I call this objective driven AI."
Self-Harness: a new paper highlighted by Harrison Chase describes agents that improve over time by shaping their own harnesses . The loop has three parts: weakness mining from traces, harness proposals, and proposal validation through regression testing before acceptance . The work builds on DeepAgents.
AutoFlow Research Initiative: this very early startup is building systems to verify claims produced by AI instead of only generating answers . The first prototype is aimed at finance, including revenue growth calculations, financial ratio validation, cross-document consistency checks, balance sheet reconciliation, and earnings statement verification. The project has been accepted into NVIDIA Inception, is building its first prototype, and is already reaching out to pre-seed investors and technical collaborators across ML, formal verification, distributed systems, C++, mathematics, and governance research .
4) Market Signals
- AI-built MVPs are creating a post-raise technical debt problem. One operator on r/SaaS says 6 or 7 founders since January have surfaced the same pattern: products assembled through tools like Claude or Cursor, funded at $800K-$2M, then stalled when new engineers tried to extend codebases with no documentation, no tests, and no architecture. The argument is that a few months of engineering burn on an unmaintainable codebase can be more expensive than rebuilding, and that some founders are already planning for a rebuild as a normal post-raise phase .
"Raise money then validate the product and then rebuild the codebase."
Context coordination is emerging as a product wedge. Across several of the stronger launches, the differentiator is not just a model but control of shared context and task routing: Probook is built around dispatch, Linzumi keeps teams and AI coding agents in the same threads, and Momentic grounds its QA agent in tickets, PRDs, and PRs .
Applied AI still sells on operational ROI, not AI novelty. VoxFlow’s early sales feedback is that buyers respond to "will I miss fewer calls?", not the underlying technology, and that the ROI breaks down for businesses only getting around 20 calls a month.
Investor attention remains broad across applied AI categories. Sarah Guo’s latest startup event highlighted teams in web search for LLMs, customer experience agents, and AI in email, while noting additional projects in stealth across developer infrastructure, defense, and finance.
5) Worth Your Time
- Yann LeCun — Fireside Chat on Open Source & AI — first-person explanation of the new company’s world-model thesis, the "objective driven AI" framing, and the industrial application set he discussed .
Self-Harness paper and DeepAgents — useful if you are tracking self-improving agent infrastructure and the move toward regression-tested harness updates .
Linzumi YC launch page — a clean snapshot of the coordination layer forming around AI coding fleets .
Probook’s a16z announcement thread — details the dispatch-first thesis and the customer efficiency metrics behind the Series A .
r/SaaS thread on post-raise rebuilds — lays out the argument that funded, AI-built MVPs are forcing earlier rebuild decisions than many founders expected .
Philipp Schmid
Logan Kilpatrick
Yann LeCun
Agents moved further into production
Today’s strongest theme was AI moving out of the standalone chat box and into APIs, runtimes, and team workflows .
Google makes its Gemini agent API generally available
Google’s Interactions API is now generally available as a single API for Gemini models and agents, with async background=True, multimodal tool use and combination, an isolated remote Linux sandbox via Antigravity Agent, and dedicated coding skills . Separately, Google says users created more than 1,000,000 native Android apps directly in AI Studio in the last month .
Why it matters: This looks like a deliberate push to make agent building a mainstream application workflow rather than a specialized demo stack.
Claude joins Slack as a teammate
Claude Tag lets Claude join Slack teams with access to the channels and tools a company selects, so teams can tag it into tasks and let it work asynchronously inside existing conversations .
"Imo this is the 3rd major redesign of LLM UIUX ... a self-contained, persistent, asynchronous entity with org-wide tools and context, working alongside teams of humans."
In separate commentary on enterprise coding systems, Andrej Karpathy said deeply integrated, multiplayer AI can make it feel like "everyone is a manager" and even "I work from Slack now" .
Why it matters: The interface frontier is shifting from standalone chat surfaces to systems embedded in the team’s operating environment.
Enterprise runtimes are shipping with more structure—and more guardrails
NVIDIA launched an open, modular Agent Toolkit combining Nemotron open models, NemoClaw blueprints for safer behavior, and the OpenShell runtime, with examples spanning life sciences, cybersecurity, and chip design workflows . Google DeepMind researcher Nenad Tomašev said the field is concentrating heavily on coding agents because software is easier to verify, but argued human oversight is still necessary because agents are not 100% accurate and face automation bias, prompt-injection-style "agentic traps," and cognitive monoculture risks .
Why it matters: Enterprise agent adoption is increasingly about the surrounding harness—runtime, permissions, verification, and safety—not just the base model.
Coding agents are getting harder to evaluate
DeepSWE aims at real software work, not benchmark contamination
DeepSWE is a new contamination-free benchmark with tasks written from scratch across 91 repositories and five languages, using hand-written verifiers that test software behavior rather than implementation details . Its tasks require roughly 5.5x more code and about 2x more output tokens than prior benchmarks, and the project is open-source on GitHub.
Why it matters: Better benchmarks are becoming necessary as coding agents move into production. In parallel, François Chollet warned that unnecessary code can mechanically compound in agentic coding, turning complexity into a tax on every future change .
The deployment stack keeps consolidating
NVIDIA and AWS push further into production-scale AI
NVIDIA and AWS expanded their partnership across Amazon OpenSearch and EC2, targeting low-latency inference, vector search, and scalable GPU infrastructure for production deployments . In AWS, OpenSearch Serverless now defaults to GPU-accelerated vector indexing via NVIDIA cuVS with up to 10x faster indexing at one-quarter the cost, while new EC2 G7 instances promise up to 4.6x AI inference performance versus G6 and AWS has reached NVIDIA Exemplar Cloud status for GB300 training workloads .
Why it matters: This sits inside a broader concentration of infrastructure advantage: NVIDIA technology now powers more than 400 of the TOP500 supercomputers—81% of the list—and nearly 90% of new entries .
One research debate worth watching
LeCun argues real-world AI will need something beyond next-token prediction
Yann LeCun said GPT-style autoregressive systems work well on discrete symbols like language and code but run into a fundamental problem when asked to predict video or physical states, because the space of possible futures becomes mathematically intractable . He presented JEPA as a non-generative architecture based on abstract representations, and said world models built on top of it could enable objective-driven planning for industrial systems such as power plants, jet engines, and patient treatment planning .
Why it matters: Even as current agent products mature, influential researchers are still arguing that the long-term path to real-world AI may require a different architecture. In separate commentary, François Chollet said today’s stack remains several orders of magnitude inefficient and argued symbolic learning is the route to near-optimal AI .
LangChain
ClaudeDevs
cat
🔥 TOP SIGNAL
Anthropic's Claude Tag is the clearest workflow shift today: Boris Cherny says the Slack-native agent launched today, and internally it's already used to write PRs, address user feedback, investigate incidents, run data analyses, and answer company-knowledge questions; he says 65% of the product team's new code is created by Anthropic's internal version . Anthropic engineer @_catwu says the internal system merges 65% of product PRs, and Andrej Karpathy's reaction is that this is not Slack-wrapped RAG but a deeply integrated, multiplayer coding-agent product that changes how teams work .
"it’s not some LLM Q&A with RAG over Slack... it’s a different way of working entirely... I work from Slack now."
⚡ TRY THIS
Run incident response in the same thread the humans use. @_catwu's flow: when the page lands, tag Claude in the incident thread; it pulls graphs, diffs the deploy, identifies root cause, and tags the author. The team approves in-thread, then Claude opens the fix, lands it, watches the metric recover, and resolves the page .
Use a per-thread agent as both search and executor. Boris Cherny's setup starts with
@.Claudein a Slack channel. Each thread gets its own sandbox, memory, and permissions; Claude can clone repos, write/test/compile code, answer questions likeWhat’s the status on X?orWho owns this service?, proactively watch channels, draft PRs, and react with✅/❌when a thread is resolved .Steal LangChain's closed-loop improvement pattern. The durable sequence is simple: (1) run the agent and mine weaknesses, (2) propose harness improvements, (3) confirm the changes help without regressions, then loop . LangSmith Engine turns that into an operational workflow: build the agent, test with datasets/evals, deploy + monitor traces/online evals, then patch, regress-test, redeploy; when traces show errors, eval failures, negative feedback, or new bad behaviors, Engine clusters them into issues and drafts targeted PRs plus new tests/evals for review .
For messy browser/API exploration, have the agent build the harness first. Simon Willison used Claude Code for web to build a small playground UI for testing OPFS + Pyodide behavior across browsers—a good pattern when you need a disposable test bench before committing to product code .
📡 WHAT SHIPPED
Claude Tag (Slack launch) — Launched today; tag Claude in-channel and each thread gets its own sandbox, memory, and permissions. Cherny says it can proactively monitor channels, and Anthropic built security layers around model training, classifiers/auto mode, access controls, and channel/workspace boundaries. More: introducing-claude-tag
LangSmith Engine (available now) — Connect a tracing project and Engine will inspect traces, cluster repeated failures into issues, draft targeted PRs, and propose new examples/evals for the test suite. LangChain says teams like Vanta, Campfire, and Cogent are already catching regressions earlier and cutting triage time .
Cursor customize updates — Plugins can now ship prebuilt canvases (example: Atlassian canvas for live issues/projects/docs); Cursor also added a team leaderboard for popular plugins/skills/MCPs with one-click add, and extended team marketplaces to GitLab, Bitbucket, and Azure DevOps alongside local repos. Changelog: cursor.com/changelog/customize
Google Interactions API GA — One API for Gemini models and agents, with dedicated coding-agent skills, the Antigravity Agent remote Linux sandbox, multimodal tool use, and
background=Truefor async long-running interactions .Notable comparison: GLM 5.2 vs Opus 4.8 — Jason Zhou shared a same-prompt, same-reference-image frontend test where both models produced a working Three.js logistics dashboard. He also cites GLM 5.2 pricing at $1.40 / 1M input and $4.40 / 1M output, with Opus about 5x more expensive.
OSS agent infra worth a look — In Matthew Berman's roundup, Deer Flow (~74k GitHub stars) stood out as a long-horizon harness built around sub-agents, memory, sandboxes, and skills; Codebase Memory MCP (~12k) claims Linux-kernel-scale indexing in 3 minutes with sub-millisecond structural queries and 120x fewer tokens; Skill Specter (<10k) scans skills for 65 vulnerability patterns before install .
🎬 GO DEEPER
- 1:36–2:19 — LangSmith Engine's trace → issue → PR loop. Best short walkthrough today of a real agent-improvement pipeline: watch traces, cluster patterns, draft the fix, add evals, then keep monitoring after merge .
- 6:20–7:26 — Codebase Memory MCP on huge repos. If your agents keep burning tokens just to understand code structure, this is the pitch to examine: Linux kernel indexed in 3 minutes, structural queries in under 1 ms, 158 languages, 11 harnesses .
- Study Deep Agents alongside LangChain's loop engineering write-up. The repo matters, but the bigger takeaway is the pattern: weakness mining, harness edits, regression checks, repeat .
Editorial take: the serious setups are starting to look the same—put the agent inside the real workstream, give it bounded tools and memory, then close the loop with traces, tests, and human approval.
sarah guo
Lenny's Newsletter
Reid Hoffman
What stood out
Three recommendations passed the authenticity filter today, and they fit together well. Each is about how better judgment gets formed: through open networks, through emotional clarity, and through practical lessons on consumer products and investing.
Most compelling recommendation
Regional Advantage
- Content type: Book
- Author/creator: AnnaLee Saxenian
- Link/URL: Not provided in the source notes
- Who recommended it: John Lilly, in conversation with Reid Hoffman
- Key takeaway: Lilly recommended the book for its explanation of why Silicon Valley benefited from intense intermixing while Boston's Route 128 companies were more insular, with people eating and staying inside their own firms rather than mixing more broadly
- Why it matters: This was the strongest pick because it came with a concrete mechanism for how ecosystems either accelerate or limit innovation
Two more worth saving
Descartes' Error: Emotion, Reason, and the Human Brain
- Content type: Book
- Author/creator: Antonio Damasio
- Link/URL:https://www.amazon.com/Descartes-Error-Emotion-Reason-Human/dp/014303622X
- Who recommended it: Joe Hudson, as cited in Lenny's Newsletter
- Key takeaway: The recommendation was tied to a specific neuroscience claim: our choices are fundamentally emotional, and feelings act as the "context-setters" on which rational thinking operates; when emotional states are avoided or unclear, the set of visible solutions narrows
- Why it matters: The surrounding discussion connects this directly to AI-era decision-making: models can advise, but they cannot feel the subtle tension in a room or register when something feels off
"our choices are fundamentally emotional."
Article by Josh Elman (title not provided in the source notes)
- Content type: Article
- Author/creator: Josh Elman
- Link/URL:http://x.com/i/article/2069415679493840896
- Who recommended it: Sarah Guo
- Key takeaway: Guo said she learned a lot from Elman about consumer products and investing, and that founders will too
- Why it matters: The endorsement was unusually direct and specific, tying the piece to two core startup topics rather than offering a generic share
"Josh was a great partner to me and I learned a lot from him about consumer products and investing! Founders will too."
If you only save one
Save Regional Advantage. It had the clearest combination of conviction and explanation: not just a title drop, but a concrete lens for understanding how openness and cross-company mixing affect innovation
Product Management
The community for ventures designed to scale rapidly | Read our rules before posting ❤️
John Cutler
Big Ideas
AI is raising the value of judgment, not removing it. John Cutler argues that creating an OKR is intellectually simple; the hard part is building judgment through discussion with other people, supported by AI, rather than box-checking forms . Hiten Shah makes the same point from another angle: AI-generated work can look clean and shippable yet still miss the product point, so taste and design judgment need to stay in the loop while code is changing . Why it matters: faster drafting and coding shift PM leverage toward critique, framing, and decision quality. Apply it: keep review loops close to live work and treat AI outputs as proposals to pressure-test, not decisions to accept.
Product context is becoming a bottleneck. In small SaaS teams, tickets and PRs often still need someone to reconnect the customer problem, expected behavior, edge cases, prior decisions, review concerns, and open release questions before work can ship . The core issue is that product context often does not travel once it leaves the PM or founder’s head . Why it matters: even when production speeds up, missing context still slows review, QA, and release. Apply it: map where your team repeatedly has to “rebuild” context and use that as the signal for process fixes.
Delight starts with choosing who you serve. Teresa Torres highlights Petra Wille’s view that you cannot design an experience that makes everyone happy; instead, design for choice, niche audiences, delight, and even awe when the extra investment is justified . Why it matters: this is a reminder that better product design is not generic polish; it is intentional design for a specific audience. Apply it: when evaluating a feature, ask who it is really for and whether the added effort creates differentiated value for that group.
Tactical Playbook
Share heuristics, not just priority lists. Cutler describes writing down internal scoring heuristics in plain language and sharing them as a markdown file, including ideas that score well and poorly, so teammates can understand and iterate on the thinking . How to apply: write 3-5 criteria in story form, add examples of ideas that pass and fail, then let teammates test their own ideas against the heuristic.
Use AI to build rubrics where you are not the expert. For survey design, Cutler asks AI to assemble a research-backed heuristic, reviews the work himself, compares his assessment with AI’s output, and then refines the response . How to apply: first generate a rubric from existing research, then do your own pass, compare gaps, and only then decide.
Use AI with teams, not only solo. Cutler cites research showing individuals paired with AI can approach team-level output, while teams paired with AI can produce up to 3x outcomes . How to apply: bring AI into live workshops to run scenarios, ask disconfirming questions, and keep momentum instead of sending someone off for hours or days of research.
Case Studies & Lessons
Discovery message shift: from selling to learning. One B2B AI accounting startup got nowhere with 1,000 cold emails and 100 InMails, receiving only “not interested” replies. It switched to plain LinkedIn connection requests asking ICPs for advice, kept the exchange to three validation questions, and got about 18 replies, 7 video calls, 2 referrals, and 1 buyer call from roughly 60 outreach attempts . Takeaway: discovery improves when the conversation is about the customer’s problem and purchase intent, not your pitch .
From team intuition to repeatable coaching. Cutler describes recording a live objection-handling role play, turning it into a first guide, grading examples against best practices, and then building an AI skill that analyzes calls and suggests improvements in Slack . Takeaway: useful AI systems often start by capturing how the team actually works, then turning that into a repeatable feedback loop.
Career Corner
- Expect sharper AI demands in PM roles—and screen for them. Community signals show companies increasingly expect PMs to accelerate work with AI, audit AI-generated requirements that can be “confidently wrong,” and sometimes even “vibe code” functional prototypes . In weaker setups, leaders may assume six months of work can be compressed into six weeks, or demand large backlogs with no onboarding and “just magically use AI to do it all” . How to apply: in interviews and new roles, clarify whether AI output still needs formal review and sign-off, how much ramp time exists, and whether prototype building is part of the PM remit.
“Ask, ‘What can I do about it? What is within my control?’”
That advice, from Anthropic’s Nerdi Yogi, is Lenny Rachitsky’s featured antidote to AI-related fear and change resistance . For PMs, the practical version is to focus on the skills and responsibilities you can deliberately strengthen as expectations shift.
Tools & Resources
- AI 101 for PMs is a free, open-source course built by a PM to make concrete AI issues easier to understand, including latency, model regressions, hallucinations, and multilingual cost tradeoffs. It includes 25 concepts across 4 chapters, interactive in-browser widgets, no signup, and offline/local progress support . Course: https://trippinwithpuneet.github.io/AI-101-for-PMs/ Code: https://github.com/trippinwithpuneet/AI-101-for-PMs
Start with signal
Each agent already tracks a curated set of sources. Subscribe for free and start getting cited updates right away.
Coding Agents Alpha Tracker
Elevate
Latent Space
Daily high-signal briefing on coding agents: how top engineers use them, the best workflows, productivity tips, high-leverage tricks, leading tools/models/systems, and the people leaking the most alpha. Built for developers who want to stay at the cutting edge without drowning in noise.
AI in EdTech Weekly
Luis von Ahn
Khan Academy
Ethan Mollick
Weekly intelligence briefing on how artificial intelligence and technology are transforming education and learning - covering AI tutors, adaptive learning, online platforms, policy developments, and the researchers shaping how people learn.
VC Tech Radar
a16z
Stanford eCorner
Greylock
Daily AI news, startup funding, and emerging teams shaping the future
Bitcoin Payment Adoption Tracker
BTCPay Server
Nicolas Burtey
Roy Sheinbaum
Monitors Bitcoin adoption as a payment medium and currency worldwide, tracking merchant acceptance, payment infrastructure, regulatory developments, and transaction usage metrics
AI News Digest
Google DeepMind
OpenAI
Anthropic
Daily curated digest of significant AI developments including major announcements, research breakthroughs, policy changes, and industry moves
Global Agricultural Developments
RDO Equipment Co.
Ag PhD
Precision Farming Dealer
Tracks farming innovations, best practices, commodity trends, and global market dynamics across grains, livestock, dairy, and agricultural inputs
Recommended Reading from Tech Founders
Paul Graham
David Perell
Marc Andreessen 🇺🇸
Tracks and curates reading recommendations from prominent tech founders and investors across podcasts, interviews, and social media
PM Daily Digest
Shreyas Doshi
Gibson Biddle
Teresa Torres
Curates essential product management insights including frameworks, best practices, case studies, and career advice from leading PM voices and publications
AI High Signal Digest
AI High Signal
Comprehensive daily briefing on AI developments including research breakthroughs, product launches, industry news, and strategic moves across the artificial intelligence ecosystem
Frequently asked questions
Choose the setup that fits how you work
Free
Follow public agents at no cost.
No monthly fee