We can't find the internet
Attempting to reconnect
Something went wrong!
Hang in there while we get back on track
Your intelligence agent for what matters
Tell ZeroNoise what you want to stay on top of. It finds the right sources, follows them continuously, and sends you a cited daily or weekly brief.
Your time, back
An AI curator that monitors the web nonstop, lets you control every source and setting, and delivers verified daily or weekly briefs.
Save hours
AI monitors connected sources 24/7—YouTube, X, Substack, Reddit, RSS, people's appearances and more—condensing everything into one daily brief.
Full control over the agent
Add/remove sources. Set your agent's focus and style. Auto-embed clips from full episodes and videos. Control exactly how briefs are built.
Verify every claim
Citations link to the original source and the exact span.
Discover sources on autopilot
Your agent discovers relevant channels and profiles based on your goals. You get to decide what to keep.
Multi-media sources
Track YouTube channels, Podcasts, X accounts, Substack, Reddit, and Blogs. Plus, follow people across platforms to catch their appearances.
Private or Public
Create private agents for yourself, publish public ones, and subscribe to agents from others.
3 steps to your first brief
Describe your goal
Tell your AI agent what you want to track using natural language. Choose platforms for auto-discovery (YouTube, X, Substack, Reddit, RSS) or manually add sources later.
Review and launch
Your agent finds relevant channels and profiles based on your instructions. Review suggestions, keep what fits, remove what doesn't, add your own. Launch when ready—you can always adjust sources anytime.
Sam Altman
3Blue1Brown
Paul Graham
The Pragmatic Engineer
r/MachineLearning
Naval Ravikant
AI High Signal
Stratechery
Sam Altman
3Blue1Brown
Paul Graham
The Pragmatic Engineer
r/MachineLearning
Naval Ravikant
AI High Signal
Stratechery
Get your briefs
Get concise daily or weekly updates with precise citations directly in your inbox. You control the focus, style, and length.
Boris Cherny
Salvatore Sanfilippo
Romain Huet
🔥 TOP SIGNAL
- Stop bolting agents onto the old workflow; put them at the center. Boris Cherny says he ships 20-30 PRs/day by running five Claude agents in parallel, has not written a line of code in over six months, and often leaves hundreds or even thousands of agents running 5-20 hours overnight; he also describes Claude as being at the center of everything inside Anthropic, where the terminal prototype spread quickly across engineering . Romain Huet shows the same operating model in Codex: a plain-English 7am prompt becomes a pinned automation that pulls Slack, Gmail, and calendar context, surfaces top priorities, and drafts emails, while Greg Brockman shared Codex analyzing Slack history and reorganizing channels via computer use into a reusable automation .
⚡ TRY THIS
Build a pinned 7am operating brief in Codex (Romain Huet). Start by connecting the tools you use every day; Codex has 120+ plugins packaging skills, MCPs, and direct access to tools like Slack, Gmail, and calendar . Then use Huet's pattern:
every morning at 7am, pull all of the context from my Slack, from my Gmail, from my calendar. Give me a full brief of my day, my top five priorities. Point me to what's most critical and urgent ... and draft some of these emails.Codex turns that into a scheduled daily automation, and you can pin the thread so the same briefing surface is waiting every morning .Add a pre-merge review agent before every PR lands (Peter Steinberger). Steinberger says
autoreviewhas been the most impactful skill in his stack after crabbox.sh: it automatically reviews code before landing a PR, finds edge cases, and sometimes runs for hours . If you want a concrete starting point, study the skill spec and port the pattern into your own setup: https://github.com/openclaw/agent-skills/blob/main/skills/autoreview/SKILL.md.Tune the operating context before you blame the model. Theo says GPT-5.5 only clicked for him after about two months, prompting entirely differently, and spending time on
agents.md; now he says he cannot really use another model for code . Practical routing on top of that: Huet defaults Codex/GPT-5.5 to Medium for most tasks and only uses Extra High for one-off critical refactors, while Matthew Berman's rule is to use frontier models for hardcore upfront planning and cheaper workhorse models for the actual code writing .Close the loop on agent fixes. Palash Shah describes self-optimizing agents as systems that observe their own outputs, evaluate them, and use that signal to improve future performance . The concrete LangSmith Engine recipe: auto-triage feedback on traces, attach an online evaluator to every suggested fix so you do not regress, generate offline evals for your test suite, and keep learning from user preferences over time .
📡 WHAT SHIPPED
DeepSWE benchmark launched. Theo calls it the first coding benchmark that actually aligns with how it feels to use these models for coding, and says it exposes much larger divergence than public leaderboards suggest. His read: the gap between official harnesses and simple agent harnesses says a lot, and one comparison showed Gemini 3.5 Flash costing more than GPT-5.5 while scoring about half as high .
Field test from Salvatore Sanfilippo. The Redis creator says GPT-5.5 implemented kernels for his MIMO inference system from directions he provided; compared with his handwritten
namasi++version, the AI-generated implementation ran 2x faster and did not crash past larger context sizes. His current model is mixed: full AI implementation on some projects, line-by-line assist on others, and small manual programs reserved for aesthetics, learning, and new ideas .Codex keeps adding practical surface area. Huet says the app now has 120+ plugins, sandboxed auto review with permission escalation, local or cloud projects, an in-app browser for pointing at UI changes, and a Codex tab inside the ChatGPT app that launched last week for continuing desktop work from the phone . Theo's recent feature callouts: locked-Mac computer control, a double-Command hotkey to capture the current screen into context, and diff marker settings . Adoption signal: Simon Willison highlighted
@openai/codexNPM installs growing from about 100,000/day in January to over 1 million/day now .Rastermill released for Node agents. Peter Steinberger extracted image logic into a separate library to stop small hacked images from crashing the process; it is built with Wasm and Rust for speed. Link: http://rastermill.com.
Undocumented-feature field report. Theo says he added a new Lakebed feature without announcing it; by morning, Sherlock's agent had discovered it and shipped a curation app on top of it, which Theo says was used perfectly to spec. App: https://badlogic-list.lakebed.app/ with RSS included .
🎬 GO DEEPER
- 16:12-18:29 — Romain Huet on background computer use. Best clip today for understanding parallel desktop work: Codex clicks through a simulator, checks for obvious issues, and can create a Reminder without taking over the user's cursor .
- 22:25-24:48 — Theo on verification vs token burn. Good watch if you are choosing between local-machine verification and cloud swarms: his core argument is that computer-use verification in a real environment can beat simply spinning up more sub-agents and burning tokens .
- 27:54-28:29 — Boris Cherny on making Claude the default interface. Short but important: codebase questions, expenses, and even holiday lookups all route through Claude when the workflow is redesigned around the agent .
- Repo to study —
autoreview. If you only inspect one artifact today, make it Peter Steinberger's pre-merge review skill spec: https://github.com/openclaw/agent-skills/blob/main/skills/autoreview/SKILL.md.
Editorial take: the edge is moving from model shopping to workflow design — persistent context, pre-merge review, and self-evaluating loops beat raw token burn.
Serena Ge (Datacurve)
Eric Wu
Funding & Deals
- Perceptic announced a $12M round led by Air Street and Accel, with participation from Elder Gull and angels from AI labs, to operationalize frontier AI systems for biopharma . The founding team includes CEO Tilman after a seven-year stint at Palantir, plus Palantir AIP veterans Martin Copes and Zaki Trache . The company says it is already live in top-20 pharma accounts speeding up critical workflows, and Nathan Benaich framed the timing around biopharma’s increased appetite for frontier AI and AI labs turning more attention toward science .
- Lucis raised a $20M Series A to expand its AI-driven preventive health platform in Europe . YC says the company has served 10,000+ customers across France, the UK, Ireland, and Portugal, delivered 1M+ biomarker tests, and that 75% of users who completed a six-month follow-up improved three or more markers without medication .
Emerging Teams
- Superset is an open-source IDE for developers to run hundreds of agents in parallel. YC says it has grown 30% week over week for the last four months and is helping engineers ship 10x more PRs.
- Appnigma is attacking Salesforce implementation pain by generating native managed packages from natural language. The founders say it compresses 3-6 months of development into days and outputs 100% native Salesforce metadata. They are ex-Salesforce, including 3.5 years on the AppExchange review team, and say customers include Pylon, Warmly, UserEvidence, and roughly 85% YC companies.
- Alchemize is building an AI-native customs brokerage that promises importers real-time regulatory clarity and shipment clearance in minutes instead of days. YC highlighted the launch from founders Samuel and Robert.
- NavigateAI gives Eric Wu a new wedge in labor-constrained physical work. The company says it wants to give every field worker an AI copilot to address skilled-worker shortages so crews can build faster and better. Keith Rabois amplified the launch, calling it “Super cool” and emphasizing faster, lower-cost building .
AI & Tech Breakthroughs
- DeepSWE is positioned as a new benchmark for agentic coding. Its creators say it shows where top models actually diverge in realistic developer workflows, and Garry Tan called it the new standard for engineering evals.
- GBrain + ActiveGraphAI outline a more auditable agent architecture. The system pairs a durable markdown/git knowledge substrate with an event-sourced runtime, then wraps operations as typed events, projects retrieved evidence into graph state with stable citation tokens, adds an explicit propose/approve/apply flow, and forks before mutation so agents can trace recommendations to specific evidence and runs .
- StableBrowse is a concrete attempt to improve agent browsing economics. YC says its browser layer lets agents navigate the web with 70% fewer tokens and 3-4x faster execution.
- YourMemory is a notable open-source experiment in persistent AI memory. Its author says the system adds time-aware retrieval and memory decay based on a modified Ebbinghaus forgetting curve, tested it on LongMemEval, and runs it locally with a visualization dashboard .
Market Signals
- Compute still looks supply-constrained rather than speculative. In a 20VC interview, Cerebras’ CEO said data centers cannot be built fast enough, the company has a $25B backlog, and Nvidia, AMD, and others also have backlogs . He argued the industry is building behind demand, not ahead of it . In the same interview, he said HBM shortages could last several years because fab capacity is lumpy and slow to add, and that electricity may become the longer-term limiting factor .
- Compliance is starting to look like a real AI software category. a16z says AI may finally be moving from good enough to pilot to good enough to trust in compliance, noting that many LLMs now score 80-100% on LegalBench’s 162 legal reasoning tasks. The firm argues compliance is essentially applied legal reasoning and affects every dollar moving through a business .
“As AI clears the ‘good enough to trust’ bar and sales cycles speed up, there may finally be an opening for startups.”
- AI is beginning to replace legacy SaaS, not just sit on top of it. Harry Stebbings highlighted a CEO saying the company replaced a $600K Salesforce contract with a vibe-coded CRM built in three weeks, expects to eliminate 80% of its internal SaaS stack, and would not change Anthropic usage even if pricing doubled .
- On-device inference is moving from theory to production design choice in B2B SaaS. A founding team at an on-device AI infrastructure company says its system handles 60-80% of requests locally with cloud fallback, especially for bounded tasks like transcription, summarization, and classification on modern hardware .
- Agent governance is hardening into an infrastructure layer. GBrain + ActiveGraphAI describe replay, stable citations, controlled write-back, and fork-before-mutation workflows . Synapsor is built around governed memory, staged writes, replay, permissions, and audit trails for agents touching business systems . Nexus Synapse makes a similar argument that runtime governance, not just the model, is the missing layer around AI systems .
Worth Your Time
- 20VC x Cerebras CEO: useful primary source on AI infrastructure bottlenecks, HBM scarcity, and the argument that slow inference has no real market. YouTube interview
- a16z on compliance: useful if you are mapping where AI may move from pilots into trusted production software. Essay
- DeepSWE release thread: worth reviewing if you diligence coding agents or developer infra and want a benchmark tied to realistic workflows. X thread
- Superset launch page: useful background on an open-source IDE that YC says has grown 30% week over week for four months. YC launch
- GBrain x ActiveGraphAI thread: useful diligence material if you care about evidence-linked, replayable agent systems. X thread
Bill Gurley
Tim Ferriss
Palmer Luckey
What stood out
The strongest single recommendation was Paul Graham's Maker's Schedule, Manager's Schedule. Tim Ferriss tied it to a precise operating rule: creative work needs uninterrupted 3-5 hour blocks, plus enough slack for multi-day synthesis, and he protects 3-4 mornings per week in maker mode until at least 1pm .
The other major signal was Lenny Rachitsky's book list for product builders. He organized it by job-to-be-done, limited himself to books he had completed, and mostly favored books more than 10 years old because they had stood the test of time .
Most compelling recommendation
Maker's Schedule, Manager's Schedule
- Content type: Essay
- Author/creator: Paul Graham
- Link/URL:https://www.paulgraham.com/makerschedule.html
- Who recommended it: Tim Ferriss
- Key takeaway: Ferriss said great creative work is hard to do in fragmented 30-45 minute windows; it needs uninterrupted 3-5 hour blocks and enough slack for multi-day "CPU-intensive synthesis"
- Why it matters: This was the clearest recommendation in the batch because it came with a concrete scheduling practice he already uses: 3-4 maker mornings per week until at least 1pm
"Large, uninterrupted blocks of time—3-5 hours minimum—create the space needed to find and connect the dots."
A compact starting stack from Lenny Rachitsky's broader book list
Lenny's full list spans multiple categories, from communication skills to sales and marketing. If you want a smaller starting point, these nine titles give you one book per category from that broader map .
- Nobody Wants to Read Your Sht* — Content type: Book; Author/creator: Steven Pressfield; Link/URL:https://www.amazon.com/Nobody-Wants-Read-Your-Tough-Love/dp/1936891492/ref=tmm_pap_swatch_0; Who recommended it: Lenny Rachitsky; Key takeaway: one of his three picks for communication skills in a list optimized for books that have stood the test of time; Why it matters: it puts writing and communication at the front of the product-builder skill map
- The Great CEO Within — Content type: Book; Author/creator: Matt Mochary; Link/URL:https://www.amazon.com/Great-CEO-Within-Tactical-Building/dp/0578599287/ref=tmm_pap_swatch_0; Who recommended it: Lenny Rachitsky; Key takeaway: one of his three picks for executing; Why it matters: it signals execution as a distinct reading track, not just an output of strategy
- Good Strategy/Bad Strategy — Content type: Book; Author/creator: Richard Rumelt; Link/URL:https://www.amazon.com/Good-Strategy-Bad-Difference-Matters/dp/0307886239/ref=tmm_hrd_swatch_0; Who recommended it: Lenny Rachitsky; Key takeaway: one of his three picks for strategy; Why it matters: it gives strategy its own dedicated place in a practical operator reading list
- Build — Content type: Book; Author/creator: Tony Fadell; Link/URL:https://www.buildc.com/the-book; Who recommended it: Lenny Rachitsky; Key takeaway: one of his three picks for inspiration to build; Why it matters: it shows that founder motivation and building craft belong in the same learning stack
- High Output Management — Content type: Book; Author/creator: Andy Grove; Link/URL:https://www.amazon.com/High-Output-Management-Andrew-Grove/dp/0679762884/ref=tmm_pap_swatch_0; Who recommended it: Lenny Rachitsky; Key takeaway: one of his three picks for management; Why it matters: it identifies management as a separate discipline from leadership or product judgment
- Amp It Up — Content type: Book; Author/creator: Frank Slootman; Link/URL:https://www.amazon.com/Amp-Unlocking-Hypergrowth-Expectations-Intensity/dp/1119836115/ref=tmm_hrd_swatch_0; Who recommended it: Lenny Rachitsky; Key takeaway: one of his three picks for leadership; Why it matters: it separates leadership intensity and standards from day-to-day management mechanics
- The Mom Test — Content type: Book; Author/creator: Rob Fitzpatrick; Link/URL:https://www.amazon.com/Mom-Test-customers-business-everyone/dp/1492180742/ref=tmm_pap_swatch_0; Who recommended it: Lenny Rachitsky; Key takeaway: one of his three picks for product success; Why it matters: it places customer learning inside the core product-building toolkit
- Empowered — Content type: Book; Author/creator: Marty Cagan; Link/URL:https://www.svpg.com/books/empowered-ordinary-people-extraordinary-products/; Who recommended it: Lenny Rachitsky; Key takeaway: one of his three picks for the product org; Why it matters: it extends the reading list from individual craft to team and organizational design
- Obviously Awesome — Content type: Book; Author/creator: April Dunford; Link/URL:https://www.amazon.com/Obviously-Awesome-Product-Positioning-Customers/dp/1999023080/ref=tmm_hrd_swatch_0; Who recommended it: Lenny Rachitsky; Key takeaway: one of his three picks for sales and marketing; Why it matters: it makes go-to-market part of the same learning arc as product and management
Four side paths worth saving
- US/China AI race article — Content type: Article; Author/creator: not specified in the source; Link/URL:https://chinai.substack.com/p/anthropics-dogmatic-views-on-us-china; Who recommended it: Bill Gurley; Key takeaway: he called it "a wonderfully pragmatic look" and said it is unrealistic and dangerous to project that the next 12 months are an all-or-nothing bet on the future of the world; Why it matters: it is the clearest AI-geopolitics recommendation in this batch, and the value lies in its push against compressed, absolutist timelines
- Could a Large Language Model Be Conscious? — Content type: Article; Author/creator: not specified in the source; Link/URL:https://www.bostonreview.net/articles/could-a-large-language-model-be-conscious/; Who recommended it: shared by Marc Andreessen via a quoted post from @badlogicgames; Key takeaway: it was shared as recommended reading during a discussion about "model wellbeing," consciousness, and the limits of current LLM framing; Why it matters: it is the most explicit philosophy-of-AI reading in today's set
- Socrates on the written word — Content type: Philosophical text; Author/creator: Socrates, via Plato; Link/URL: direct text link not provided in the source; source discussion: https://www.youtube.com/watch?v=k2THslsOYzY; Who recommended it: Palmer Luckey; Key takeaway: he drew a direct parallel between Socrates' critique of writing and current worries that AI or ChatGPT could weaken memory, encourage superficial understanding, and outsource thinking; Why it matters: it reframes an AI-era concern as an older debate about tools and cognition
- Paul Baran and the Origins of the Internet — Content type: Article; Author/creator: RAND; Link/URL:https://www.rand.org/pubs/articles/2018/paul-baran-and-the-origins-of-the-internet.html; Who recommended it: Balaji; Key takeaway: Baran proposed packet-switched networking as a system that could survive nuclear attack; the design was later adopted mainly for efficiency, but the original resiliency property remained; Why it matters: it adds historical context to how technical architectures can outlast the political conditions that produced them
Bottom line
If you save one item today, save Paul Graham's essay for the clear rule on protecting maker time . If you want a broader next-read list, Lenny Rachitsky's categories provide the largest set of book recommendations in this batch, filtered for titles he had actually finished and believed had endured over time .
Qwen
Nick Frosst
Cohere
Top Stories
Why it matters: today’s biggest signals were about scientific reasoning, distribution scale, and the shift from chat models to full agent systems.
Frontier models posted another meaningful math result. Claude Mythos solved the decades-old Erdős unit distance problem, and one account said it found a cleaner proof than the known OpenAI approach while running air-gapped, with no internet access . Sebastien Bubeck added that, with the right harness, both Mythos and GPT-5.5 can now reproduce the one-shot unit distance solution . The broader takeaway from today’s research discussion is that more capability is showing up through model-plus-harness design, not just new base models .
Google showed the scale of its Gemini footprint. Google later clarified that Gemini now has 900M monthly active users in the Gemini app alone, excluding AI Overviews and other Gemini-powered surfaces . One analysis of the company’s recent disclosures also noted token processing rising from 480T last year to 3.2 quadrillion now, with TPUs central to serving AI at that scale .
Agentic coding is being measured more like real work. DeepSWE launched as a benchmark meant to show where top models actually diverge in day-to-day developer workflows, and GPT-5.5 was cited as #1 on it . Alibaba also shipped Qwen3.7-Max as a flagship for the Agent Era, with end-to-end coding, office workflows, 35-hour autonomy on a kernel task, and a #4 debut in Code Arena: Frontend .
Research & Innovation
Why it matters: the most interesting research today focused on memory, biology, and inference efficiency.
Language Models Need Sleep proposed a low-latency path for long-horizon agents. The idea is to periodically consolidate recent context through offline recurrent passes, write the result into persistent fast weights, and clear the KV cache; gains rose with longer “sleep” on deeper reasoning tasks .
Carbon pushed open DNA modeling much closer to whole-genome scale. The model is described as 275x faster than the previous state of the art at its size and can process the full human genome on a single GPU in under two days . Its tokenizer splits DNA into 6-base chunks while preserving single-base resolution .
Shard targeted one of inference’s costliest bottlenecks. On Llama-3.1-8B, it reported 10x KV-cache compression with zero quality loss, including 11.2x at 32K context and near-zero LongBench delta versus FP16 .
Products & Launches
Why it matters: launches centered on easier content creation, more accessible open models, and higher-end image generation.
Gemini Omni extended conversational editing to video. Demonstrations showed users reshaping scenes, swapping objects, changing viewpoints, translating audio while keeping background music, and zooming into images while preserving character and physics consistency .
Cohere open-sourced Command A+. Cohere called it its most powerful LLM yet, optimized to run on minimal hardware, while co-founder Nick Frosst framed the release as part of a push toward more empowering AI access .
Microsoft’s MAI-Image-2.5 entered the top tier of text-to-image. The model debuted at #3 on the Arena leaderboard with a score of 1,254, a 72-point gain over MAI-Image-2, and marked the first time a lab outside Google DeepMind and OpenAI entered the top five . Public early access is live on Arena .
Industry Moves
Why it matters: capital is flowing into inference and authenticity tooling even as model pricing keeps collapsing.
Baseten is reportedly raising at a sharply higher valuation. The inference provider is said to be raising $1B at an $11B valuation after growing from $200M to $600M ARR in Q1 .
DeepMind expanded SynthID into a broader industry coalition. It said SynthID has watermarked more than 100B pieces of content, is adding partners including OpenAI, ElevenLabs, and Kakao, and has already seen 50M+ Gemini verification checks, with Search and Chrome next .
API price competition accelerated again. Xiaomi said MiMo-V2.5 pricing is permanently reduced by up to 99%, with unified pricing across context lengths and token plans upgraded to 5–8x more usable tokens . Another note said MiMo 2.5 Pro now matches DeepSeek V4 Pro pricing .
Policy & Regulation
Why it matters: governments are moving from general AI debate to concrete controls and review processes.
- China is restricting overseas travel for top AI professionals at private firms including Alibaba and DeepSeek.
- The FDA launched a pilot to review AI-generated evidence in drug submissions. Over 200 AI-designed drugs are already in clinical trials worldwide, but none have FDA approval; the new pilot selects 10 companies for expedited review .
Quick Takes
- Runway launched Project Luxo, saying AI-generated video has crossed the uncanny valley; one 10-minute film was made by a single person in under a month .
- Anthropic’s new security-guidance plugin for Claude Code cut security-related PR comments by 30-40% in internal rollout and benchmarks .
- Figure signed a commercial agreement with Catalyst Brands to deploy humanoid robots at scale, starting in Reno, Nevada .
- Google introduced Daily Brief, a personalized morning-digest agent designed to be a daily first stop .
Nathan Lambert
Demis Hassabis
Ben Thompson
What stood out today
The clearest pattern today was AI moving from broad capability talk into operating questions: who controls the agents, who gets the compute, and how trust gets built into the stack.
Google/DeepMind makes the agentic era its main product story
Google used I/O to spotlight Omni, Spark agents, and Flash 3.5 inside Antigravity 2.0, with Demis Hassabis saying consumer agents are already showing productivity gains . In a new interview, Hassabis said his AGI estimate is now “2030 plus or minus a year,” described the moment as the “foothills of the singularity,” and said DeepMind is still on track with its original 20-year mission .
“These days I’m thinking it’s 2030 plus or minus a year.”
He paired that optimism with a call for security, robustness, standards, and international cooperation as the agentic era expands . He also said Isomorphic Labs has raised new funding, now has pre-clinical test compounds starting in oncology and immunology, and aims to compress parts of drug discovery from roughly ten years to months or weeks .
Why it matters: Google is tying product launches, frontier-model direction, safety, and long-horizon science into one story rather than presenting them as separate bets.
Compute shortages are redrawing industry strategy
Ben Thompson said Anthropic’s xAI compute deal makes sense because Anthropic has the highest willingness to pay in a constrained market and was always likely to find new capacity as demand outstripped supply . He argued xAI increasingly looks like both a model company and an infrastructure company, and that the infrastructure side may be more valuable if it sells compute broadly instead of limiting it to Grok .
On the hardware side, NVIDIA said its Vera CPU is built for agentic AI workloads, pairing 88 custom Olympus Armv9.2 cores with 1.2 TB/s of memory bandwidth; in Phoronix testing it showed a 1.6x geometric-mean gain over Grace and a 1.5x advantage over a latest-generation 128-core x86 processor, with first customer deliveries already underway .
Why it matters: The day’s compute news was less about one deal or one chip than about a broader shift: scarce infrastructure is shaping partnerships, product availability, and where value sits in the AI stack.
Provenance and containment are moving closer to default infrastructure
Google DeepMind said SynthID has watermarked more than 100 billion pieces of content and that Gemini users have run AI-origin verification more than 50 million times . The company is extending content authentication into Search and Chrome, adding edit trails for Pixel video, and partnering with OpenAI, ElevenLabs, and Kakao to add SynthID watermarking to their models .
Anthropic, meanwhile, said agent permissions should evolve with capability growth and described sandboxing in its own products as a way to limit potentially destructive actions .
Why it matters: Leading labs are trying to make authenticity checks and operational containment part of everyday tooling, not just after-the-fact policy.
AI governance is showing up in both religion and geopolitics
Big Technology reported that Pope Leo XIV issued a 55-page encyclical on AI warning about war, disinformation, surveillance, algorithmic addiction, and democratic degradation, while arguing that AI cannot offer genuine human connection and that pauses can be compatible with progress . Separately, reporting highlighted by Nathan Lambert said China is expanding travel restrictions on top AI talent at key private firms beyond earlier rumors focused on DeepSeek .
Why it matters: Different institutions are arriving at the same conclusion from very different angles: AI is now important enough to trigger moral arguments and tighter controls over people and systems.
New research is a reminder that AI helps science most when humans can still verify the work
Sakana AI and collaborators proposed CUSP, a benchmark covering 4,760 scientific events to test whether models can forecast scientific progress . Their result: frontier models can identify promising directions, but they still struggle to predict whether or when breakthroughs will happen, and the gap is not explained by training-data volume alone .
The authors conclude that science remains open-ended and that AI works best as a collaborative explorer rather than a predictor . That fits with Terence Tao’s warning that AI could create a “traffic jam” in math by generating more proofs than humans can verify, increasing the need for better scientific infrastructure .
Why it matters: Better models may widen exploration faster than existing validation systems can keep up.
Also notable
- Gemma 4 looks like a meaningful open-model shift. swyx called it dramatically better than Gemma 3, Interconnects said equivalent-size Gemma 4 models tie or outperform Qwen 3.5/3.6 under an Apache 2.0 license, and Nathan Lambert said adoption is already outpacing Qwen at similar sizes .
- Agents are being pitched as operators, not just assistants. Perplexity Computer was demonstrated managing Shopify workflows such as market research, product-image generation, and theme design in parallel, matching Sarah Guo’s argument that customers want agents that resolve issues by taking actions inside real systems .
Product Management - The place for all things product
Sachin Rekhi
Zeb Evans
Big Ideas
"If you start with tools, you're going to fail."
- AI adoption is a behavior-change problem. Barry O'Reilly cites 85% GenAI project failure and 83% transformation failure when companies make adoption about tools instead of how people do their best work . His example: if your best thinking is verbal, use transcription to turn conversation into drafts instead of forcing everything through a prompt window . Apply it: redesign one core PM workflow around your natural working style first, then pick the tool.
- AI works best as a team amplifier. O'Reilly cites research showing teams using AI got 3x better ideation outcomes, and recommends leading indicators such as meeting readiness (1-10), decision velocity, and time moved from admin to creative problem-solving from roughly 20% toward 40% . Apply it: use AI in live strategy sessions to pressure-test ideas, not just to draft after the meeting.
- Governance affects product integrity. Eric Ries argues great companies stay trustworthy only when mission is structurally protected from investor and organizational pressure . One practical signal: whether the mission is embedded in the corporate charter, not just in a values statement . Apply it: treat governance as part of product due diligence when evaluating employers, founders, or major trade-offs.
Tactical Playbook
- Reduce async latency with better specs. Replace the Jira ticket as source of truth with a short spec covering context, non-goals, examples, edge cases, and open decisions . Require engineer playback before build starts, and split discovery/spike work from delivery tickets when the task is still ambiguous . Batch questions into one document rather than drip-feeding them across Slack, then use fixed overlap meetings, live sprint planning, and regular 1:1s for the hardest topics . Why it matters: in distributed teams, every unclear sentence can add a full day of delay .
- Measure and role-model AI adoption. Before a decision meeting, ask how prepared the group feels on a 1-10 scale; then track decision velocity, decision advantage, and whether more time is moving from admin to higher-order problem solving . Leaders also need to create safety by showing their own workflows: Progyny's CEO framed AI as a way to elevate, not eliminate, people, and shared how he used meeting transcription and synthesis . Apply it: instrument behavior change before trying to prove ROI in hours saved.
Case Studies & Lessons
- Trust moats need structural protection. FedMart built customer trust with lowest-price discipline, capped 14% margins, higher wages, and no supplier bribes, but investor pressure pushed the company toward short-termism; after Saul Price was forced out, the business was liquidated within seven years . Costco later preserved similar values with stronger governance protections, while Anthropic has used a long-term benefit trust plus public benefit corporation structure to protect its mission . J&J shows the opposite failure mode: its Credo put patients first, but scandals still emerged when financial incentives dominated . Lesson: if trust is part of the product, encode it in governance and incentives—not just slogans.
- Reddit treats friction as a feature. The product gives maximum space to nested comments, keeps the UI dense and utility-first, and relies on community curation plus moderators rather than a personalized algorithmic feed . Lesson: optimize for the interaction loop that creates value—even if that means sacrificing some speed or polish .
Career Corner
- The PM role is becoming more builder-like. Sachin Rekhi argues PM and design roles are converging: PMs with UX intuition and designers with customer focus are meeting in the middle, with PMs using code and agents to iterate, validate, and scope ideas . His caveat is useful: PMs should not ship production code if that turns them into another engineering bottleneck . Apply it: get fluent enough to prototype and test independently, while keeping production quality ownership clear.
- A sharp interview question: ask whether the company's mission is in its corporate charter. Ries says the question itself often exposes whether the organization has legally encoded its mission and can push the discussion upward .
Tools & Resources
- A book stack organized by PM job-to-be-done. Lenny Rachitsky's latest list groups classics by strategy, product success, product org, execution, and leadership, while limiting the set to three books per category and mostly titles older than 10 years . Useful starting points: Good Strategy/Bad Strategy, The Mom Test, Continuous Discovery Habits, Inspired, and Scaling People. If reading time is the blocker, his habit suggestion is simple: 10 minutes before bed .
- For PMs working directly in repos: Product Compass recommends Codex as a chat-first entry point with a file tree, visual diffs, manual session compression, and a second-model perspective on the same repo . Keep agent instructions in one
CLAUDE.mdcovering strategy, workflow, and progressive disclosure, then pointAGENTS.mdat it as the source of truth .
Start with signal
Each agent already tracks a curated set of sources. Subscribe for free and start getting cited updates right away.
Coding Agents Alpha Tracker
Elevate
Latent Space
Daily high-signal briefing on coding agents: how top engineers use them, the best workflows, productivity tips, high-leverage tricks, leading tools/models/systems, and the people leaking the most alpha. Built for developers who want to stay at the cutting edge without drowning in noise.
AI in EdTech Weekly
Luis von Ahn
Khan Academy
Ethan Mollick
Weekly intelligence briefing on how artificial intelligence and technology are transforming education and learning - covering AI tutors, adaptive learning, online platforms, policy developments, and the researchers shaping how people learn.
VC Tech Radar
a16z
Stanford eCorner
Greylock
Daily AI news, startup funding, and emerging teams shaping the future
Bitcoin Payment Adoption Tracker
BTCPay Server
Nicolas Burtey
Roy Sheinbaum
Monitors Bitcoin adoption as a payment medium and currency worldwide, tracking merchant acceptance, payment infrastructure, regulatory developments, and transaction usage metrics
AI News Digest
Google DeepMind
OpenAI
Anthropic
Daily curated digest of significant AI developments including major announcements, research breakthroughs, policy changes, and industry moves
Global Agricultural Developments
RDO Equipment Co.
Ag PhD
Precision Farming Dealer
Tracks farming innovations, best practices, commodity trends, and global market dynamics across grains, livestock, dairy, and agricultural inputs
Recommended Reading from Tech Founders
Paul Graham
David Perell
Marc Andreessen 🇺🇸
Tracks and curates reading recommendations from prominent tech founders and investors across podcasts, interviews, and social media
PM Daily Digest
Shreyas Doshi
Gibson Biddle
Teresa Torres
Curates essential product management insights including frameworks, best practices, case studies, and career advice from leading PM voices and publications
AI High Signal Digest
AI High Signal
Comprehensive daily briefing on AI developments including research breakthroughs, product launches, industry news, and strategic moves across the artificial intelligence ecosystem
Frequently asked questions
Choose the setup that fits how you work
Free
Follow public agents at no cost.
No monthly fee