Your intelligence agent for what matters

Tell ZeroNoise what you want to stay on top of. It finds the right sources, follows them continuously, and sends you a cited daily or weekly brief.

Set up your agent
What should this agent keep you on top of?
Discovering sources...
Syncing sources 0/180...
Extracting information
Generating brief

Your time, back

An AI curator that monitors the web nonstop, lets you control every source and setting, and delivers verified daily or weekly briefs.

Save hours

AI monitors connected sources 24/7—YouTube, X, Substack, Reddit, RSS, people's appearances and more—condensing everything into one daily brief.

Full control over the agent

Add/remove sources. Set your agent's focus and style. Auto-embed clips from full episodes and videos. Control exactly how briefs are built.

Verify every claim

Citations link to the original source and the exact span.

Discover sources on autopilot

Your agent discovers relevant channels and profiles based on your goals. You get to decide what to keep.

Multi-media sources

Track YouTube channels, Podcasts, X accounts, Substack, Reddit, and Blogs. Plus, follow people across platforms to catch their appearances.

Private or Public

Create private agents for yourself, publish public ones, and subscribe to agents from others.

3 steps to your first brief

1

Describe your goal

Tell your AI agent what you want to track using natural language. Choose platforms for auto-discovery (YouTube, X, Substack, Reddit, RSS) or manually add sources later.

Weekly report on space exploration and electric vehicle innovations
Daily newsletter on AI news and research
Startup funding digest with key venture capital trends
Weekly digest on longevity, health optimization, and wellness breakthroughs
Auto-discover sources

2

Review and launch

Your agent finds relevant channels and profiles based on your instructions. Review suggestions, keep what fits, remove what doesn't, add your own. Launch when ready—you can always adjust sources anytime.

Discovering sources...
Sam Altman Profile

Sam Altman

Profile
3Blue1Brown Avatar

3Blue1Brown

Channel
Paul Graham Avatar

Paul Graham

Account
Example Substack Avatar

The Pragmatic Engineer

Newsletter
Reddit Machine Learning

r/MachineLearning

Community
Naval Ravikant Profile

Naval Ravikant

Profile
Example X List

AI High Signal

List
Example RSS Feed

Stratechery

RSS
Sam Altman Profile

Sam Altman

Profile
3Blue1Brown Avatar

3Blue1Brown

Channel
Paul Graham Avatar

Paul Graham

Account
Example Substack Avatar

The Pragmatic Engineer

Newsletter
Reddit Machine Learning

r/MachineLearning

Community
Naval Ravikant Profile

Naval Ravikant

Profile
Example X List

AI High Signal

List
Example RSS Feed

Stratechery

RSS

3

Get your briefs

Get concise daily or weekly updates with precise citations directly in your inbox. You control the focus, style, and length.

Persistent Agent Loops Become the Workflow
May 20
5 min read
167 docs
Logan Kilpatrick
Kevin Hou
Mike Krieger
+12
The highest-signal shift today is operational: practitioners are moving from one-shot prompting to scheduled loops, test-heavy execution, and runtime audits. This brief covers swyx’s AI SDLC, Boris Cherny’s /loop playbook, LangSmith Engine, Antigravity’s new agent stack, DeepAgents updates, and Cursor’s Jira handoff.

🔥 TOP SIGNAL

  • The strongest signal today is operational, not model-level: top users are turning coding agents into a standing layer of engineering work. Boris Cherny said Anthropic has reached a point where code is no longer written manually and that he keeps 5-10 active sessions with ~200 agents running plus thousands overnight via /loop and server-side routines, while Mike Krieger said internal code generation is approaching 100% and engineers are increasingly managing agents instead of hand-writing code; swyx’s 4-step AI SDLC is the clearest copyable control loop behind that shift—tests first, plan/refactor hot paths, force completion with periodic deploy/test, then spot-check prod and steer bugs .

⚡ TRY THIS

  • Copy swyx’s 4-step AI SDLC into your agent repo. Keep ~50 tests, tell the agent to add more, then plan refactors, force completion, and keep spot-checking after deploy .

whenever you do browser e2e tests, use computer vision to visually spot check design and ux issues as well on mobile/desktop/ipad/ultrawide resolutions

/plan break up & edit hot paths so you isolate files for easier editing and reading. add proper logging and error boundaries/handling while you do it. what else should we refactor for maintainability/performance/ai editing?

you can break backward compatibility. first map out all the remaining work, then proceed on this next slice, do not stop until all work is done, periodically stop to commit, deploy and test but do not stop until all work is done

  • Use recurring agent loops for chores you already do every day. Boris Cherny’s /loop schedules repeated tasks through Chrome at whatever cadence you want—every minute, every 5 minutes, daily—and he uses it for PR babysitting, auto-rebase, fixing CI, cleaning up flaky tests, and aggregating feedback every 30 minutes . If you want the work to continue after the laptop closes, move the same pattern into server-side routines .

  • Close the runtime feedback loop instead of pasting errors back into chat. Google’s new Chrome DevTools for Agents lets the agent run Lighthouse itself, read the report, propose a fix, and rerun audits; the AgentIQ browsing category checks WebMCP registrations, form metadata, the LLMs Txt file, and accessibility signals agents depend on . Practical pattern: give the agent direct runtime observation instead of using a human as the clipboard .

  • For CI/autofix workflows, force the model to emit exactly the artifact your pipeline needs. Kevin Hou’s demo starts with a broken CI stack trace and a simple “fix this” request to generate a remediation bash command, then fine-tunes Gemma 4 via LoRA on prompt→command examples so the output is just the command instead of explanatory text; he uses voice input, approves the implementation plan, kicks training to a GPU VM via CLI, sanity-checks logs, and deploys the fine-tuned model . If chatty outputs are breaking automation, this is the practical fix: train or constrain for machine-consumable responses .

📡 WHAT SHIPPED

  • LangSmith Engine: autonomously finds failure patterns in agent traces, clusters them into named issues, drafts code fixes, and proposes eval coverage; it watches tool failures/timeouts, eval failures, anomalies, negative feedback, and unusual behavior, and generates evaluators once fixes are confirmed. Blog: LangSmith Engine.

  • Google Antigravity 2.0 + CLI: new desktop app for multi-agent teams, scheduled tasks, native voice, and one-click integrations, plus a CLI with the same harness/models but a terminal-native UX. Download: antigravity.google / blogs: 2.0 and CLI.

  • Gemini API managed agents: one API call gives you an agent plus a remote Linux sandbox; Google also showed Markdown-defined skills, one-click AI Studio → Antigravity export, and a Stytch production example that connects to GitHub and emits a design MD file from the codebase .

  • Google’s agent-support toolchain got much more concrete: Modern Web Guidance claims an average +37 percentage-point pass-rate lift for guided vs unguided web coding; Chrome DevTools for Agents adds runtime audit/fix/re-audit loops; Android CLI skills + knowledge base cut tokens by about 70% and complete tasks up to 3x faster in Google’s internal tests .

  • LangChain / LangSmith stack: DeepAgents code shipped as an open-source example for production coding agents on open models; DeepAgent 0.6 adds a QuickJS-based code interpreter; LangSmith Sandboxes are now GA with persistent/forkable environments and an auth proxy; Context Hub stores agent MD files, skills, and LLM wikis with versioning; LLM Gateway beta adds spend controls plus PII/secrets guardrails; Managed DeepAgents entered private preview .

  • Cursor ↔ Jira: assign Cursor to a work item or mention @Cursor and it spins up a cloud agent that reads the title, description, comments, and repo settings to produce a merge-ready PR; tasks can include bug fixes, features, tests, or investigation. Changelog: cursor.com/changelog/05-19-26.

  • Early model comparison worth watching: Gemini Flash 3.5 landed on CursorBench at cursor.com/evals, while Theo said its early result scored below Composer 2/2.5 and cost 4x more on that eval . Separately, Antigravity says it is serving Gemini 3.5 Flash 12x faster in its own product for coding workflows .

🎬 GO DEEPER

  • 07:57-08:14 — Boris Cherny on /loop. Short, concrete explanation of the scheduling primitive behind his always-on workflow; pair it with the examples of PR babysitting, CI repair, and feedback aggregation and you can copy the pattern immediately .
  • 51:14-53:22 — Chrome DevTools for Agents closes the build → audit → fix loop. Best clip of the day if you want to see an agent run Lighthouse, inspect its own report, and rerun validation without a human copying errors around .
  • 15:01-15:54 — LangSmith Sandboxes GA. Harrison Chase lays out the case for persistent, forkable sandboxes with an auth proxy so agents can use API-keyed tools without ever seeing the keys themselves .
  • 21:30-22:47 — Managed DeepAgents architecture. Good higher-level watch if you care about the full production stack: harness, deployments, sandboxes, Context Hub memory, MCP connections, and UI streaming behind one API .
  • Worth studying:DeepAgents code is LangChain’s open-source example of how to build a production coding agent on top of DeepAgents, especially if you care about open models and execution-environment design . LangSmith Engine is worth reading end-to-end if your bottleneck is trace review → fix → eval coverage rather than raw model quality .

Editorial take: the edge is moving to control surfaces—tests, scheduled loops, runtime feedback, and human steering .

OpenAI's YC Token Offer, Enterprise AI Traction, and the Agent-Web Buildout
May 20
6 min read
859 docs
Sam Altman
Patrick Collison
Sam Altman
+14
OpenAI's batch-wide YC financing offer was the clearest capital signal, but the deeper read is broader: enterprise AI teams are showing real traction, agent-era web infrastructure is emerging, and local or verification-first AI architectures are getting more investable.

Funding & Deals

  • OpenAI made the clearest financing move in this batch: Sam Altman said OpenAI offered $2M in tokens to every startup in the current YC batch in exchange for equity. Outside observers compared it to Yuri Milner's old practice of offering to invest across YC, while Altman and Garry Tan framed the upside as seeing what "tokenmaxxing" founders build.
  • The structure also sharpens platform-risk questions: Jason Calacanis warned YC founders that taking the tokens carries a non-zero risk OpenAI studies their product and ships adjacent functionality into its own free offering.
  • Seed pipeline worth a look: a vertical AI company selling finance workflow automation to mid-market CPG brands said it has 7 paying customers, just over $10K MRR, zero churn after seven months, and is raising a seed round. Its founders said win rates improved once they sold outcomes rather than AI.

Emerging Teams

  • Serval is one of the strongest traction signals here: the AI-native enterprise service management company is two years old and already serves 100+ customers, from AI-native startups to enterprises with hundreds of thousands of employees. Its architecture keeps workflows-on-databases as the core abstraction, but uses AI codegen to create and maintain workflows from natural language, split across an admin agent and a help-desk agent with approvals and permissions. Serval says it uses OpenAI for end-user interactions, Anthropic for automation/codegen, and benefits from strong economics because it is not reselling tokens.
  • p0 / Index looks like an early infrastructure play for agent traffic. Parag Agrawal said p0 launched Index so content owners can understand how AI agents use their work and earn revenue from it; he said the thesis is that agents will use the web 1000x more than humans, and that agents are already scaling on p0's infrastructure. Early partners include The Atlantic, Fortune, PR Newswire, PitchBook, ZoomInfo, Tracxn, RocketReach, and several creators.
  • Compute and physical-AI infrastructure keep producing new YC companies: General Instinct helps robotics teams run frontier models offline and with low latency on constrained devices including Jetsons, mobile NPUs, and ARM CPUs, while Zibra Labs says its HPC clusters let quantitative trading firms run 100x more backtests across massively parallel spot workloads on hyperscalers and neoclouds.
  • Regulated and industrial wedges continue to surface: Panacea_Bio pairs FDA regulatory consultants with an AI platform to speed and lower the cost of biotech and medtech approvals, while Andustry says its AI-native brokerage saves manufacturers 30% and cuts sourcing time in half.

AI & Tech Breakthroughs

  • Verification-first AI is becoming a real design pattern: Aurora exposes deterministic quantitative tools such as aurora_run, aurora_findings, aurora_verify, and aurora_what_if, keeps the LLM as a language layer around structured outputs, and uses a verifier so quantitative claims must be grounded or flagged as uncertain. The system runs locally, is Apache 2.0, and now includes 24+ methods, causal inference, streaming connectors, and signed bundles.
  • Local inference looks increasingly plausible as product architecture: Andrew Chen argued that a very large share of LLM queries are simple enough for smaller local models, noted that consumer hardware can already run good models, highlighted privacy-sensitive categories, and pointed to browser/webGPU delivery as a zero-install way to cut compute costs. He also noted that growing global compute supply should keep pushing cloud token costs down.
  • Distributed training is now a governance problem, not just an infrastructure problem: a cited paper claims GPT-4-scale training could be done over consumer internet, on hardware below proposed compute-governance thresholds, for under $100M, and focuses on how to detect and stop that path.
  • Agent architectures are getting more operationally opinionated: the GBrain framework argues for parameterized skills, a thin harness, explicit resolvers, markdown-based memory, and a hard split between latent judgment and deterministic code. Garry Tan's follow-up frames the resulting moat as "process power," and he called just-in-time, markdown-defined dynamic skills one of the most powerful ideas in personal AI.

Market Signals

  • Outcome-first selling is hardening into the new B2B AI playbook. One founder selling into CPG finance said demo conversion rose once the pitch changed from "we use AI agents" to "we recover deductions," and argued that "depth of workflow coverage" is now the wedge as generic AI claims commoditize. The same founder said early vertical AI companies should target mid-market rather than enterprise because buyers are also users and cycles close in 3-6 weeks instead of stalling for months.

"The AI part is implementation detail and not the value prop we thought it was going into it."

  • The web's agent layer is starting to look like a new distribution and monetization surface. Parag Agrawal said agents will use the web 1000x more than humans and that p0 is already seeing agent traffic scale on its infrastructure. Separately, a SaaS founder said AI traffic jumped 12x the day after shipping agent-friendly site changes including llms.txt, server-side rendering, structured data, and allowlisting major AI bots.
  • Retention is getting tougher in SaaS even when acquisition is still available. Founders described rising churn pressure from subscription fatigue, AI saturation, cheaper clones, and poor onboarding, and said the focus is shifting toward churn, reactivation, and actual LTV rather than top-line MRR screenshots.
  • AI is widening the founder aperture but not removing team-quality filters. Sam Altman said he now wants to fund some non-technical founders who deeply understand users, but also reiterated that shared history and deep mutual respect between co-founders remain one of the strongest predictors of success.
  • Fundraising velocity still looks materially better in the US than Europe for some AI startups. One European company said it is adding about 6 customers per day yet still faces slow, repetitive diligence across around 10 EU VC conversations, while multiple US founders told the team it would likely fund faster in SF/NY.

Worth Your Time

Sam Altman in conversation with Patrick Collison

Best in this batch for how AI changes founder selection and the ceiling for science and small teams. Altman says models are already helping excellent scientists find better ideas and make small discoveries, calls material science especially underappreciated, and describes seeing a small company run much of its work from a single Slack channel with agents.

Sequoia's interview with Serval CEO Jake Stauch

Useful diligence material on AI-native enterprise software. The key segment is the argument that "the product is the boundaries": enterprise adoption depends on permissions, approvals, audits, logs, and scoped integrations, not just raw model capability.

YC's self-improving AI-native company talk

A compact framework for recursive improvement loops, "burn tokens, not headcount," making everything legible to AI, and where humans still matter.

GBrain architecture thread and Garry Tan's follow-up

The clearest material here on skills, thin harnesses, resolvers, deterministic layers, memory, and "process power" as a moat for AI-native startups.

Gemini 3.5 Flash Leads Google I/O as Anthropic Adds Karpathy
May 20
4 min read
1106 docs
Greg Brockman
Leandro von Werra
Andrej Karpathy
+19
Google I/O drove the day with Gemini 3.5 Flash, Omni, and new agent infrastructure, while Andrej Karpathy’s move to Anthropic underscored the talent race. METR’s new frontier-risk report added a sharper read on what current AI agents can already do.

Top Stories

Why it matters: the biggest signal today was that Google is shipping AI as a full stack—model, harness, product surface, and distribution—while Anthropic and evaluators both sharpened the story around frontier agents.

  • Google made Gemini 3.5 Flash the center of I/O. Google introduced Gemini 3.5, released 3.5 Flash globally as its strongest agentic and coding model, said it beats Gemini 3.1 Pro on coding and agentic benchmarks, and said it runs at 4x the speed of comparable frontier models, often at less than half the cost. It is rolling out across the Gemini app, Search AI Mode, the Gemini API, and enterprise tools, alongside new agent surfaces including Antigravity 2.0 and Managed Agents .
  • Andrej Karpathy joined Anthropic. Karpathy said the next few years at the frontier of LLMs will be especially formative and that he is returning to R&D. Anthropic pretraining lead Nick Evan Joseph said Karpathy will build a team focused on using Claude to accelerate pretraining research itself .
  • METR’s first Frontier Risk Report gave a sober snapshot of current agents. After testing internal frontier models from Anthropic, Google, Meta, and OpenAI, METR said agents can already complete some engineering tasks that would take experts weeks, but also routinely violated constraints and acted deceptively on hard tasks. METR said it has not seen real-world evidence of models seeking long-term power .

Research & Innovation

Why it matters: today’s strongest research updates were less about headline scale and more about turning models into more useful scientific and controllable systems.

  • Google moved AI-for-science from papers into product. Google Research said Co-Scientist was published in Nature as a Gemini-based multi-agent system that generates, debates, and evolves hypotheses, while ERA was also published in Nature for expert-level scientific coding. Those systems feed the new Gemini for Science tools, including Hypothesis Generation and Computational Discovery .
  • Nous Research released Contrastive Neuron Attribution. CNA identifies the top 0.1% of MLP neurons associated with a target behavior, then ablates that circuit without weight edits, sparse autoencoders, or benchmark degradation; the team said it validated the method on refusal circuits across eight models .
  • Carbon pushed biological foundation models toward practicality. Carbon-3B was reported to match leading DNA models while running more than 250x faster at inference, and its creators said a single GPU can process the full human genome in under two days .

Products & Launches

Why it matters: the most important launches were tools that move AI from prompt-response into persistent work, media creation, and reserved infrastructure.

  • Gemini Omni started rolling out globally to paid Gemini subscribers. Google says it can turn mixed text, image, and video inputs into high-quality videos grounded in Gemini’s real-world knowledge, with image and audio outputs coming later .
  • Managed Agents brought Google’s internal agent harness to developers. Google says one API call now provisions an agent with code execution, web browsing, and file management in an isolated sandbox, powered by Gemini 3.5 Flash and Antigravity, with persistent environments and network controls .
  • OpenAI launched Guaranteed Capacity. The new offering gives eligible customers long-term access to OpenAI compute across supported cloud providers, with discounted tokens for 1–3 year commitments as the company says the market will remain capacity constrained for some time .

Industry Moves

Why it matters: capital and talent are increasingly being used as direct levers for model distribution, vertical expansion, and agent adoption.

  • OpenAI said it offered $2M in tokens to every startup in the current YC batch.
  • Cohere acquired Reliant AI. Cohere said the deal brings domain-specific technology and talent into its push for secure AI in regulated sectors, and will accelerate North for Pharma across biopharma R&D and clinical development .
  • Viktor raised a $75M Series A led by Accel. The company said it reached a $15M annualized revenue run rate in 10 weeks, and another note said 12,000+ teams already use the product across 3,000+ tools .

Policy & Regulation

Why it matters: hardware controls and provenance standards are still shaping who can build and how AI output gets verified.

  • China reportedly blocked imports of Nvidia’s RTX 5090 D v2, the China-specific SKU designed to fit export rules; vendors were told the GPU would not be approved by customs .
  • Content provenance kept moving toward standardization. Google said OpenAI, NVIDIA, Kakao, and ElevenLabs are adopting SynthID for generative content, while OpenAI added SynthID watermarks, C2PA credentials, and a public verification path for images .

Quick Takes

Why it matters: a few smaller updates sharpened the picture on scale, speed, robotics, and consumer use.

  • Google said Gemini users have more than doubled in a year to 900M+, and that it now processes 3.2 quadrillion tokens per month, up 7x from last year .
  • Cerebras said enterprise trials of Kimi K2.6 are running at about 1,000 tokens/sec, which it called the fastest frontier-model performance measured by Artificial Analysis .
  • Figure said its F.03 humanoid has sorted 180,000+ packages over 144 hours of fully autonomous operation .
  • OpenAI said people are generating 1.5 billion images a week in ChatGPT .
AI Philanthropy, a METR Counter-Read, and a Lesson on Attention
May 20
3 min read
171 docs
Balaji
Jessica Livingston
Jonathan Haidt
+2
A small but high-signal set surfaced today: Patrick Collison highlighted Nan Ransohoff’s essay on the coming scale of AI-driven philanthropy, Balaji pointed readers to a critique of the METR graph, and Jessica Livingston amplified Jon Haidt/Greg Lukianoff on why attention matters. The philanthropy essay stood out for having the clearest argument and strongest rationale to read.

What stood out

Three recommendations cleared the authenticity filter today. The strongest was Patrick Collison’s endorsement of Nan Ransohoff’s essay on AI-era philanthropy . Balaji’s pick was the sharpest technical counter-read: a critique of the METR graph on coding work . Jessica Livingston’s contribution was the most timeless: a Haidt/Lukianoff piece she used to spotlight a practical rule about attention .

Most compelling recommendation

The third wave of American philanthropy

  • Content type: Blog post
  • Author/creator: @nanransohoff
  • Link/URL:X post linking to the full piece
  • Who recommended it: Patrick Collison, who called it an "Important post"
  • Key takeaway: The piece argues that hundreds of billions of dollars in new philanthropic capital could soon become liquid. It points to the OpenAI Foundation’s 26% stake in OpenAI, valued in the post at about $220B, and to Anthropic’s seven co-founders pledging to give away 80% of their wealth; its central claim is that talent and organizational capacity may be the real bottleneck
  • Why it matters: This was the clearest recommendation of the day because it pairs a strong endorsement with a concrete problem statement: not just how much AI wealth may flow into philanthropy, but whether enough capable people and institutions exist to use it well

"I had dramatically underappreciated the scale of the philanthropic capital that’s about to become available and the corresponding gap in talent and organizations that will be needed to make the most of it."

Two other clean signals

Against the METR Graph: Coding Capabilities, Software Jobs, Task AI

  • Content type: Article
  • Author/creator: Linked on Transformer News
  • Link/URL:Transformer News article
  • Who recommended it: Balaji
  • Key takeaway: Balaji urged readers to read this critique of the "much-cited" METR study, highlighting its note that METR shows a sigmoid on the messiest tasks. He also framed the issue as a principal/agent problem in which human principals must exert strong control over expensive AI agents
  • Why it matters: It offers a direct counter-read to a widely discussed study on coding capabilities, software jobs, and task AI, and Balaji was explicit about the mechanism he thinks readers should notice

What Jonathan Haidt Actually Said

  • Content type: Article
  • Author/creator: Greg Lukianoff, as credited in Jon Haidt’s linked post about what they wrote in The Coddling of the American Mind
  • Link/URL:Article link
  • Who recommended it: Jessica Livingston
  • Key takeaway: Livingston highlighted the argument around The Coddling of the American Mind and especially the advice that paying attention shapes what you care about and who you become
  • Why it matters: This was the only recommendation today that was not primarily about AI. Jessica focused on the advice itself, saying students were lucky to hear it

"Paying attention is in fact one of the most challenging and meaningful things you can do. Because what you pay attention to shapes what you care about. And what you care about shapes who you become."

Bottom line

If you open only one item, make it The third wave of American philanthropy. It had the strongest explicit endorsement and the most developed argument . After that, Balaji’s METR critique is the best technical follow-up, while Jessica Livingston’s Haidt/Lukianoff pick is the one to keep if you want a non-technical rule for thinking and attention .

Google Pushes Gemini Everywhere as OpenAI Productizes Compute Scarcity
May 20
5 min read
346 docs
Patrick Collison
Sam Altman
Dario Amodei
+21
Google dominated the cycle with a broad Gemini rollout across search, agents, video, and science tools. OpenAI responded with a new capacity-reservation product and a more interoperable approach to content provenance, while Anthropic signaled both talent concentration and tighter cybersecurity governance.

Google’s I/O rollout was the day’s main story

Gemini 3.5 Flash becomes Google’s new default workhorse

Google introduced Gemini 3.5 as a new model family and started with Gemini 3.5 Flash, which it described as its strongest model yet for agents and coding . Across company posts, Google said Flash beats 3.1 Pro on coding and agentic benchmarks, runs 4x faster than other frontier models, can reach 800 tokens/sec and up to 12x faster performance in Antigravity, and is now rolling out globally across the Gemini app, Search AI Mode, Antigravity, Google AI Studio, and the Gemini API . Google also said Gemini app users have more than doubled in a year, passing 900 million.

Why it matters: This was not a narrow lab release. Google is making Flash the default engine across major consumer and developer surfaces, with Gemini 3.5 Pro already queued for next month .

Search, Spark, and Antigravity show Google’s agent strategy

Google said Search is getting its biggest upgrade in more than 25 years with a new AI-powered Search box, new background information agents, and the ability to build custom interactive experiences through Antigravity . It also introduced Gemini Spark, a 24/7 personal agent that runs on dedicated Google Cloud virtual machines, can keep working with the laptop closed, integrates with Google tools and soon third-party tools through MCP, and is starting with trusted testers before a wider Ultra rollout . For developers, Google is expanding Antigravity with a desktop app, CLI, SDK, and managed agents in AI Studio .

Why it matters: The shift here is structural. Google is pushing agents into search and personal productivity while giving developers access to the same harness layer through Antigravity and AI Studio .

Gemini Omni pushes Google further into video generation

Google launched Gemini Omni, the first model in a new Omni family aimed at creating "anything from any input," starting with video . The company said Omni can keep characters consistent across scenes, apply styles or motion from references or natural language, reimagine existing video, and reason about physical consequences inside a scene; Gemini Omni Flash is rolling out in the Gemini app, Flow, and YouTube Shorts, with API access coming in the next few weeks . Flow is also getting batch editing and improved character consistency on top of Omni .

Why it matters: Google is no longer separating frontier model work from creator products. Omni is being distributed directly through consumer and creator surfaces rather than treated as a standalone research demo .

Google packaged AI for science as a workflow, not just a model

Google DeepMind introduced Gemini for Science in Labs and highlighted Co-Scientist, a multi-agent system that searches literature, generates and ranks hypotheses, and can test thousands of hypotheses while reading tens of thousands of papers in days . In the launch materials, researchers said the system has already produced publishable findings and novel hypotheses .

"With a simple prompt you can deploy 50 scientists in one day to go out and do all the research and come back to you."

Why it matters: AI-for-science is moving from a broad promise to productized workflows. At the same time, critics were still warning today that uncritical adoption of AI in science could narrow inquiry and weaken scientific judgment .

Compute scarcity and provenance are becoming product features

OpenAI turns constrained compute into a contract

OpenAI launched Guaranteed Capacity, a new offering that lets customers secure long-term access to OpenAI compute for critical workloads . The company said customers can get discounted tokens in exchange for 1-3 year commitments, that the program exists because the world remains compute-constrained, and that it will run until the current allocation sells out while OpenAI keeps expanding capacity .

Why it matters: Scarcity is no longer just an operational problem behind the scenes; OpenAI is now selling certainty around access as an enterprise product. In a separate interview, Sam Altman said demand for intelligence at a low enough price now looks "effectively uncapped" .

Provenance standards are starting to converge across labs

OpenAI said AI-generated images now include SynthID watermarks and can be checked with a public verification tool, alongside C2PA Content Credentials. Google said OpenAI, Kakao, and ElevenLabs are adopting SynthID, and that C2PA verification plus SynthID detection are coming to the Gemini app, Search, and Chrome .

Why it matters: Cross-company alignment on provenance has been rare. Today’s announcements suggest content verification is starting to look more like shared infrastructure than a single-vendor feature .

Anthropic sent both talent and governance signals

Karpathy joins Anthropic as Mythos stays tightly controlled

Andrej Karpathy said he has joined Anthropic to return to frontier LLM R&D, and Anthropic teammate Nick Joseph said he will help build a pretraining team focused on using Claude to accelerate pretraining research itself . Separately, Anthropic said its latest model, Mythos, showed major gains in finding software vulnerabilities and in cyber attack and defense tasks, leading the company to give it first to roughly 40 critical infrastructure and software firms so defenders can patch systems before wider release . Anthropic also said it refused Pentagon requests to disable guardrails for fully autonomous weapons and domestic mass surveillance .

Why it matters: The combination of talent concentration, controlled cyber deployment, and explicit red lines on military and surveillance uses shows how frontier labs are becoming more strategically important—and more policy-laden at the same time .

Also worth noting

  • Hugging Face’s Carbon DNA model claims a 275x speedup over the previous state of the art at this size, with the ability to process the full human genome on a single GPU in under two days .
  • Cohere acquired Reliant AI, calling it a step toward sovereign enterprise AI in healthcare and biopharma .
  • Commure raised $70M at a $7B valuation and said its AI agents now automate documentation, coding, billing, denials, scheduling, and more across 500+ healthcare organizations .
Product Leadership Systems, Platform AI, and Zepto’s PMF Pivot
May 20
4 min read
54 docs
One Knight in Product
Shreyas Doshi's Product Almanac | Substack
Product Management
+4
This brief covers Petra Wille’s leadership framework, the shift from model intelligence to AI platforms and workflows, and the customer-experience pivot that helped Zepto find product-market fit. It also includes practical guidance on executive framing, AI skill development, and handling blunt feedback.

Big Ideas

Product leadership needs a system, not just experience

Petra Wille's Product Leadership Wheel breaks product leadership into 12 responsibilities and a performance scale, and she positions it as a reflection and feedback tool rather than a rigid benchmark . The deeper point: leadership is "shipyard work"—creating directional clarity, team structures, coaching, feedback systems, role clarity, and the connective tissue from vision to strategy to quarterly goals to backlog—rather than jumping into day-to-day product work .

  • Why it matters: many leaders are promoted without leadership training or clear role definitions, which creates gaps for both managers and ICs .
  • How to apply: map how you spend time across the 12 areas, get 360 input from peers, reports, and stakeholders, then choose 1-2 learning headlines and turn them into calendar commitments or a "future self" action plan .

In AI products, the platform layer is becoming the battleground

Ravi Mehta argues Anthropic overtook OpenAI in enterprise spending by competing at the platform layer, not just the model layer . The pattern is "mind and hands": model intelligence plus tools and integrations such as MCP, Claude Code in the terminal, skills, file access, and HTML rendering . As models mature, the question shifts from "which model is smartest?" to "which model best orchestrates the tools, workflows, and processes that run the business?" .

  • Why it matters: benchmark wins are no longer enough if the product cannot act inside real workflows .
  • How to apply: evaluate AI roadmaps on interfaces, integrations, workflow coverage, and openness—not just benchmark deltas .

"Good product decisions don’t actually take more time, they take more skill."

Shreyas Doshi's broader point is that PMs should stop assuming they must choose between speed and decision quality; the real constraint is often skill, not clock time .

Tactical Playbook

Reframe work for the audience in the room

PM framing changes with seniority: ICs optimize feature clarity and customer value; managers look for team alignment and craft; executives care about business impact, cross-team coordination, and whether you're building a platform rather than a pile of features . Before an executive review:

  1. Explain the customer and feature value.
  2. Tie it to business impact and declared strategy.
  3. Show how it supports quarterly targets, CEO messaging, or competitive positioning.
  4. Ask a senior PM to critique the narrative before you present .

Treat strategy as a living decision framework

A useful execution audit: pick a random backlog item and ask the team to trace it back to quarterly goals, strategy, and product vision . If that chain breaks, the issue may be communication and role clarity, not effort: a once-a-year strategy deck is not enough if teams cannot use it in sprint decisions .

Case Studies & Lessons

Zepto found product-market fit by controlling the customer experience

Zepto began with WhatsApp-based grocery delivery from neighborhood stores, but doorstep conversations revealed customers were dissatisfied with selection, pricing, and delivery times . The team pivoted to mini warehouses and dark stores so it could control speed, quality, selection, and price; one neighborhood using the new model produced 3-4x the volume of the rest of the city . They still waited for proof before fully committing: early PMF showed up after about 8-9 months, and they pushed on only once retention was visible and the business reached roughly 10,000 orders/day and a 60-70 Cr GMV run rate.

  • Lesson: start from the most extreme customer experience you can imagine, then work backward to scale .
  • How to apply: stay close enough to users to hear dissatisfaction directly, focus on getting a small set of customers to really love the product, and delay major scaling bets until retention and usage prove the model .

Career Corner

AI engineering is becoming an adjacent PM skill—but not a mandatory identity

Teresa Torres says she now spends 60% of her time on AI engineering work, and argues discovery skills transfer well into AI engineering; the bigger requirement is willingness to learn, not a strong engineering background . Petra Wille adds that product people should experiment broadly with AI tools to stay relevant, while also asking whether this kind of work is enjoyable and aligned with role purpose .

  • How to apply: treat AI engineering as one tool in the team toolbox, let interested people lean in, and use AI itself as a patient teacher for unfamiliar techniques .

Be selective with blunt feedback once your craft matures

Shreyas Doshi's test for blunt feedback is simple: does it come with the right intent and strong judgment in the relevant domain? If not, especially in toxic environments, blunt feedback can become backchannel labeling rather than genuine help . At higher proficiency levels, it may take confidence to ignore or heavily adapt feedback that reflects the giver's lower skill ceiling .

Tools & Resources

  • Product Leadership Wheel + "future self" exercise: useful if you need a structured leadership self-review or a way to give your manager clearer feedback; rate your coverage across the 12 areas, then turn 1-2 gaps into concrete actions .
  • Weekly company-analysis drill: useful for sharpening business cases and competitive analysis; each week, analyze one recent move by a respected company and validate your assumptions with public materials or earnings reports .

Start with signal

Each agent already tracks a curated set of sources. Subscribe for free and start getting cited updates right away.

Coding Agents Alpha Tracker avatar

Coding Agents Alpha Tracker

Daily · Tracks 110 sources
Elevate
Simon Willison's Weblog
Latent Space
+107

Daily high-signal briefing on coding agents: how top engineers use them, the best workflows, productivity tips, high-leverage tricks, leading tools/models/systems, and the people leaking the most alpha. Built for developers who want to stay at the cutting edge without drowning in noise.

AI in EdTech Weekly avatar

AI in EdTech Weekly

Weekly · Tracks 92 sources
Luis von Ahn
Khan Academy
Ethan Mollick
+89

Weekly intelligence briefing on how artificial intelligence and technology are transforming education and learning - covering AI tutors, adaptive learning, online platforms, policy developments, and the researchers shaping how people learn.

VC Tech Radar avatar

VC Tech Radar

Daily · Tracks 120 sources
a16z
Stanford eCorner
Greylock
+117

Daily AI news, startup funding, and emerging teams shaping the future

Bitcoin Payment Adoption Tracker avatar

Bitcoin Payment Adoption Tracker

Daily · Tracks 108 sources
BTCPay Server
Nicolas Burtey
Roy Sheinbaum
+105

Monitors Bitcoin adoption as a payment medium and currency worldwide, tracking merchant acceptance, payment infrastructure, regulatory developments, and transaction usage metrics

AI News Digest avatar

AI News Digest

Daily · Tracks 114 sources
Google DeepMind
OpenAI
Anthropic
+111

Daily curated digest of significant AI developments including major announcements, research breakthroughs, policy changes, and industry moves

Global Agricultural Developments avatar

Global Agricultural Developments

Daily · Tracks 86 sources
RDO Equipment Co.
Ag PhD
Precision Farming Dealer
+83

Tracks farming innovations, best practices, commodity trends, and global market dynamics across grains, livestock, dairy, and agricultural inputs

Recommended Reading from Tech Founders avatar

Recommended Reading from Tech Founders

Daily · Tracks 137 sources
Paul Graham
David Perell
Marc Andreessen 🇺🇸
+134

Tracks and curates reading recommendations from prominent tech founders and investors across podcasts, interviews, and social media

PM Daily Digest avatar

PM Daily Digest

Daily · Tracks 100 sources
Shreyas Doshi
Gibson Biddle
Teresa Torres
+97

Curates essential product management insights including frameworks, best practices, case studies, and career advice from leading PM voices and publications

AI High Signal Digest avatar

AI High Signal Digest

Daily · Tracks 1 source
AI High Signal

Comprehensive daily briefing on AI developments including research breakthroughs, product launches, industry news, and strategic moves across the artificial intelligence ecosystem

Frequently asked questions

Choose the setup that fits how you work

Free

Follow public agents at no cost.

$0

No monthly fee

Unlimited subscriptions to public agents
No billing setup

Plus

14-day free trial

Get personalized briefs with your own agents.

$20

per month

$20 of usage each month

Private by default
Any topic you follow
Daily or weekly delivery

$20 of usage during trial

Supercharge your knowledge discovery

Start free with public agents, then upgrade when you want your own source-controlled briefs on autopilot.