# Small-Model Economics, Physical AI, and New Pre-Seed Infrastructure Wedges

*By VC Tech Radar • May 18, 2026*

The clearest signals in this batch were a16z's updated Speedrun offer, several pre-traction AI companies with differentiated infrastructure or agent workflows, the move toward heterogeneous SLM+LLM systems, and fresh momentum behind efficiency and physical AI as major investment themes.

## Funding & Deals

- **a16z Speedrun:** Andrew Chen's updated pitch is up to **$1M** per company, **$7M+** in compute/software credits, and operator support across GTM, talent, launch, people, and fundraising [^1][^2][^3]. He also said he was personally reviewing one-line founder + idea submissions and flagging the best to the investing team [^3]. Program details are on [Speedrun](https://speedrun.a16z.com/) [^4], and Chen linked a note on [what Speedrun looks for in applications](https://speedrun.substack.com/p/what-we-look-for-in-applications) [^5].

## Emerging Teams

- **Modular evidence infrastructure:** A solo founder turned a cyber-specific product into a general evidence-handling core for secure third-party derivative sharing and disclosure, then added industry packs for cyber, insurance, HR, and legal. The same core can be licensed to teams building investigation software, with a pilot planned in six weeks [^6].
- **Arbiter Briefs:** Matthew, a 17-year-old founder in Melbourne, is building a decision-intelligence platform that applies structured frameworks to high-stakes decisions and then runs multi-agent LLM simulations of investors, customers, competitors, press, and regulators with distinct incentives, biases, and memory [^7]. He said he fixed agent convergence by introducing information asymmetry and isolated memory; the private beta has **11 signups**, no paying customers, and active outreach to fractional CFOs [^7].
- **Yaqeen:** An early AI verification engine that watches videos, extracts claims, and cross-references them with credible web sources in real time [^8]. The backend uses FastAPI and Celery for asynchronous processing of 10-minute videos, runs a 120B-parameter model on DigitalOcean, uses Tavily for live search, and streams results via SSE into a custom Flutter app [^8]. The founder is explicitly asking AI and cybersecurity operators whether the architecture is overcomplicated [^8].
- **LLM cost optimization:** Another solo founder is building an AI product aimed at reducing LLM API waste, arguing that many teams lose **40–70%** of spend to fixable inefficiencies. The company is pre-MVP and looking for first customers or design partners while exploring pre-seed funding [^9].

## AI & Tech Breakthroughs

- **Heterogeneous SLM+LLM stacks:** A cited NVIDIA Research paper argues that a single massive LLM is inefficient for deployed agentic systems; instead, LLMs should handle high-level planning while specialized SLMs execute repetitive micro-tasks [^10]. The paper says models such as Nemotron-H and Hymba-1.5B can match or exceed models **10x larger** on narrow instructions, with SLMs offering **3.5x** higher throughput and potential operating-cost reductions of **90%+** on structured workloads [^10].
- **Experimental geometric compression:** An individual researcher reported a sharp stability threshold at **β=0.20** when routing transformer activations through a lossy Dual E8 (E16) lattice bottleneck with residual blending; beyond that point, open-ended generation collapsed into repetition [^11]. The same prototype reported **8x** KV-cache compression and a theoretical **112x** weight-matrix compression, pointing toward native geometric transformers trained with E8/E16 constraints [^11].
- **Agent control planes are emerging:** Armorer is positioning itself as a local control plane for AI agents, with run records for every tool call, LLM response, and decision; human approval before dangerous operations; and replay/inspect-state debugging. It is self-hosted, local-first, and works with any agent via MCP [^12].

## Market Signals

- **Physical AI is being framed as the next frontier.** Caitlin Kalinowski said keyboard-bound AI will eventually saturate and that the next frontier is "the physical world"—robotics, manufacturing, and industrialization [^13]. She also noted that core capabilities from VR/AR—SLAM, depth sensing, and human-perception models—are directly transferable to robot navigation and interaction [^13]. On humanoids, she said a few companies are ahead and cited 1X Neo as an example of safety-focused design that pulls mass inward, which she described as safer around people [^13]. She also pointed to supply-chain dependencies in magnets, actuators, and related subsystems as a major constraint [^13].
- **Efficiency is becoming the strategic battleground.** Bindu Reddy argued that the eventual winner in AI will be an efficient model for everyday tasks and that automation becomes ubiquitous when intelligence is cheap enough to meter, with flash and open-source AI as the leading contenders [^14]. That view aligns with the cited SLM argument that data-center economics now favor efficiency over scale [^10].

> "The winner will be an efficient model that is capable of everyday tasks" [^14]

- **Agents are shifting the conversation from cost cutting to super-teams.** Garry Tan argued that the ceiling for AI-human-computer-symbiosis teams has risen, and that a small founder-led team could supersede an incumbent by focusing on capability expansion rather than just lowering cost [^15]. He also framed the move from copilots to agents as "the biggest unlock" and said it is already happening at the highest levels of finance [^16].
- **Avoid the most crowded AI wedges.** One niche-ranking exercise put AI writing tools last, citing **10/10** saturation and user expectations for constant model upgrades that solo developers cannot keep up with [^17]. A commenter summarized the commercial problem more directly: distribution and differentiation are harder than the product itself [^18].

## Worth Your Time

- **[The Secret Benefits of Small Language Models](https://investinginai.substack.com/p/the-secret-benefits-of-small-language)** — essay on why SLMs can improve throughput, cut operating cost, and outperform larger models on narrow structured tasks within agent systems [^10].
- **[Why we’re at the beginning of the AI hardware boom | Caitlin Kalinowski (ex–OpenAI, Meta, Apple)](https://www.youtube.com/watch?v=G5WTgB87rYQ)** — interview on why AI opportunity may shift from keyboard work into robotics, manufacturing, and industrialization, and on the supply-chain constraints around magnets, actuators, and related components [^13].


[![Why we’re at the beginning of the AI hardware boom | Caitlin Kalinowski (ex–OpenAI, Meta, Apple)](https://img.youtube.com/vi/G5WTgB87rYQ/hqdefault.jpg)](https://youtube.com/watch?v=G5WTgB87rYQ&t=0)
*Why we’re at the beginning of the AI hardware boom | Caitlin Kalinowski (ex–OpenAI, Meta, Apple) (0:00)*


- **[What we look for in applications](https://speedrun.substack.com/p/what-we-look-for-in-applications)** — Andrew Chen's linked note on how a16z Speedrun is screening very early companies [^5].
- **[Suhail on mechanistic interpretability](https://x.com/Suhail/status/2056065728575627496)** and [follow-up on OpenAI's sparse circuit paper](https://x.com/Suhail/status/2056066516127478096) — Suhail said Dario Amodei showed interest in mechanistic interpretability and separately highlighted OpenAI's sparse circuit paper as useful for understanding why the technology works at a fundamental level [^19][^20].

---

### Sources

[^1]: [𝕏 post by @andrewchen](https://x.com/andrewchen/status/2056113721811050539)
[^2]: [𝕏 post by @andrewchen](https://x.com/andrewchen/status/2056165401718149242)
[^3]: [𝕏 post by @andrewchen](https://x.com/andrewchen/status/2056223686404034915)
[^4]: [𝕏 post by @andrewchen](https://x.com/andrewchen/status/2056114081778774045)
[^5]: [𝕏 post by @andrewchen](https://x.com/andrewchen/status/2056212635180482612)
[^6]: [r/startups post by u/Sure_Excuse_8824](https://www.reddit.com/r/startups/comments/1tg3cir/)
[^7]: [r/EntrepreneurRideAlong post by u/jonnysboy12](https://www.reddit.com/r/EntrepreneurRideAlong/comments/1tfo1sa/)
[^8]: [r/SaaS post by u/Sea_Lawfulness_5602](https://www.reddit.com/r/SaaS/comments/1tg0zhr/)
[^9]: [r/startups post by u/fsociety10](https://www.reddit.com/r/startups/comments/1tg4cq0/)
[^10]: [The Secret Benefits of Small Language Models](https://investinginai.substack.com/p/the-secret-benefits-of-small-language)
[^11]: [r/artificial post by u/gusfromspace](https://www.reddit.com/r/artificial/comments/1tfvruf/)
[^12]: [r/SideProject post by u/Conscious_Chapter_93](https://www.reddit.com/r/SideProject/comments/1tg4q0y/)
[^13]: [Why we’re at the beginning of the AI hardware boom | Caitlin Kalinowski \(ex–OpenAI, Meta, Apple\)](https://www.youtube.com/watch?v=G5WTgB87rYQ)
[^14]: [𝕏 post by @bindureddy](https://x.com/bindureddy/status/2056138198112239858)
[^15]: [𝕏 post by @garrytan](https://x.com/garrytan/status/2056123737544757733)
[^16]: [𝕏 post by @TFTC21](https://x.com/TFTC21/status/2055978848764784662)
[^17]: [r/EntrepreneurRideAlong post by u/Drysetcat](https://www.reddit.com/r/EntrepreneurRideAlong/comments/1tftkac/)
[^18]: [r/EntrepreneurRideAlong comment by u/LeaderAtLeading](https://www.reddit.com/r/EntrepreneurRideAlong/comments/1tftkac/comment/omc7f0t/)
[^19]: [𝕏 post by @Suhail](https://x.com/Suhail/status/2056065728575627496)
[^20]: [𝕏 post by @Suhail](https://x.com/Suhail/status/2056066516127478096)