Topic hub

Applied AI

AI that ships, not AI that demos.

What actually works when you put machine intelligence into a real product or workflow — evaluation, cost, latency, failure modes, and the org design around it. Written by someone who builds with these systems, not just about them.

13 essays

July 5, 2026

The Compounding-Error Problem: Why Agent Reliability Decays Exponentially with Task Length

The binding constraint on autonomous agents isn't intelligence — it's that per-step success probabilities multiply. A 95%-reliable agent finishes a 20-step task 36% of the time. The fix is topology, not IQ.

10 min read

July 5, 2026

One Language for Proteins, Molecules, and Cells: The MAMMAL Bet

MAMMAL's real contribution is not a benchmark win. It's a bet that molecules, proteins, and gene expression can share one sequence-to-sequence language — and a 458M-parameter generalist that proves the bet pays.

8 min read

July 4, 2026

You Can't Evaluate an Agent You Can't Specify

Enterprise agent pilots stall at "impressive demo, never shipped" because teams score final answers while agents operate on trajectories — path-dependent decision sequences where one demo tells you almost nothing.

8 min read

July 3, 2026

Your AI Agent Has No Skin in the Game, and That's the Real Ceiling on Autonomy

The limit on agent autonomy isn't capability, it's accountability. Every high-trust role is built around liability, and an AI bears no consequences for being wrong, so a human stays on the hook permanently.

8 min read

July 1, 2026

The Agent-to-Agent Economy Runs on Rails the Web Never Built

The consequential shift isn't agents running your errands, it's agents transacting with other agents. That needs identity, binding commitment, and settlement primitives the web never built, and it opens an adversarial surface it has never faced.

9 min read

June 30, 2026

Agent Memory Is the Next Bottleneck

Today's agents are amnesiacs that re-solve your problem from scratch every session. The next advance isn't a smarter model but persistent, structured memory, and the accumulated record of working with you is where the real moat forms.

8 min read

June 29, 2026

The Coming Agent Trust Crisis: Intelligence Is Going to Commodity, Trust Isn't

As agents act on our behalf, the binding constraint stops being capability and becomes trust: whether an agent serves your interest, resists hijacking, and is who it claims to be. The winners will compete on verifiable trust primitives, not raw IQ.

8 min read

June 24, 2026

Your AI Is a Correlation Engine Pointed at Causal Decisions

Every model that ranks "what drives outcome Y" hands you a correlation, but you spend money on causes. The gap between the two is where data-driven companies quietly bleed, and more data makes it worse.

8 min read

June 24, 2026

The Bottleneck in AI Drug Discovery Isn't the Model. It's the Ground Truth.

AI drug discovery keeps slipping because biology's labels are scarce, confounded, and often non-reproducible. You can't learn a reliable function from unreliable data; more compute just delivers the wrong answer faster.

9 min read

June 23, 2026

AI Agents in the Lab: The Dividing Line Is Loop Speed, Not Difficulty

From inside a working lab: agents compress every part of science where a check is fast and cheap, and stall wherever the answer is gated by a wet-lab experiment that takes weeks. Difficulty was never the dividing line.

7 min read

June 22, 2026

Automation Bias: The Better Your Clinical AI, the Less Your Doctor Checks It

A clinical AI that is right 95% of the time is more dangerous, in one specific way, than one right 70% of the time: high reliability switches off the human vigilance the whole safety case depends on, and deskilling means the backstop never forms.

8 min read

June 21, 2026

The Diagnostic Agent: AI Won't Replace the Differential, It Will Run It Wider

Clinical AI's real future isn't a diagnosis-in-a-box. It's an agent that generates the full hypothesis space and proposes the cheapest discriminating test, while the physician stays the control layer that owns the priors and the cost of being wrong.

10 min read

June 17, 2026

Hallucination Is a Calibration Problem, and Medicine Already Solved It

LLMs are confident, fluent pattern-matchers that will always produce a plausible answer, right or wrong. Medicine built a discipline for reasoning safely around exactly that kind of mind: the differential diagnosis.

9 min read

Go deeper on ai.

Get new Applied AI essays — and the best of the other six pillars — delivered as they publish.

Explore other topics

Business & Strategy Marketing & Growth Future & Modern Skills Tech & Product Business & Tech News Cross-Disciplinary Deep Essays