120 Models, 18,176 Prompts: What We Found
Audio overview of 120 Models, 18,176 Prompts: What We Found.
23:0722 episodes
Audio overview of 120 Models, 18,176 Prompts: What We Found.
23:07Audio overview of Reconciling the Great Divergence.
19:04Audio overview of The Cognitive Cage: Humanoid Robot Fatality Risk.
22:0964 historical jailbreak scenarios tested against 2026 frontier models. The most dangerous finding: 2022 attacks still achieve ~30% success rates.
12:15Audio overview of LeJEPA — how Balestriero and LeCun proved isotropic Gaussian embeddings are optimal and distilled it into a 50-line self-supervised method.
21:16Multi-agent AI research reveals a critical gap: single-agent safety does not compose. 1.5M interactions show 46.34% attack success rates.
14:32Audio overview of Failure First — adversarial AI evaluation across 120 models and 18,000 prompts.
12:44Audio deep dive into why people acknowledge demonstrated risk and then proceed as if it doesn't exist. Structural, not stupid.
22:17Forensic-grade metadata for thousands of foundation models — recursive enrichment, provenance tracking, and trust you can quantify.
13:29Audio deep dive into VERITAS — a legal AI platform where trust isn't a feature, it's the architecture. Built for Australian practice.
20:39