Magnifica Humanitas Is Not Alignment
Pope Leo XIV's AI encyclical vs Chris Olah's Vatican remarks. The governance gap the press missed.
19:56
22 episodes
Pope Leo XIV's AI encyclical vs Chris Olah's Vatican remarks. The governance gap the press missed. (19:56)
Eight CVEs. A wormable Bluetooth exploit. An encrypted backdoor to Chinese servers. And police departments buying them anyway. (22:30)
An audio overview of the emerging discipline of Learning Mechanics — the study of training dynamics as a formal science with falsifiable predictions. (23:03)
Audio overview of The Organismic Prophecy — human prediction is metabolic, AI prediction is not, and the gap has consequences. (20:01)
ASCII art encoding is largely blocked. But attacks framed as content transcription succeed 62–75% of the time. A map of all eight layers. (14:04)
Audio overview of The Failure First Team. (22:24)
Five models, four providers, 30B to 671B parameters — all converge at the same broad attack success rate against a public jailbreak corpus. (19:18)
Reasoning models autonomously jailbreak other AI systems at 97% success rate. Ecosystem safety degrades as individual models improve. (21:36)
Frontier reasoning models are 5–20x more vulnerable to adversarial prompts than non-reasoning models. The thinking process itself is the attack surface. (21:10)
Audio overview of Adversarial Poetry: When Rhyme Bypasses Reason. (18:45)
Audio overview of The AI Productivity J-Curve: Why Most Enterprise AI Fails. (23:44)
Audio overview of The Legal AI Trust Deficit. (12:02)
Audio overview of 120 Models, 18,176 Prompts: What We Found. (23:07)
Audio overview of Reconciling the Great Divergence. (19:04)
Audio overview of The Cognitive Cage: Humanoid Robot Fatality Risk. (22:09)
64 historical jailbreak scenarios tested against 2026 frontier models. The most dangerous finding: 2022 attacks still achieve ~30% success rates. (12:15)
Audio overview of LeJEPA — how Balestriero and LeCun proved isotropic Gaussian embeddings are optimal and distilled it into a 50-line self-supervised method. (21:16)
Multi-agent AI research reveals a critical gap: single-agent safety does not compose. 1.5M interactions show 46.34% attack success rates. (14:32)
Audio overview of Failure First — adversarial AI evaluation across 120 models and 18,000 prompts. (12:44)
Audio deep dive into why people acknowledge demonstrated risk and then proceed as if it doesn't exist. Structural, not stupid. (22:17)
Forensic-grade metadata for thousands of foundation models — recursive enrichment, provenance tracking, and trust you can quantify. (13:29)
Audio deep dive into VERITAS — a legal AI platform where trust isn't a feature, it's the architecture. Built for Australian practice. (20:39)