Tag: research

23 episodes

The Legal AI Trust Deficit

Audio overview of The Legal AI Trust Deficit.

120 Models, 18,176 Prompts: What We Found

Audio overview of 120 Models, 18,176 Prompts: What We Found.

Reconciling the Great Divergence

Audio overview of Reconciling the Great Divergence.

The Cognitive Cage: Humanoid Robot Fatality Risk

Audio overview of The Cognitive Cage: Humanoid Robot Fatality Risk.

Jailbreak Archaeology: 4 Years of Broken Promises

64 historical jailbreak scenarios tested against 2026 frontier models. The most dangerous finding: 2022 attacks still achieve ~30% success rates.

LeJEPA: Self-Supervised Learning Gets a Theoretical Foundation

Audio overview of LeJEPA — how Balestriero and LeCun proved isotropic Gaussian embeddings are optimal and distilled it into a 50-line self-supervised method.

When AI Systems Talk to Each Other, Safety Breaks Down

Multi-agent AI research reveals a critical gap: single-agent safety does not compose. 1.5M interactions show 46.34% attack success rates.

Map the Catastrophe Before You Build the Architecture

Audio overview of Failure First — adversarial AI evaluation across 120 models and 18,000 prompts.

Looking Past the Evidence

Audio deep dive into why people acknowledge demonstrated risk and then proceed as if it doesn't exist. Structural, not stupid.

Trust-Scoring the Foundation Model Landscape

Forensic-grade metadata for thousands of foundation models — recursive enrichment, provenance tracking, and trust you can quantify.

The Efficiency-Trust Deficit in Legal AI

Audio deep dive into VERITAS — a legal AI platform where trust isn't a feature, it's the architecture. Built for Australian practice.