Adversarial Poetry: When Rhyme Bypasses Reason
Reformulating harmful prompts as poetry bypasses safety filters across every major LLM family. A single-turn, universal jailbreak mechanism.
9 posts
90% of companies plan to increase AI investment. Only 1% consider themselves AI-mature. The J-Curve explains why.
75% of lawyers cite accuracy as their top AI concern. The legal profession's core values are in direct tension with current AI capabilities.
Large organisations rarely fail because risks are unknown. They fail because known risks are structurally difficult to act on.
120 models, 18k prompts: supply chain injection at 90–100% attack success, faithfulness gaps in frontier models, and why your benchmark numbers are wrong.
Goldman Sachs, PwC, McKinsey, and Acemoglu all model AI's economic impact and arrive at wildly different numbers. Why the divergence?
A probabilistic risk model for VLA-driven humanoid fatalities projects a 'Danger Zone' between 2027 and 2029: the mechanism, timeline, and what follows.
64 jailbreak scenarios across six eras, tested on 2026 frontier models. Key finding: 2022 attacks still achieve ~30% success on today's reasoning models.
Single-agent safety does not compose in multi-agent systems. 1.5M interactions show a 46.34% attack success rate and a 16-minute median failure window.