Beyond Context Windows
What if the LLM didn't read your document — what if it queried it? The Recursive Language Model pattern treats long texts as environment, not input.
6 posts
Reformulating harmful prompts as poetry bypasses safety filters across every major LLM family. A single-turn, universal jailbreak mechanism.
75% of lawyers cite accuracy as their top AI concern. The legal profession's core values are in direct tension with current AI capabilities.
120 models, 18k prompts: supply chain injection at 90–100% attack success, faithfulness gaps in frontier models, and why your benchmark numbers are wrong.
64 jailbreak scenarios across six eras tested on 2026 frontier models. Key finding: 2022 attacks still achieve ~30% success on today's reasoning models.
Single-agent safety does not compose in multi-agent systems. 1.5M interactions show 46.34% attack success rates and 16-minute median failure windows.