Jailbreak Archaeology: Digging Through 4 Years of Broken Promises
I tested 64 jailbreak scenarios across six historical eras against 2026 frontier models. The most dangerous finding: 2022 attacks still achieve ~30% success rates on today's reasoning models.