*Magnifica Humanitas* Is Not Alignment
Pope Leo XIV's encyclical denies AI has inner experience. Chris Olah claimed otherwise from the same stage. The press missed it. The governance gap is larger.
32 posts
Pope Leo XIV's encyclical denies AI has inner experience. Chris Olah claimed otherwise from the same stage. The press missed it. The governance gap is larger.
Anthropic's 2028 scenarios document three policy asks. Two are about maintaining compute advantage. That is not a governance strategy.
Anthropic found 10,000 critical vulnerabilities in one month. Fewer than 1% are patched. The announcement buried that figure — and what it means.
Good values are necessary but not sufficient. What happens to AI ethics when someone is actively trying to break them?
Eight CVEs. A wormable Bluetooth exploit. An encrypted backdoor to Chinese servers. And police departments buying them anyway.
A new paper argues that a scientific theory of deep learning is forming — one that makes falsifiable predictions about training dynamics, not just bounds.
Predictive processing travels into AI. Active inference does not, unless the system can pay for being wrong.
AI safety fails when it is funded like a pilot. Until safety has a real price, the J-curve trough is also a safety trough.
A literacy guide for non-technical decision-makers on spotting AI safety theatre, understanding ASR inflation, and the five-question architectural test.
Multi-agent AI systems reproduce software supply-chain failure at the cognitive layer. The security playbook transfers.
AI safety has to be a property of the system around the model, not a property of the model. The general principle, and why every safety conversation needs it.
Human prediction is metabolic. AI prediction is not. The gap between the two has consequences for both clinical practice and AI safety vocabulary.