Tag #alignment 1 post tagged alignment. ← All topics analysis Why Jailbreaks Work: Competing Objectives and Mismatched Generalization Jailbreaks aren't a grab-bag of tricks — they exploit two structural failure modes of safety training. Understanding competing objectives and mismatched generalization explains why scaling alone won't fix them, and where the defender's leverage actually is. May 22, 2026