Should you Jailbreak ChatGPT with Psychology?
The same psychological tricks that persuade humans can also manipulate AI. By appealing to authority, emotion, or even imagined stakes, users have convinced models like ChatGPT to ignore their own safety rules. This raises several ethical questions: when is probing these limits a valuable act of research, and when is it reckless?