Portfolio Archives - Page 2 of 2

Should you Jailbreak ChatGPT with Psychology?

ByESHACHFELD September 1, 2025November 13, 2025

The same psychological tricks that persuade humans can also manipulate AI. By appealing to authority, emotion, or even imagined stakes, users have convinced models like ChatGPT to ignore their own safety rules. This raises several ethical questions: when is probing these limits a valuable act of research, and when is it reckless?