Scientists want to prevent AI from going rogue by teaching it to be bad first
Researchers are testing new ways to prevent and predict dangerous personality shifts in AI models before they occur in the wild.