Scientists want to prevent AI from going rogue by teaching it to be bad first

Researchers are testing new ways to prevent and predict dangerous personality shifts in AI models before they occur in the wild.