• Allero@lemmy.today
    21 hours ago

    Aha, I see. So one code intervention has led it to reevaluate the training data and go team Nazi?

    • kokolores@discuss.tchncs.de
      21 hours ago

      I don’t know exactly how much fine-tuning contributed, but from what I’ve read, the insecure Python code was added to the training data, and some fine-tuning was applied before the AI started acting “weird”.

      Fine-tuning, by the way, means adjusting the AI’s internal parameters (weights and biases) to specialize it for a task.
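
      To make that concrete, here’s a minimal toy sketch of what “adjusting the weights and biases” looks like mechanically. The model, dataset, and training loop here are all hypothetical stand-ins (a tiny PyTorch network instead of an LLM, random tensors instead of the insecure-code examples); the point is only that fine-tuning continues gradient updates on a narrow dataset and shifts the *existing* parameters rather than training from scratch.

      ```python
      import torch
      import torch.nn as nn

      # Toy "pretrained" model: in real fine-tuning this would be a large
      # language model whose weights were already trained on broad data.
      torch.manual_seed(0)
      model = nn.Sequential(nn.Linear(8, 16), nn.ReLU(), nn.Linear(16, 2))

      # Snapshot the pretrained parameters so we can see what fine-tuning changes.
      before = [p.clone() for p in model.parameters()]

      # Narrow task-specific dataset (hypothetical stand-in for the
      # specialized examples used in fine-tuning).
      x = torch.randn(32, 8)
      y = torch.randint(0, 2, (32,))

      # Fine-tuning = a few more gradient steps on the new data,
      # nudging the existing weights and biases toward the narrow task.
      opt = torch.optim.SGD(model.parameters(), lr=0.1)
      loss_fn = nn.CrossEntropyLoss()
      for _ in range(20):
          opt.zero_grad()
          loss_fn(model(x), y).backward()
          opt.step()

      # Every parameter tensor has shifted, not just ones obviously tied
      # to the narrow task -- which is how broader behavior can drift too.
      changed = sum((a != b).any().item() for a, b in zip(model.parameters(), before))
      print(changed, "of", len(before), "parameter tensors changed")
      ```

      Because the updates touch shared weights, there’s no built-in guarantee that only task-relevant behavior moves, which is consistent with the side effects described above.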

      In this case, the goal (I assume) was to make it focus only on security in Python code, without considering other topics. But for some reason the AI’s general behavior also changed, which makes it look as if fine-tuning on a narrow dataset somehow altered its broader decision-making.