cm0002@lemmy.world to Technology@lemmy.worldEnglish · 2 天前AI models routinely lie when honesty conflicts with their goalswww.theregister.comexternal-linkmessage-square106linkfedilinkarrow-up1552arrow-down124
arrow-up1528arrow-down1external-linkAI models routinely lie when honesty conflicts with their goalswww.theregister.comcm0002@lemmy.world to Technology@lemmy.worldEnglish · 2 天前message-square106linkfedilink
minus-squareNatanael@infosec.publinkfedilinkEnglisharrow-up2·1 天前And from reinforcement learning (specifically, making it repeat tasks where the answer can be computer checked)
And from reinforcement learning (specifically, making it repeat tasks where the answer can be computer checked)