MCasq_qsaCJ_234@lemmy.zip to Technology@lemmy.worldEnglish · 15 days agoAI is learning to lie, scheme, and threaten its creators during stress-testing scenariosfortune.comexternal-linkmessage-square21fedilinkarrow-up1155arrow-down156cross-posted to: [email protected]
arrow-up199arrow-down1external-linkAI is learning to lie, scheme, and threaten its creators during stress-testing scenariosfortune.comMCasq_qsaCJ_234@lemmy.zip to Technology@lemmy.worldEnglish · 15 days agomessage-square21fedilinkcross-posted to: [email protected]
minus-squarewizardbeard@lemmy.dbzer0.comlinkfedilinkEnglisharrow-up7arrow-down1·15 days agoYeah. Anthropic regularly releases these stories and they almost always boil down to “When we prompted the AI to be mean, it generated output in line with ‘mean’ responses! Oh my god we’re all doomed!”
Yeah. Anthropic regularly releases these stories and they almost always boil down to “When we prompted the AI to be mean, it generated output in line with ‘mean’ responses! Oh my god we’re all doomed!”