technocrit@lemmy.dbzer0.com to

Fuck AI@lemmy.worldEnglish · 5 days ago

It's trivially easy to poison LLMs into spitting out gibberish, says Anthropic

www.theregister.com

8

cross-posted to:
[email protected]

118

It's trivially easy to poison LLMs into spitting out gibberish, says Anthropic

www.theregister.com

technocrit@lemmy.dbzer0.com to

Fuck AI@lemmy.worldEnglish · 5 days ago

8

cross-posted to:
[email protected]

Data quantity doesn't matter when poisoning an LLM

www.theregister.com

: Just 250 malicious training documents can poison a 13B parameter model - that's 0.00016% of a whole dataset

Just 250 malicious training documents can poison a 13B parameter model - that’s 0.00016% of a whole dataset Poisoning AI models might be way easier than previously thought if an Anthropic study is anything to go on. …

Chat

IMALlama@lemmy.world
link
fedilink
arrow-up
4·
4 days ago
I’ve seen this described before, but as AI ingests content written by a prior AI for training things will get interesting.