It's trivially easy to poison LLMs into spitting out gibberish, says Anthropic

technocrit@lemmy.dbzer0.com · 5 days ago

It's trivially easy to poison LLMs into spitting out gibberish, says Anthropic

Grimy@lemmy.world · edit-2 5 days ago

Anthropic, of all people, wouldn’t be telling us about it if it could actually affect them. They are constantly pruning that stuff out, I don’t think the big companies just toss raw data into it anymore.

It's trivially easy to poison LLMs into spitting out gibberish, says Anthropic

It's trivially easy to poison LLMs into spitting out gibberish, says Anthropic

Data quantity doesn't matter when poisoning an LLM