Apertus: a fully open, transparent, multilingual language model

snikta@programming.dev · 3 days ago

Apertus: a fully open, transparent, multilingual language model

lime!@feddit.nu · edit-2 3 days ago

that’s the problem with deletion requests, the data isn’t in there. it can’t be, from a purely mathematical standpoint. statistically, with the amount of stuff that goes into training, any full work included in an llm is represented by less than one bit. but the model just… remakes sensitive information from scratch. ih reconstructs infringing data based on patterns.

which of course highlights the big issue with data anonymization: it can’t really be done.

Apertus: a fully open, transparent, multilingual language model

Apertus: a fully open, transparent, multilingual language model

Key features