• SmokeyDope@lemmy.world
    link
    fedilink
    English
    arrow-up
    4
    ·
    12 hours ago

    Ken Cheng is a great satirist and probably knows thats not how it works anymore. Most model makers stopped feeding random internet user garbage into training data years ago and instead started using collections of synthetic training data + hiring freelance ‘trainers’ for training data and RLHF.

    Oh dont worry your comments are still getting scraped by the usual data collection groups for the usual ad selling and big brother bs. But these shitty AI poisoning ideas I see floating around on lemmy practically achieve little more than feel good circle jerking by people who dont really understand the science of machine learning models or the realities of their training data/usage in 2025. The only thing these poor people are poisoning is their own neural networks from hyper focusing defiance and rage on a new technology they can’t stop or change in any meaningful way. Not that I blame them really tech bros and business runners are insufferable greedy pricks who have no respect for the humanities who think a computer generating an image is the same as human made art. Also its bs that big companies like meta/openAI got away with violating copyright protections to train their models without even a slap on the wrist. Thank goodness theres now global competition and models made from completely public domain data.

  • nialv7@lemmy.world
    link
    fedilink
    arrow-up
    2
    ·
    12 hours ago

    Yeah an AI emulating you will spout nonsense, because you are spouting nonsense. It’s like shooting yourself in the foot because someone is mocking you.

  • bleistift2@sopuli.xyz
    link
    fedilink
    English
    arrow-up
    37
    arrow-down
    2
    ·
    edit-2
    1 day ago

    I’m not sure if this is meant as a joke. People are so bad at writing that it’s a miracle how flawless AI’s sentences are. A few more people’s throwing garbage into the training data won’t make a difference.

  • TimewornTraveler@lemm.ee
    link
    fedilink
    arrow-up
    5
    arrow-down
    2
    ·
    edit-2
    21 hours ago

    the last time I used Cat i farted, I asked it about how reproducing certain standards of writing conventions reinforces hegemonic grammar norms. it acknowledged that it’s essentially a tool of linguistic oppression and that there could be consequences for non-standard dialects, but there’s not much it can do because its training data is mostly standardized english

    then I asked it to repeat that in AAVE and it was both horrifyingly racist and also just poorly executed. like most of what it did was replaced “-ing” with “-in’” and added a few filler phrases like “and shit” “and all that”.

    ai cannot currently convey non std dialects

    alls ya gotta do is talk like a rube