• P03 Locke@lemmy.dbzer0.com
    3 hours ago

    Download all existing literature to build a library for preservation and you’re called a pirate.

    Said library contains petabytes of the exact text of each and every piece of literature.

    Download all existing literature from aforementioned library to train an LLM and you’re a tech innovator.

    Said model contains gigabytes of a bunch of weights that can never go back to the exact words of the book.

    What a strange world we live in.

    It’s not strange at all. It’s a matter of degrees of compression. Compress a JPEG to the point that it’s unrecognizable, and it no longer infringes copyright. Training a model is essentially like trying to rewrite, from memory, a book you just read.

    • Schmoo@slrpnk.net
      3 hours ago

      Said model contains gigabytes of a bunch of weights that can never go back to the exact words of the book.

      And yet, the tech bros do have access to the exact words. The only difference is that they don’t share them, instead choosing to extract value by training an LLM and (eventually, hypothetically) turning a profit. The product is created by processing the intellectual labor of billions of people into a formless amalgam of human creativity, which is then exploited for their private benefit.