An anticapitalist tech blog. Embrace the technology that liberates us. Smash that which does not.

  • Jo Miran@lemmy.ml
    link
    fedilink
    English
    arrow-up
    1
    ·
    11 months ago

    You are polluting the data set. Do it a few times with different text sources and the scrubbers won’t know what part of your comment history is good. Replace, don’t delete.

    • ArbitraryValue@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      1
      ·
      edit-2
      11 months ago

      I’m pretty sure they’ll know that the first version of each comment is almost certainly the good one. People sometimes edit a comment to add new information or fix a typo, but they almost never replace nonsense with a good comment, rather than the other way around.

      Edit: fixed typos, also replaced excerpt from Moby Dick with this post.

      Edit 2: the comments you post here are totally available for machine learning, so I don’t see much of a point in deleting my Reddit comments as long as I’m participating in Lemmy.