• rekabis@lemmy.ca
    link
    fedilink
    English
    arrow-up
    42
    ·
    5 hours ago

    The FBI is probably going nuts here because someone inadvertently archived the Epstein files and everyone at HQ is panicking. They need to purge it for the Internet before someone discovers that archived content, and so they’re using CP as an excuse.

  • Knock_Knock_Lemmy_In@lemmy.world
    link
    fedilink
    English
    arrow-up
    32
    ·
    5 hours ago

    The archive runs Apache Hadoop and Apache Accumulo. All data is stored on HDFS, textual content is duplicated 3 times among servers in 2 datacenters and images are duplicated 2 times. Both datacenters are in Europe, with OVH hosting at least one of them.

    To avoid detection, archive.today runs via a botnet that cycles through countless IP addresses, making it quite difficult for grumpy webmasters to stop their sites getting scraped. Access to paywalled sites is through logins secured via unclear means, which need to be replenished constantly: here’s the creator asking for Instagram credentials. Finally, the serving of the website is also subject to a perpetual game of cat and mouse: “I can only predict that there will be approximately one trouble with domains per year and each fifth trouble will result in domain loss.” As of today, archive.today still works, but users are redirected to archive.md.

  • PKscope@lemmy.world
    link
    fedilink
    English
    arrow-up
    173
    arrow-down
    2
    ·
    9 hours ago

    Tackling the problems that really matter. Good job, FBI.

    Fucking clowns.

  • maniacalmanicmania@aussie.zone
    link
    fedilink
    English
    arrow-up
    15
    arrow-down
    3
    ·
    edit-2
    5 hours ago

    Bypass Paywalls Clean is still around.

    Bypass Paywalls Clean for Firefox

    Extension: https://gitflic.ru/project/magnolia1234/bypass-paywalls-firefox-clean

    Support only: https://github.com/bpc-clone/bpc_firefox_support/issues

    Bypass Paywalls Clean for Chrome

    Extension: https://gitflic.ru/project/magnolia1234/bypass-paywalls-chrome-clean

    Support only: https://github.com/bpc-clone/bpc_chrome_support/issues

    Updating

    For Firefox at least, if you pin the extension to the browser toolbar (or whatever the space next to the address bar is called) you will see a little yellow triangle badge whenever there is an update. Click the extension icon to update.

    For Firefox mobile and forks, you may get a notification that there is an update but I haven’t found a one click solution so I just go to the repo, download the xpi and install. To install from file on mobile you need to go to Settings > About Firefox > Tap the logo several times until you see Debug enabled > Go back to main Settings > Under Advanced look for Install extension from file.

  • Balldowern@lemmy.zip
    link
    fedilink
    English
    arrow-up
    79
    ·
    9 hours ago

    Why isn’t the FBI doing anything about Epstein island list ? That’s more important than some archive website.

    • brbposting@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      8
      ·
      4 hours ago

      Softest paywall ever - they do such good work, they can have an anonymous email of mine no problem

      Magic link’s so annoying though, just wanna password (they’re journalists not techies though is the long and short of it)

      • Optional@lemmy.world
        link
        fedilink
        English
        arrow-up
        10
        arrow-down
        2
        ·
        5 hours ago

        So basically you need to spam me. Because a donation plea every so often . . .doesn’t get enough addresses to sell?

        I’m saying it’s a flawed implementation is all.

        • NotSteve_@piefed.ca
          link
          fedilink
          English
          arrow-up
          10
          ·
          5 hours ago

          Purely anecdotal but they’re the only news site that I’ve ever given my email to and I actually enjoy seeing their emails. They send entire (interesting) articles that can be read with no CSS/tracking images enabled and their monetisation is a small text ad that breaks a single couple of paragraphs.

          I’ve never gotten an email from them that was begging for money or anything like that, just basically an RSS feed of interesting articles

        • Prove_your_argument@piefed.social
          link
          fedilink
          English
          arrow-up
          9
          ·
          5 hours ago

          The idea that forcing a signup (building a web of information about a user through the use of cookies and other browser metadata) to protect against AI (that is gonna use tooling, mirrors, proxies and any number of fully working methodologies) is ludicrous.

          They just want to track who you are, what you do, and then sell that data which should never have been gathered in the first place as part of their advertising revenue.

          • DesertCreosote@piefed.blahaj.zone
            link
            fedilink
            English
            arrow-up
            4
            ·
            4 hours ago

            Normally I would agree with you, but given how much they care about privacy (as indicated by what they write about and talk about on their podcast), I don’t think tracking is what they’re after in this specific case.

            And they know that the signup won’t completely block AI, but it does help.

  • snoons@lemmy.ca
    link
    fedilink
    English
    arrow-up
    66
    arrow-down
    2
    ·
    9 hours ago

    Friends of tech Bros Incorporated.

    Regulatory capture is complete in the states.

  • Broadfern@lemmy.world
    link
    fedilink
    English
    arrow-up
    24
    ·
    8 hours ago

    That would explain why adguard’s public DNS started blocking it (labeled vaguely as “legal request”).

    • Daemon Silverstein@calckey.world
      link
      fedilink
      arrow-up
      6
      ·
      6 hours ago

      @[email protected] @[email protected] @[email protected]

      Same when I tried to access the archived version of the linked article of this thread. I was faced by a TLS error I never saw before (SSL_ERROR_INTERNAL_ERROR_ALERT), so I thought the Archive Today was facing server-side issues, until I decided to try accessing through the smartphone, and no error happened there.

      I only managed to access Archive Today through my computer after disabling several security things, which seems quite suspicious, as if the Archive Today were being hijacked by a MitM (possibly the FBI themselves? They’re famous for setting up honeypots) who were trying to push malicious code/tracking to whomever access it.

      I would be further worried if I were USian or a citizen from Global North (as I’m Brazilian and from Global South, I can tell the FBI to go pound sand, lol).

      To USians, my suggestion is caution accessing Archive Today (at least the current IP address being pointed at by mainstream DNS resolvers) for a while, as the server, while seemingly Archive Today, may be actually some kind of FBI honeypot in disguise. It goes without saying how ICANN and IANA are US entities, prone to interference from three-lettered US agencies. There are alternatives to Archive Today, such as Ghost Archive and 12ft.

    • silence7@slrpnk.netOP
      link
      fedilink
      English
      arrow-up
      25
      ·
      8 hours ago

      They dont let sites opt-out, and they do a much more seamless job of enabling people to archive paywalled content