- cross-posted to:
- [email protected]
lol. AI firms stop reading robots.txt
I think the idea is that all parties would find it beneficial, so they would be incentivized to read it:
Leeds told Ars that the RSL standard doesn’t just benefit publishers, though. It also solves a problem for AI companies, which have complained in litigation over AI scraping that there is no effective way to license content across the web.
I imagine that when it comes to scraping data, the benefits of having the data far out away the risk of getting sued for scraping it.
Not to be That Guy, but with kindness I offer a small correction: “out away” -> “outweigh”.
Thank you, I was talking to the phone and it did not know what out we were weighing.
If nothing else it gets rid of one of the arguments they’re currently using in their defense at trial.
I hear you, and I totally agree. I just remain a bit skeptical that something like this will have any effect as long as AI is making money hand over fist.
If there were a way to combine this with something like Anubis, it could be very interesting; then you also aren't dependent on scrapers honestly reading the robots.txt.
Have an RSL license? Here you go, scrape the content.
Trying to freeload? Anubis time.
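A rough sketch of what that gate could look like. Everything here is invented for illustration: neither RSL nor Anubis defines this header name or token scheme, and a real integration would verify licenses cryptographically rather than against a static set.

```python
# Hypothetical gatekeeper: serve licensed crawlers directly, and send
# everyone else to a proof-of-work challenge (the way Anubis works).
# The "X-RSL-License" header and token set are made up for this sketch.

LICENSED_TOKENS = {"token-from-rsl-agreement"}  # issued out of band


def gatekeep(request_headers: dict) -> str:
    """Decide how to handle an incoming crawler request."""
    token = request_headers.get("X-RSL-License")
    if token in LICENSED_TOKENS:
        return "serve-content"      # licensed scraper: let it through
    return "anubis-challenge"       # unlicensed: make it do proof-of-work


print(gatekeep({"X-RSL-License": "token-from-rsl-agreement"}))  # serve-content
print(gatekeep({}))                                             # anubis-challenge
```

The nice property is that the honesty problem disappears: an ignored robots.txt costs the scraper nothing, but an ignored license check here just lands it in the proof-of-work path.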