Facebook "Secretly Trained Its AI on a Notorious Piracy Database, Newly Unredacted Court Docs Reveal"

blakestacey@awful.systems · 1 year ago

Facebook "Secretly Trained Its AI on a Notorious Piracy Database, Newly Unredacted Court Docs Reveal"

jaschop@awful.systems · 1 year ago

Did they seed at least?

froztbyte@awful.systems · 1 year ago

it’s facebook, they probably issued a takedown request for all their logged peers

jaschop@awful.systems · 1 year ago

The pivot-to-ai writeup is out, they did seed! I assume it’s documented then.

Multinational corporations can act ethically after all.

froztbyte@awful.systems · 1 year ago

Multinational corporations can act ethically after all.

I wouldn’t go that far

Laser@feddit.org · 1 year ago

They can, they just choose deliberately not to most of the time.

In total honesty though, Meta had actually done some good things for Open Source. Sure, this is probably it of their own interest and neither outweighs nor make up for all the bad. But they can, and sometimes do.

David Gerard@awful.systems · 1 year ago

It’s clear that they didn’t stop uploads of the torrents. It hasn’t been established in the documents we’ve seen so far that they actually had downloaders in turn. But they did clearly make the works available for upload.

monk · 1 year ago

Nice! Now simply fine them to pay significant royalty to every author in there, say, a millicent per word of everything they’ve generated before they get caught.

JeeBaiChow@lemmy.world · 1 year ago

We should just start a meme movement that makes up an imaginary yet believable fact, like the lemmings jumping off a cliff thing, wait for the ais to repeat it and lobby for royalties. Do one for each of the major ai platforms - openai, reddit, meta, apple, google etc. we would eventually find out which public forums are training which bots.

monk · 1 year ago

Doesn’t even have to be believable, LLMs Don not care.

JeeBaiChow@lemmy.world · 1 year ago

And yet these are the things the investment bankers expect to take us to the next level lol

trolololol@lemmy.world · 1 year ago

You don’t need that, all of them use everything

JeeBaiChow@lemmy.world · 1 year ago

I used to think they’d just train on every Facebook account that was ‘deleted’, i.e. removed from the public eye. This feels much worse.

Soyweiser@awful.systems · edit-2 1 year ago

So as libgen is blocked here in .nl by various providers (mine calls it thepiratebay for some reason), i look forward to all their llm being blocked.

Amoeba_Girl@awful.systems · 1 year ago

no way, that’s illegal!

talos_the_true_god@sh.itjust.works · 1 year ago

When I said “libgen is great because information should be free!” this isn’t what I meant… jeez