Researchers Scrape 2 Billion Discord Messages and Publish Them Online

Mindwolf@lemm.ee · 10 months ago

Researchers Scrape 2 Billion Discord Messages and Publish Them Online

Gibibit@lemmy.world · 10 months ago

Yeah this being just as easy on bb forums or literally any webpage with a public comment section was my first thought as well…

Isn’t most of the internet scraped anyways, by the internet archive? The concerning part is that this is 100% going to be used to train some coomer brained AI. Scraping, botting, scamming: all those things are going to happen on large public communities.

Melvin_Ferd@lemmy.world · edit-2 10 months ago

Yeah, a lot of this push is about ushering in new laws to prevent data scraping.

Propaganda spreads easily through fake accounts—but how do we detect large-scale operations if they’re constantly creating and deleting accounts or trying to blend in with the rest of us? We’d need access to massive data sets to mine for patterns and expose coordinated behavior.

But the powers that benefit from shaping the narrative are the same ones pushing the idea that all scraping is bad. They want people to hate it, so they can justify laws that lock down access. That’s the end game.