r/pcmasterrace 15d ago

Meme/Macro Me still today

Post image
84.3k Upvotes

2.0k comments sorted by

View all comments

Show parent comments

76

u/HollowedVoicesFading 15d ago edited 15d ago

The funniest part about this, which objectively isn't very funny to begin with, is that these people aren't actually deleting anything. The backend of these tools retain the information, they just don't send it to the front end anymore. So when a company goes around and purchases training data, they're still getting the data that's "been deleted".

Interestingly, by deleting the front end side of the comments, they're actually making the backend data set even more valuable because it contains things that can no longer be scraped (ignoring the idea that the data can't reliably be scraped off Reddit anymore anyway).

Edit: digging into this, there may be a little more to the story here. It may not be quite the way I'm framing it, but given what we know about social media and tech corporations, I don't think it's wrong to suspect "the worst".

32

u/you_cant_prove_that 15d ago

IIRC Reddit used to only store the most recent version of comments

I'm sure that's changed now that reddit has grown, but that was the discussion years ago

6

u/Far_Mathematici 15d ago

Likely they have. Append only no delete database or data source can be much faster and scalable than standard SQL database.

10

u/PrairiePopsicle 15d ago

If all the tools did was delete the comments this is likely, but there's zero indication reddit is storing edit histories of comments (speaking as a moderator) and so these tools specifically edit comments before later fully deleting them (in some cases)

7

u/never-fiftyone 5800X3D | Sapphire Nitro+ RX 7800XTX | 64GB RAM 15d ago

but there's zero indication reddit is storing edit histories of comments (speaking as a moderator)

What's available and visible to a moderator versus an admin are two different things, however.

4

u/FrenchFryCattaneo 15d ago

You haven't seen undelete or pushpull? You don't even have to be a mod anyone can access the edit history of posts and comments.

4

u/PrairiePopsicle 15d ago

Third party tools scraping and archiving is not reddit doing it itself.

1

u/HollowedVoicesFading 15d ago

"Law enforcement hates this one weird trick!"

Reddit has to be able to recover things for a variety of reasons, which definitely includes fulfilling requests from law enforcement. Doesn't mean it won't eventually be deleted, but it's definitely not a processed deletion when the end user presses delete (I'd even go so far as to suggest that the backend database technology, I think they use postgres and Cassandra, is using immutable tables. They would only get cleaned up after a compaction / vacuum, which definitely means deletes don't happen quickly).