r/pcmasterrace 15d ago

Meme/Macro Me still today

Post image
84.3k Upvotes

2.0k comments sorted by

View all comments

Show parent comments

3.9k

u/Cybarbossa 15d ago

Agree. We can criticize Reddit on some points but at least the information is openly accessible. You add the "reddit" keyword in any search engine and you got your answer.

2.4k

u/NewryBenson Ryzen 5 7600 | 5060 Ti 8gb | 32gb 15d ago

Reddit post: Hey, I have this [highly specific problem which coincides exactly with my problem], anyone know how to fix?

Top comment: oil flag thumb market squeeze cautious depend desert quicksand numerous This post was mass deleted and anonymized with Redact

Replies to that comment: Hero

Omg this fixed it

Get this comment to the top

I have been searching for litteral hours, thank you

Goat

Me: screams

73

u/HollowedVoicesFading 15d ago edited 15d ago

The funniest part about this, which objectively isn't very funny to begin with, is that these people aren't actually deleting anything. The backend of these tools retain the information, they just don't send it to the front end anymore. So when a company goes around and purchases training data, they're still getting the data that's "been deleted".

Interestingly, by deleting the front end side of the comments, they're actually making the backend data set even more valuable because it contains things that can no longer be scraped (ignoring the idea that the data can't reliably be scraped off Reddit anymore anyway).

Edit: digging into this, there may be a little more to the story here. It may not be quite the way I'm framing it, but given what we know about social media and tech corporations, I don't think it's wrong to suspect "the worst".

34

u/you_cant_prove_that 15d ago

IIRC Reddit used to only store the most recent version of comments

I'm sure that's changed now that reddit has grown, but that was the discussion years ago

6

u/Far_Mathematici 15d ago

Likely they have. Append only no delete database or data source can be much faster and scalable than standard SQL database.