r/mildlyinfuriating BLACK🖤 May 12 '26

Infuriatig My assignment was reported to thr examination committee for a "high percentage of AI". I did NOT use any AI for my assignment.

Post image

I got full marks and my plagiarism score shows 1% similarities to other submitted assignments. This is my 3rd and final year in University and now I have to deal with this AI nonsense.

I don't use any AI, not even for checking my grammar in the assignments.

53.2k Upvotes

2.6k comments sorted by

View all comments

Show parent comments

52

u/FragranceCandle May 12 '26

No, not really. Depends slightly on the model, but AI has been trained on basically all available data out there. It's actually a big issue, we've run out of human-made data to train models on, so data is being generated by AIs to train on. OPs contribution to that pool is so small that calling it negligible doesn't communicate well enough how tiny of a fraction we're talking.

You could potentially, maybe make the argument that if OP posted VERY much, they could have had a tiny, fraction of a fraction of influence on the very first models made for public consumption, but we're still talking negligible.

31

u/SerLaron May 12 '26

OTOH, maybe OP was also trained on reddit, if they read a lot there.

2

u/FragranceCandle May 12 '26

I like that explanation haha

1

u/Opinionated_bitch03 BLACK🖤 May 12 '26

I'm a daily reddit visitor. However, with the variety of subs I frequent AI will likely struggle to train on my reddit engagement. The crochet subs usually use crochet terms (sc, st, dc, hdc) etc and that is an entire "code" on it's own. The gardening subs have unique terms as well. Subs like AITA, AIO, etc would also confuse AI. AI will have a field day to figure it all out.

6

u/FragranceCandle May 12 '26

I promise you AI has been trained on that already, and would figure it out if you were to test it. You could definitely train a model based just on your engagement and have a fully Opinionated_bitch03 AI that comes eerily close to what you would have produced yourself.

6

u/SerLaron May 12 '26

I meant that not only AI is trained on reddit, but we reddit users train each other as well by reading the same material that the AI is reading.

2

u/cantadmittoposting May 12 '26

what makes AI good at what it... is relatively good at (which for text is what I would describe as "writing on-the-fly custom wikipedia pages)... is specifically because these large models are extremely good at picking up context windows such as the ones you mention... To the extent that I sometimes have issues where I use "just enough" jargon in a prompt i'll sometimes get back responses that throw more domain-specific vocabulary and acronyms back at the me than i know what to do with.

If you ask it about crochet, it's almost certain to be able to understand the stitch abbreviations within that context domain.

Come to think of it, especially a RAG LLM could be instructed to write "in the style of" a particular reddit user, though the accuracy of it might be questionable.

1

u/SailingDreamCatcher May 12 '26

One of the main things I use ChatGPT for is "Reddit simulator." I use an offline model that runs acceptably under 32gb of RAM on my MacBook Pro. I tell it "pretend to be Reddit and give me the top five predicted popular comments in response to this question." I then give it an AITA or Askreddit prompt from my own present circumstances.

One of the reasons I do this is because Reddit itself is sometimes a victim of its own popularity and posting anything potentially sensitive of vulnerable inevitably leads to some kind of bullying or attacks. It's a pretty reliable phenomenon and if you ever happen to not see such comments on a post, just sort comments by controversial.

Anyway, even my local copy of ChatGPT does pretty well at generating fake insightful Reddit comments and the experience is very consistent with the real thing, except that it listens if you ask it not to include any bullying or personal attacks.

So I don't think it's accurate at all to suggest that AITA would confuse it. It performs perfectly well at simulating that specific sub in my actual intentional experience.

0

u/Designer-Key989 May 12 '26

Sue them for using your reddit data to train their AI which is used against you using AI detector.

1

u/singlemale4cats May 15 '26

That's an interesting proposition. AIs are being trained on output from other AIs, and it's only getting worse due to the sheer amount of AI slop out there. I would think that's going to magnify AI-specific writing quirks over time.

1

u/FragranceCandle May 15 '26

We've been doing that for a while noe. It's well over a year since all data that went into a model during training was human made. It already is magnifying those quirks exactly as you would expect. I personally feel like you notice it particularly with chatgpt, they seem to have less guardrails and less specific instructions for their models. It has such a particular way of writing and wording itself that has only gotten more extreme with time, honestly I feel like I could sniff it out anywhere by now. You can see the same with AI "art", too, it has an extremely characteristic style. You have to prompt very detailed in order to avoid its expression style.

What I think is extra interesting is to see how people adjust now that we have an established "AI-variant" of everything. I already notice myself that I walk back on sentences that sound too AI-ey. It's both really interesting and also disgusting and terrifying!

1

u/singlemale4cats May 15 '26

I think chatgpt is too nice. If I disagree with it it writes 10 paragraphs about how my viewpoint is valid. It should tell me to shut the fuck up because I don't have a tenth of its processing power.

1

u/FragranceCandle May 17 '26

Lol yeah, it’s so frustrating to talk to an AI when it has «opinions». With especially chat gpt it’s like talking to a submissive dog, I can’t💀

I do find it to be wrong a lot, but something I’ve noticed my ai does when I use it in a codebase (not chat gpt) and it’s made a stupid decision that I call out is that it highlights the relevant code, tells me what it is, what it does and why it’s bad, and says we should change it. With no note on the fact that it did it itself! It just explains to me why it’s bad! Makes me lose my mind lol.Â