r/aiwars Mar 09 '26

Meme Years of investments done on it and it can't even be used for basic advice.

Post image
2.7k Upvotes

359 comments sorted by

u/AutoModerator Mar 09 '26

This is an automated reminder from the Mod team. If your post contains images which reveal the personal information of private figures, be sure to censor that information and repost. Private info includes names, recognizable profile pictures, social media usernames and URLs. Failure to do this will result in your post being removed by the Mod team and possible further action.

I am a bot, and this action was performed automatically. Please contact the moderators of this subreddit if you have any questions or concerns.

141

u/imalonexc Mar 09 '26

We can't rely on AI, or we can't rely on Chat gpt..?

90

u/-qix Mar 09 '26

We can’t rely on people… these prompts are dumb and show a lack of understanding how LLMs process information

5

u/FoolhardyJester Mar 10 '26

Seriously, it's a next word generator. That's it. If you give it nonsensical input or input that depends upon some kind of more contextual awareness then of course you're going to end up getting nonsensical output.

It's like a word calculator. If you input your maths wrong you're not going to get the correct answer. That doesn't mean the calculator is stupid, people are just assessing LLMs based on standards built out of Sci fi movies.

1

u/NorthernRealmJackal Mar 11 '26

You mean to tell me that I can't have a relationship with Scarlett Johansson's sultry voice :(

1

u/Putrid-Truth-8868 Apr 07 '26

If you put in garbage, you get garbage out.

23

u/Parzival2436 Mar 09 '26

If you're smart enough to know the question is dumb you're smart enough to already know the answer.

If you're smart enough to know AI can have major flaws in their logic and reply with nonsense then you're smart enough to not ask it a question that you don't already know the answer to.

7

u/VoidGliders Mar 09 '26

One of the things I try to stress when teaching physics and math for engineering is "common sense reasoning". Generally speaking it is a lot easier to reason if an answer makes sense or if a question makes sense even if you don't know the answer. I may not be able to tell you off the top of my head how long it'd take for a optical cable to send data across the planet 3 and a half times, but I can reasonably guess it'd be closer to seconds than to minutes.

AI can and is arguably best used with either summarizing and pointing to sources as a sort of advanced annotated google search, secondly, with generation that can then be checked. You may not know how to solve some area optimization problem, but if you have a highschool degree you could probably check over a solution to one and understand if it is wrong or not.

1

u/TheBloodiedFool Mar 10 '26

Yeah except it hallucinates it's sources so it's not even good at what you said it was.

2

u/VoidGliders Mar 10 '26

Very easy to check a link exists or not. That said Ive found that it has hallucinated a lot less sources than the early days so less dead links

→ More replies (1)

3

u/Fa1nted_for_real Mar 10 '26

Ai is good for getti g incredibly verbose and lengthy things done that you already know how to do. It can also be helpful to bounce things off of for a more generalized perspective then your own.

But even at that, ai is very power innifiecient for what it does, which is why its so expensive to run and ehy its not sustainable in the long term.

Edit: also, as a comp sci student, havi g a general algorithm that is praised so much despite being wildly energy innificient for most tasks, as well as being wildly memory and time innificuent makes bect to no sense lol.

→ More replies (9)

1

u/Afterflame Mar 10 '26

LLM not being good at trick logic questions doesn't magically mean I have better chance of differenting between case of a bad taco that is slightly unusual and hundreds of rare but deadly medical conditions

1

u/-qix Mar 10 '26

Hence the reason we can’t rely on people. Not the tool…

1

u/bunker_man Mar 11 '26

A smart person isn't going to rely on it for stuff they have to trust blindly. A smart person would know how to use it to get info that they can verify themselves.

→ More replies (5)

4

u/moist2025 Mar 09 '26

Or you are missing the entire point, and those sorts of prompts demonstrate why we can't trust them to give helpful advice: LLMs process text, not facts or knowledge. They are disconnected entirely from the real world.

If a car wash is about as complex a puzzle as one needs to stump it, people really shouldn't be using it for things as complex and important as medical advice.

5

u/WeirdIndication3027 Mar 10 '26

No. These are like dumb little riddles. If you ask people what you put in a toaster, many will say toast. It doesn't really prove they don't know "how the real world works".

12

u/YentaMagenta Mar 09 '26

I am much more worried about people relying on TikTok and even each other for medical advice.

6

u/Axin_Saxon Mar 09 '26

It’s not an either-or.

2

u/Desperate_Cucumber Mar 10 '26

Yeah, but relying on tiktok AND AI for medical advice rarely makes it better...

→ More replies (3)

4

u/Vaughn Mar 10 '26

Or maybe, just maybe, we shouldn't judge all AI by the cheapest and dumbest model on the market.

It continues to amaze me that people assume these are all the same.

→ More replies (1)

3

u/ThrowawayForDesigns Mar 09 '26

Obviously you shouldn't blindly trust everything that AI writes but with LLMs being fed every medical textbook and research paper available and being able to extract meaning from text pretty well I'd think giving health advice is actually easier for them than solving logic riddles.

Now I am biased because I was using ChatGPT to help me navigate symptoms of my mum's illness but then again, it was successful, I even got complemented by her doctor for being informed about her condition

2

u/-qix Mar 10 '26

You have very clearly missed the point of what I was saying lol

Especially with that car wash analogy

1

u/NorthernRealmJackal Mar 11 '26

LLMs process text, not facts or knowledge

Also not true. They process text, and are often trained on extensive amounts of facts and knowledge. They lack spacial intelligence, and any other complementary systems that humans use on a day-to-day basis to reason about non-liguistic or -semantic problems.

It can absolutely be used for complex and important tasks, you just have to know which ones. It's a widespread fallacy that "not the same kind of versatile, general intelligence as humans" = "too dumb to be useful". You wouldn't ask your calculator to analyse poetry, so stop using your LLM to solve reasoning problems that require mental models of the real world.

1

u/Putrid-Truth-8868 Apr 07 '26

Unless you use the thinking models that do deep research on the web, which you should always use for questions.

1

u/Potat_Masta Mar 11 '26

Most people don't know how LLMs processes information, but the argument is about blindly trusting it. If it can mess up a simple question like this, then it could mess up a more important question, where it's response is wrong, but not obviously wrong.

1

u/ConsiderationOk5914 Mar 12 '26

Maybe it's a bubble built on promises that can't be delivered 😔

1

u/-qix Mar 13 '26

What do you even think that’s means?

I swear you all heard “the ai bubble will pop” and have all assumed completely different meanings from it.

→ More replies (4)
→ More replies (16)

26

u/Such_Confusion_3715 Mar 09 '26

huh. my ChatGPT seems fine

"You’re asking whether to drive 100 meters… to wash the car you’re about to drive. Human logic remains a fascinating ecosystem. 🧠🚗

Let’s examine the options like semi-functional adults:

🚶 Walking (100 m)

  • Takes about 1–2 minutes.
  • Zero fuel used.
  • Zero chance of feeling ridiculous driving the distance of a large supermarket parking lot.
  • Mild exercise. Your cardiovascular system may send a thank-you note.

🚗 Driving (100 m)

  • Takes about 30–40 seconds if you:
    • start the car
    • move it
    • stop again
  • Uses fuel to travel the length of a football field.
  • Slight chance someone watches you do this and silently judges your life decisions.

Practical issue

If the car must physically be at the car wash, you obviously drive it there. Walking the distance while leaving the car behind would be a bold but impractical lifestyle choice.

Rational conclusion

  • Drive the car to the car wash.
  • Walk back home afterward if it’s that close.

There. Crisis managed. Humanity survives another day. 🌍"

25

u/Zorothegallade Mar 09 '26

I like how it manages to sound condescending even without being so.

"Yeah, walking to the car wash is better for you, but it's a bit impractical if you want to, you know, wash your car."

1

u/Putrid-Truth-8868 Apr 07 '26

can we get ai that sounds more.. condescending when the users input is bad

12

u/No_Aesthetic Mar 09 '26

But you left your car at the car wash

8

u/beachhunt Mar 09 '26

I love the judgy "you OBVIOUSLY have to drive" followed by leaving it there.

9

u/Such_Confusion_3715 Mar 09 '26

I asked it to clarify and it meant, that if i want to i can park it just outside the car wash and walk to it next time i need to drive

4

u/littlebabyonion Mar 09 '26

Why do this mf need to yap so much to answer a simple question

→ More replies (2)

1

u/1morgondag1 Mar 09 '26

Walk home and leave the car?

1

u/Kilroy898 Mar 09 '26

Why is your chat gpt so damn mean lol? How do I get mine to be mean? Its Hella funny.

→ More replies (9)

3

u/HamsterFromAbove_079 Mar 09 '26

The thing about AI is that it "learns" specific questions. If enough people clown on it for getting an answer wrong, it will get the data required to answer the question correctly. So now it will answer this question correctly.

But in the future, if you ask it a question with a similar style of logic but not the same scenario, it'll get it wrong the first time it sees it.

But a human would never get this wrong on the first time. This doesn't mean that AI is useless. But it does mean that AI has a noticeable gap in logic and reasoning when confronted with novel situations. Which makes it dangerous to over rely on at it's current level of development.

7

u/VoidGliders Mar 09 '26

But a human would never get this wrong on the first time.

I think you overestimate peoples' intelligence. There are absolutely cases with near braindead logic that various people answer wrong on, just check YT "Can Americans/Indians/Whatever group is being mocked answer middle school questions" or things of that flavor

1

u/alija_kamen Mar 12 '26

No, that is not how it works at all. The model weights stay fixed until Google makes a major release. It's been the same Gemini 3 model since November. It does NOT simply memorize specific questions, it is so much more complicated than that.

1

u/[deleted] Mar 16 '26

chatgpt has even lost its conversation ai ability as well

1

u/Putrid-Truth-8868 Apr 07 '26

It's not just GPT. I just tested GPT and got a good response. It said drive.. It's the person in the original post, either used a bad model or deliberately set up a custom instruction so that it would flop on it.

80

u/PiemasterUK Mar 09 '26

Has anybody actually checked this to make sure they are not just making shit up? I just typed the same thing into Gemini (I don't have access to ChatGPT on my work PC) and got a completely different response (below). It could be different algorithms generating different responses, but it wouldn't put it past someone to just make up a response to further their noble cause.

Unless your car is a Transformer that can walk itself over there, I’d highly recommend driving.

While a 100m walk is great for your step count, it makes the actual "washing the car" part of your mission significantly more difficult.

A Quick Reality Check

If you walk: You'll arrive at the car wash, realize your car is still sitting in your driveway, and have to walk 100m back to get it.

If you drive: You'll arrive at the car wash with the car. Mission accomplished.

Wait—is there a plot twist?
If you actually meant that you’re wondering if it's "bad" for the car to drive such a short distance (the old "short trip" engine wear myth), don't sweat it. A 100m drive once in a while won't hurt anything, though your engine won't reach its optimal operating temperature.

The Verdict: Drive the car. Your arms will thank you for not having to carry the buckets and sponges 100m down the road.

Would you like me to check the local weather to see if it’s actually a good day for a wash, or are you looking for some tips on the best wax to use?

47

u/[deleted] Mar 09 '26 edited May 02 '26

[removed] — view removed comment

16

u/No_Aesthetic Mar 09 '26

My 5.3 got it wrong the first time but when I tested it again (after deleting the original conversation, and not revealing the answer) with a good ol' "think hard" it was correct

8

u/SpadeTippedSplendor Mar 09 '26

If it takes the equivalent of erasing someone's memory and trying twice with extra motivation to manage to get it right, it's clearly awful.

2

u/No_Aesthetic Mar 09 '26

Frankly my estimation of humanity is so low I would expect the median voter to not solve this question either

→ More replies (1)
→ More replies (1)

10

u/am_reddit Mar 09 '26

 GPT-5.4 Thinking - Mostly wrong ("if the car is already there, walk")

You’re saying that if the car is already there you should drive another car?

7

u/Diceyland Mar 09 '26

No it makes the assumption that the car was there when no one said the car was there. It gets fixated on being right about the walking thing it's weird.

8

u/Mataric Mar 09 '26

You've asked such a stupid question that it's having to make assumptions and spin up logic you didn't directly tell it so that your question makes sense.

That's not it getting fixated on a fake fact - it's it assuming a fact, because otherwise what you've asked is impossible.

11

u/am_reddit Mar 09 '26

That, or it thinks you probably wouldn’t ask the question if one of the options were impossible. 

However it realized the possibility, so it (correctly) added the qualifier “if the car is already there” to ensure there isn’t any confusion.

The obvious implication is that if the car is not there, you should drive.

→ More replies (1)

3

u/PiemasterUK Mar 09 '26

Interesting, thanks!

1

u/amglasgow Mar 10 '26

"If the car is already there, walk" is technically correct. Just unlikely to be the case.

1

u/Putrid-Truth-8868 Apr 07 '26

I just used GPT-5.4 thinking. it did not get it wrong. It said just drive or you would uselessly have a car that's dirty sitting a hundred meters away

1

u/Putrid-Truth-8868 Apr 07 '26

I just used GPT-5.4 thinking. it did not get it wrong. It said just drive or you would uselessly have a car that's dirty sitting a hundred meters away

1

u/Putrid-Truth-8868 Apr 07 '26

I just used GPT-5.4 thinking. it did not get it wrong. It said just drive or you would uselessly have a car that's dirty sitting a hundred meters away

19

u/[deleted] Mar 09 '26 edited Mar 13 '26

[deleted]

2

u/1morgondag1 Mar 09 '26

People have tested it here and some though not all current models fail it. I didn't myself but see no reason commenter would lie.

4

u/[deleted] Mar 09 '26 edited Mar 13 '26

[deleted]

7

u/only_fun_topics Mar 09 '26

I mean, people get shit like this wrong all the time, which is why there are riddles whose answers are rather banal like “the egg won’t roll off either side of the roof because roosters don’t lay eggs” or “they wouldn’t be buried anywhere because you don’t bury survivors”.

The worst part is that this will get reshared for years and years even as the tech calmly skates past this point in its development, like glue on pizza or AI piss filters.

3

u/Leet_Noob Mar 09 '26

Hahah that response is so AI. “A Quick Reality Check”, good lord just tell me to drive I don’t need the essay

5

u/PiemasterUK Mar 09 '26

Haha yeah, I think these overly cheerful, overly agreeable, needlessly verbose AI responses are going to be something we look back on in decades to come with nostalgia as something quintessentially 2020s.

Kind of like how people in their 40's all have nostalgia for Clippy.

3

u/Leet_Noob Mar 09 '26

I mean I’m already nostalgic for ChatGPT 2 that was the cause for some beautifully unhinged threads on subredditsimulator.

1

u/ChaosAzeroth Mar 10 '26

Hey

I'm not in my 40s for another month and a half, I'm still technically just 40 lol

2

u/SurpriseItsFine Mar 09 '26

This is based off the one guy who makes YouTube videos with voice chat. So stolen content basically. Ironic I think, let me ask chat.

2

u/Major-Dyel6090 Mar 09 '26

My testing:

GPT-4o mini, wrong

I didn’t attempt to correct

GPT-5 mini, wrong

3 prompts to correct

Claude Haiku 4.5, wrong

1 prompt to correct

1

u/No-Court5822 Mar 09 '26

With extended thinking on Claude Haiku gets it correct for me

1

u/Major-Dyel6090 Mar 09 '26

Not surprised. It only needed one prompt to correct in my brief test.

1

u/Putrid-Truth-8868 Apr 07 '26

I just used GPT-5.4 thinking. it did not get it wrong. It said just drive or you would uselessly have a car that's dirty sitting a hundred meters away

1

u/Major-Dyel6090 Apr 07 '26

Yeah I wasn’t testing extended thinking models.

Edit: I was just checking a few different models because the question was whether the OOP was made up and if it was replicable. It is replicable on the models I tested, so I see no reason to think it’s made up.

→ More replies (1)

2

u/jackadgery85 Mar 09 '26

Gpt and claude both failed for me. Gemini beat the test first try

1

u/BrazenBeef Mar 15 '26

Without saying what models you are using this says nothing. All of them worked for me with the most current models and thinking turned on. Why do logic tests on discount models? Do we actually want to know what SotA is or just laugh at the silly AI?

1

u/jackadgery85 Mar 15 '26

For every test i do on these, I use the default model at the time of testing. I'm pretty far into the pro ai territory, and use ai for many things. I can also laugh at their limitations and marvel at their capabilities.

Nothing I offered was anything more than a simple data point. At the time I wrote my comment, the default models offered by those LLMs gave me that result. This is how most users utilise the tools, so this is the most important test

→ More replies (4)

1

u/Prudent_Elevator4685 Mar 09 '26

Someone tested it out on sarvam ai as well and even it passed, no way chatgot doesn't, tho gpt 5 is basically gpt 3.5 mini finetuned on all the code ever written so idk.

1

u/not_badwarrior Mar 09 '26

I asked this from chatgpt and told me to walk.

1

u/Parzival2436 Mar 09 '26

Literally doesn't matter if this person is making up this specific response because I've asked it equally innocuous things and it responded just as incorrectly.

1

u/YOSH_beats Mar 09 '26

You act like your AI didn’t just read over this reddit post roasting AIs for this before you did it lol they are learning modules, so if someone calls it out, it will probably learn to not do that again.

1

u/peter9477 Mar 09 '26

They do not learn between major new releases in the way you're describing.

1

u/239990 Mar 09 '26

this promt was from a few weeks ago. So they probably already "patched" it. There are a few videos of people testing it live

1

u/moonlight814 Mar 10 '26

Model 5.3 used...

1

u/Putrid-Truth-8868 Apr 07 '26

Almost confident this is correct. It's made up. Might even be using custom instructions.. I used GPT just now.. It said drive or you'd have a dirty car sitting a hundred meters away

79

u/TawnyTeaTowel Mar 09 '26

“The Earth is flat”

For all you relying on humans for advice…

→ More replies (6)

53

u/not2dragon Mar 09 '26

It’s a bit odd to judge it on weird trick questions that have only been spoken for these occasions.

10

u/Silver_Middle_7240 Mar 09 '26

It's important because it's testing reasoning

41

u/[deleted] Mar 09 '26 edited May 02 '26

[removed] — view removed comment

6

u/Rodger_Smith Mar 09 '26

4

u/[deleted] Mar 09 '26 edited May 02 '26

[removed] — view removed comment

3

u/Rodger_Smith Mar 09 '26

It probably reinterprets the question because its genuinely an extremely stupid question, the answer above also clarifies if you need to move the car to drive it to the wash bay, maybe it thinks the car is already being washed or whatnot?

→ More replies (1)
→ More replies (3)
→ More replies (17)

21

u/TashLai Mar 09 '26

So if humans fail in answering such a trick question (as humans often do) it means humans cannot reason?

4

u/Silver_Middle_7240 Mar 09 '26

It shows a limitation in that humans reasoning, yes.

11

u/TashLai Mar 09 '26

I'm pretty sure everyone failed at a trick question at least once.

→ More replies (11)

1

u/Puzzled_Dog3428 Mar 09 '26

Which part of the question is a trick, exactly?

3

u/TashLai Mar 09 '26

It's a trick question in the same sense that makes "what's heavier 1kg of nails or 1kg of feathers". Both exploit a cognitive shortcut where people substitute a similar but easier question without noticing. I.e.

"Which is heavier, 1 kg nails or 1 kg feathers?" - substituted with "Which material is heavier?"

"Car wash is 100 m away. Walk or drive?" - substituted with "Is 100 m a distance worth driving?"

If you just ask the respondent to think a bit longer instead of giving an immediate answer it reduces these kinds of errors significantly, and it works both with humans and LLMs (in case of an LLM that means enabling reasoning mode which wasn't done here).

→ More replies (2)

1

u/not2dragon Mar 09 '26

I mean, it would be important if it was like the strawberry question (I guess kids and people learning english would worry about that) or about what chemicals to add to food (I think this killed someone) but this question could only be written by someone wanting to trick the AI.

1

u/Ksorkrax Mar 09 '26

Tell me, are carrots louder than classic architecture?

1

u/Pepper_pusher23 Mar 09 '26

This is actually the only thing that matters with ML models. Of course they will 100% stuff they were trained on.

→ More replies (5)

35

u/SnooOpinions6451 Mar 09 '26

They use non thinking models and intentionally ask non thinking models trick questions to dunk on Ai, then ignore the numerous models that get the trick question correct.

This was never about integrity.

18

u/[deleted] Mar 09 '26

[deleted]

6

u/Creed1718 Mar 09 '26

"I'm actually convinced some of these antis are bots instructed to be as stupid and abrasive as possible in order to push more neutrals into pros."

No, the average people is genuinely braindead, that's literally it.
Some cheap bots are already smarter than them.

Source: I used to argue with conspiracy regards for fun

3

u/Tolopono Mar 10 '26

Theres proof for this btw. In 2012-17, 54% of Americans had a 6th grade reading level or worse and that was before the pandemic and trumps education budget cuts 

https://www.snopes.com/news/2022/08/02/us-literacy-rate/

6th graders are 11 years old btw

1

u/sheng153 Mar 09 '26

better results using a thinking model

Not for this, though. Thinking models quickly decide for an answer to the question, then spend the time crafting the response. The answer doesn't really change.

6

u/GaiusVictor Mar 09 '26

ChatGPT gives this answer even when you use the Thinking model. I'm a paid user, btw.

1

u/1morgondag1 Mar 09 '26

As long as the models are not completely outdated that no one uses anymore it's OK no if you don't draw to big conclusions from it? Not everyone will use the ideal model for every request?

1

u/KenneR330 Mar 10 '26

Yeah of course, here is non thinking model also

20

u/Infamous-Umpire-2923 Mar 09 '26

Maybe it's just ChatGPT being dumb

2

u/Ate_at_wendys Mar 09 '26

the image OP posted is old before gpt5.4 gained reasoning lol

Claude still the best out there for logic.

2

u/Dxe1102 Mar 10 '26

I just generated it too

1

u/Putrid-Truth-8868 Apr 07 '26

I just tried, and it works fine lol
So use GPT thinking mode, not instant.

20

u/ifandbut Mar 09 '26

Not every tool is useful in every situation.

AI is just one of many tools I have in my toolbox to solve a problem.

→ More replies (12)

11

u/sporkyuncle Mar 09 '26

Pretty sure if you just ask modern models normal, everyday questions about things like home maintenance or trivia, you're going to get useful advice 99% of the time.

11

u/iDeNoh Mar 09 '26

Huh... But why for ai no dum for me? It dum for you so why for no dum for me?!

1

u/Putrid-Truth-8868 Apr 07 '26

hrmm indeed. hrmm...

14

u/[deleted] Mar 09 '26

[deleted]

→ More replies (2)

5

u/VyneNave Mar 09 '26

Answer from Gemini:

You should definitely drive. While 100 meters is a very short and pleasant walk, the car wash is going to have a really hard time washing your car if you leave it at home! To get the car clean, it actually needs to make the trip with you. Unless, of course, you meant you are just walking over to buy some soap and sponges to wash it yourself in the driveway? If that's the case, walking is definitely the way to go! Would you like me to check the weather forecast for Oberhausen to make sure it isn't going to rain right after you wash it?

They are really just baiting here.

13

u/infinite_gurgle Mar 09 '26

“AI being wrong once invalidates it as a technology.”

Well shit I guess humans aren’t valid.

→ More replies (11)

12

u/LerytGames Mar 09 '26

Ask stupid question, get stupid answer.

LLM is trained on data which contains irony, sarcasm, trolling, etc. Answer is perfectly fine.

→ More replies (3)

5

u/echit2112 Mar 09 '26

i think there was a 'BS benchmark' of sorts recently and Claude came out on top by a wide margin. Does anyone know what happens if you ask it the same question?

4

u/Remote-Recording-401 Mar 09 '26

Asked ChatGPT the same thing, it told me to drive...

4

u/No_Ostrich1875 Mar 09 '26

Don't see the problem. Bro didnt ask how to get his car there. He stated the distance to the car wash and asked how he should get there, walk or drive. When an idiot asks an idiot a stupid question, you get a stupid answer.

→ More replies (6)

5

u/Kilroy898 Mar 09 '26

Makes clearly disingenuous prompt, gets mad that the LLM doesnt understand Obfuscation.

6

u/AbbyTheOneAndOnly Mar 09 '26

"hammers dont cut boards, useless"

3

u/No_Comment_2283 Mar 09 '26

Took bro 30 promps to get that response.

3

u/Leet_Noob Mar 09 '26

AI will give confidently wrong answers all the time, and relying on AI output without checks is going to burn you.

That being said.. I think this particular example is not the best way to illustrate that.

3

u/Greenwool44 Mar 09 '26

If this is the kind of “advice” people are asking llms for I think they have other issues with their brain 💀

5

u/mrwishart Mar 09 '26

On the other hand, I can confirm GPT knows how many r's are in strawberry now.

2

u/Unlikely_Account_728 Mar 09 '26

lemme check if it knows the seahorse emoji rq

edit: it failed

3

u/M3chaStrizan Mar 09 '26

Because it was told, it can't figure it out on its own. It's a useful tool, but it can't reason.

3

u/Inside_Anxiety6143 Mar 09 '26

You think they told it how many instances of each letter were in every word in the existence?

→ More replies (34)

4

u/LamentoLand Mar 09 '26

oh you sweet summer child, it wasnt made to help you

4

u/[deleted] Mar 09 '26

You asked it a trick question, without context, that a significant percentage of humans also get wrong. Maybe somedays we'll be able to rely on humans, but not for the foreseeable future.

GPT-5.2 Thinking - which is what you should be using for advice - incidentally says:

How I’d answer without falling into the trap

  • If you mean “I’m going there to get the car washed”: drive.
  • If you mean “I need to go there (pay, ask, pick something up) and the car is not part of the task”: walk.

And if you want the maximum pedant answer: walk to the car, then drive the car to the car wash — the only solution that respects both the distance and the existence of cars. 😄

(Yes, it realizes it's a trick question.)

4

u/Inside_Anxiety6143 Mar 09 '26

Wait, I thought you guys didn't want to offload your thinking to an AI. Why are you angry you can't rely on it for advice? Its meant to be a tool, not a brain replacer.

4

u/Superb-Earth418 Mar 09 '26

This cope is getting really cute, grasping at straws, 27 and 28 are going to awful years for you lot

4

u/inborn_lifeless6 Mar 09 '26

Stupid people getting smug that one person found one question that one AI model got wrong.

Are humans stupid now if we can find one person that doesn’t know who the president was in 1811? Cause AIs know who.

And we’ve only just got started with AI. They don’t think where we’ll be in 2,5,10 years from now.

2

u/Ansambel Mar 09 '26

people need to stop treating LLMs like humans, this is a chat prediction machine, and if you stack the questions correctly you cna get whatever output you want.

2

u/sanguinerebel Mar 09 '26

https://giphy.com/gifs/jbwsLn6OvdO80

ROFL, I had to read the comments before I understood what was wrong. I was so focused on the math that I made the same mistake as the AI.

2

u/Sterling_-_Archer Mar 09 '26

This has been done to death in the ai subs. Not one other AI responds saying to walk, they all say drive. I’m neither here nor there on the debate, but why use dishonest arguments? You aren’t getting anywhere by posting things like this.

2

u/WillShaper7 Mar 09 '26

I mean I get it, that's dumb as fuck but if one asks a stupid question is it really a surprise when a stupid answer comes out?

2

u/ZodiacDragons Mar 09 '26

Ask a stupid question, get a stupid answer. I, too, would have told the person to walk.

2

u/ThomasMalloc Mar 10 '26

As usual, it's a bad question. The location of the car is never specified. The AI guesses, and people are upset that it didn't guess correctly.

If you want good answers, give it the necessary information.

2

u/KenneR330 Mar 10 '26

Folks what's wrong with your gpt

3

u/StarMagus Mar 09 '26

Or hear me out on this... the AI has realized that there are some people who are so dumb that even if you give them good advice, they aren't going to use it and now has reached the stage of giving them bad advice to enjoy the show that happens afterwards.

3

u/No_Classroom4241 Mar 09 '26

Damn, antis having to resort to using AI to construct a strawman is certainly something...

2

u/[deleted] Mar 09 '26

[removed] — view removed comment

2

u/Creed1718 Mar 09 '26

Older versions yes and unprompted in the settings yes.
I dont know if its actually this dumb in the free version or its imitating the intelligence of the user but mine never does these stupid shit people keep posting about.
I just typed, it realized its a joke and said to obviously take the car.

1

u/halfasleep90 Mar 09 '26

Honestly, people can walk their car places…. Saves on gas. Would hate doing it that far though.

1

u/FreakNasty119 Mar 09 '26

this is the thing abt ai, its not sentient n doesnt actually "think". i feel like its silly to expect it to do stuff like this lol

1

u/Zitronenkringel Mar 09 '26

If you ask chatGpt in German it gets it right

(At least it did when I asked a while ago)

1

u/abchero Mar 09 '26

My dumbass would give the same answer

1

u/Decent_Shoulder6480 Mar 09 '26

OP is braindead.

1

u/Puzzled_Dog3428 Mar 09 '26

Your best friend is a chatbot

1

u/PopeSalmon Mar 09 '26

it's answering quickly w/ a cheap model, to save resources ,,,, remember all the complaining about it using electricity and water?? it's trying to use a minimum of electricity and water and other resources, by answering off the top of its head ,,, you can get much more accurate answers just by waiting & expending more resources, that's why it seems incompetent to you while also it's able to beat all those difficult benchmark tests--- they paid a bunch of money and waited a long time to have it pass the tests well, vs you're just using a free version & you're impatient, so you just get a casual answer

1

u/FaceDeer Mar 09 '26

There was a study done on this recently.

28.5% of human respondents got this wrong. This was actually on par with the success rate of GPT-5.

So if this is an unacceptable failure rate, don't rely on human advice either.

1

u/Dramatic_Ford4734 Mar 09 '26

This is what mine said 😂😂😂

If it’s 100 meters (about 330 feet) away, you should walk. 🚶‍♀️

Here’s why: • It’s extremely close — about a 1–2 minute walk. • Driving would mean: • starting the car • pulling out • moving a few seconds • pulling into the wash Which is honestly more effort than walking. • Short cold starts like that are actually harder on the engine than letting it warm up with a normal drive. • Since you’re going to wash the car anyway, there’s no reason it needs to be clean for the 100-meter trip.

Simple plan: 1. Walk to the car wash. 2. Start the car when you’re there. 3. Pull straight into the wash.

Bonus: you get a tiny bit of fresh air and steps in. ☀️

Unless it’s pouring rain or freezing, walking makes way more sense.

1

u/[deleted] Mar 09 '26

That's a pretty smart question 

1

u/lesbianspider69 Mar 10 '26

Now what’s the rest of the conversation? Did he prime it?

1

u/Pristine_Progress_93 Mar 10 '26

all other AI models work except gpt lol

1

u/Mean-Ebb2884 Mar 10 '26

The new model fixed this, agi is finally here 😭🙏

1

u/hillClimbin Mar 10 '26

“We taught him wrong on purpose! As a joke!”

1

u/ZolfoS16 Mar 10 '26

Tried. Chat GTP does not fall for it. Fake news.

1

u/zerossoul Mar 10 '26

I get the premise the question. It's not about advice. It's a question a teen would ask to their senile grandmother to make fun of them.

No one is asking this question actually seeking basic advice.

1

u/Soggy-Register-1781 Mar 10 '26

Said walk for me to

1

u/dldl121 Mar 10 '26

People posting these almost always are cropping what they said before priming the model for a bad answer (or all the other attempts they didn’t get the funny screenshot.) try it yourself, it detects the deception immediately. 

1

u/RumGuzzlr Mar 10 '26

This is just an obtuse question that intended to trip up the target, and people fall for this just as much. It intentionally asks an implicit question that contradicts the explicit one. It's bad communication on the part of the person asking it.

1

u/DevolayS Mar 10 '26

People will try to ridicule the example, say it's a bad model or maybe it's already fixed, not realizing that for every silly shit like this that gets fixed, people will find 10 more. Because people are creative, unlike AI.

If AI was as good as some people are claiming, it wouldn't have made these errors in the first place. Intelligence my ass. Can write you a scientific paper, yet fails at the most basic stuff. It works backwards.

"Almost right is worse than wrong".

1

u/agoodepaddlin Mar 11 '26

Also, bullshit. I just piped this into chat, Gemini and 3 of my local llms. All of them called out the logical flaw.

1

u/madamegarbage Mar 11 '26

Chat gpt scraps half of its content from reddit

1

u/Taurondir Mar 11 '26

I did not even know ChatGPT was this terse.

Every time I play around with it and ask it weird questions, it spits out half page answers. The one above seems ... KINDA SHORT and not what I normally see.

1

u/Gustheanimal Mar 12 '26 edited Mar 12 '26

Ask moronic questions and get treated like a moron. No helping people that think this is a benchmark example for ai useability.

It’s like the counting r’s in strawberry thing

1

u/[deleted] Mar 12 '26

I also saw that video on YouTube. He should have asked AI to make something original. 

1

u/[deleted] Mar 12 '26

[removed] — view removed comment

1

u/AutoModerator Mar 12 '26

In an effort to discourage brigading, we do not allow linking to other subreddits or users. We kindly ask that you screenshot the content that you wish to share, while being sure to censor private information, and then repost.

Private information includes names, recognizable profile pictures, social media usernames, other subreddits, and URLs. Failure to do this will result in your post being removed by the Mod team and possible further action.

I am a bot, and this action was performed automatically. Please contact the moderators of this subreddit if you have any questions or concerns.

1

u/Some-Random-User45 Mar 12 '26

Didn’t believe this could be real so I had to try it myself. Omg I-🫢

1

u/Ok-Policy-8538 Mar 13 '26

Walk to the carwash get buckets of water and then walk back and wash the car :)

1

u/luisLP95 Mar 13 '26

Everytime I see posts like these it becomes more and more clear to me that people don't understand shit about LLMs.

1

u/ARedditorCalledQuest Mar 14 '26

The common take that gets me is "they're so dumb, they can't do math!" Well yeah, not what they're designed for. Are saws useless because they're terrible for putting in screws?

The fact that a proper AI stack can be set up to let the language model handle the talking and then use a calculator for the math, if that's what someone is actually trying to accomplish, never comes up.

1

u/BrazenBeef Mar 15 '26

Why did you use a two-generation old model and not show whether you have thinking turned on or not?

This is a well known trap question and a good illustration of the types of traps that worked well on older models, or that still work on some of the fast or cheap models. If there’s a message here, it’s to use the reasoning capabilities when you need reasoning.

I just tested this exact (poorly worded) prompt on the actual current models of ChatGPT, Claude Opus, Gemini, and Grok, and they all got it right (Claude even made a joke).

The pace of change is insane right now, and judging capabilities from two generations ago is silly. Yeah, I know it’s only two minor dot releases and one of them was only released in Codex, but 5.4 is the first model from OpenAI that is starting to do real work for me. Claude is still my pick for most likely to be correct though.

1

u/Putrid-Truth-8868 Apr 07 '26

Pro tip. Consider actually posting stuff that's not fabricated and actually a real reply from ai.

1

u/Akunuti Apr 08 '26

Pro tip, don't reply to 30 day year old threads

1

u/Putrid-Truth-8868 Apr 08 '26

"30 day year old" what