Over for Google and OpenAI

loyolaxavvierretard · Jun 12, 2025

‘A complete accuracy collapse’: Apple throws cold water on the potential of AI reasoning – and it's a huge blow for the likes of OpenAI, Google, and Anthropic

Presented with complex logic puzzles, AI reasoning models simply gave up

www.itpro.com

Apparently, the researchers say that the reasoning models have 0 accuracy as the logical reasoning tests go up in complexity

The ques for the established benchmarks might already have answers baked into the training set of the models so they were inaccurate when assessing a model's accuracy

@Jason Voorhees career extended by 20 years

Link to paper

https://ml-site.cdn-apple.com/papers/the-illusion-of-thinking.pdf

loyolaxavvierretard · Jun 12, 2025

@Snicket

loyolaxavvierretard · Jun 12, 2025

@Chimera

DirtyBlonde · Jun 12, 2025

chatgpt couldn't understand my 1 sentence prompt today, can confirm

loyolaxavvierretard · Jun 12, 2025

DirtyBlonde said:
chatgpt couldn't understand my 1 sentence prompt today, can confirm

Its for purely logical reasoning tasks though.

Specifically highly complex tasks

2025cel · Jun 12, 2025

exactly, AI is extremely gay and over-hyped to the point where it cringes you

loyolaxavvierretard · Jun 12, 2025

2025cel said:
exactly, AI is extremely gay and over-hyped to the point where it cringes you

Glad you fixed your retarded cellphone

loyolaxavvierretard · Jun 12, 2025

2025cel · Jun 12, 2025

loyolaxavvierretard said:
Glad you fixed your retarded cellphone

Nah I didn't bother with fixing that shit I just threw it away.

pcs are much more convenient, metaphysically

Deleted member 158882 · Jun 12, 2025

Can’t be bothered with this tech stuff.

Just going outside and doing anything mogs.

Gonthar · Jun 12, 2025

Lol, Apple has been left behind in the AI department, that's why they desperately try to downplay the importance of AI...

loyolaxavvierretard · Jun 12, 2025

Gonthar said:
Lol, Apple has been left behind in the AI department, that's why they desperately try to downplay the importance of AI...

The problems in the paper were not very tough though. The river problem was simple enough for humans to do it

Deleted member 91663 · Jun 12, 2025

loyolaxavvierretard said:
@Snicket

I haven't been following the story very closely.
What's the bottleneck of AI in personal phone usage?
Surely some kind of GPT based model could do an adequate job? But obviously it's more complex than that.

piec · Jun 12, 2025

this is why grok mogs

Lord Shadow · Jun 12, 2025

obviously lol for anyone thinking AI wasnt going to be any different from just a smarter google search than its over @lifeless

loyolaxavvierretard · Jun 12, 2025

Snicket said:
I haven't been following the story very closely.
What's the bottleneck of AI in personal phone usage?
Surely some kind of GPT based model could do an adequate job? But obviously it's more complex than that.

Its not personal usage to be specific.

The reasoning models just fail when given deterministic algorithmic problems.

It should be doable for them since its within the context window but they just "give up" before attempting as more variables are stacked on

Basedman420 · Jun 12, 2025

because Ai is stupid they cant think

Basedman420 · Jun 12, 2025

Basedman420 said:
because Ai is stupid they cant think

i just realised i typed "they'

Bitchwhipper2 · Jun 13, 2025

Chatgpt is a genuine retard tbh

loyolaxavvierretard · Jun 13, 2025

Bitchwhipper2 said:
Chatgpt is a genuine retard tbh

Still has a good knowledge base though. 2030 will be either complete bust and start an AI winter or a complete boom

gpsl · Jun 13, 2025

Tell that to the soldiers who get their faces blasted off by machine learning drones

loyolaxavvierretard · Jun 13, 2025

gpsl said:
Tell that to the soldiers who get their faces blasted off by machine learning drones

You dont need high level reasoning for that.

A bqsic Coordinate system and tracer technology can do that

Bitchwhipper2 · Jun 13, 2025

loyolaxavvierretard said:
Still has a good knowledge base though. 2030 will be either complete bust and start an AI winter or a complete boom

Yea, it has a good bit of knowledge to draw upon.

Good data analytics. But asking for its own take on philosophical ponderings is just shooting yourself in the foot

gpsl · Jun 13, 2025

loyolaxavvierretard said:
You dont need high level reasoning for that.

A bqsic Coordinate system and tracer technology can do that

Openai should hire the basement chinese to fix their retarded models

Gonthar · Jun 13, 2025

Snicket said:
I haven't been following the story very closely.
What's the bottleneck of AI in personal phone usage?
Surely some kind of GPT based model could do an adequate job? But obviously it's more complex than that.

They can't run the top LLM models locally, you would need a very powerful computer and lots of RAM for that, mostly they are run in the cloud on special servers.

Deleted member 91663 · Jun 13, 2025

Gonthar said:
They can't run the top LLM models locally, you would need a very powerful computer and lots of RAM for that, mostly they are run in the cloud on special servers.

Good point. Hadn't considered this.
Why can’t phones have cloud-based AI instead of on-device?

Deleted member 91663 · Jun 13, 2025

loyolaxavvierretard said:
Its not personal usage to be specific.

The reasoning models just fail when given deterministic algorithmic problems.

It should be doable for them since its within the context window but they just "give up" before attempting as more variables are stacked on

Interesting.

Is Apple limited by trying to develop on-device AI versus using something server-based like Chat GPT or Gemini instead?

Gonthar · Jun 13, 2025

Snicket said:
Good point. Hadn't considered this.
Why can’t phones have cloud-based AI instead of on-device?

It still takes a few seconds until you get a response from ChatGPT, cloud performance can fluctuate a lot depending on how many users are online and using that service, or the Internet speed, etc., a phone would simply lag too much if you would have to wait for seconds for a response to your various requests.

Deleted member 91663 · Jun 13, 2025

Gonthar said:
It still takes a few seconds until you get a response from ChatGPT, cloud performance can fluctuate a lot depending on how many users are online and using that service, or the Internet speed, etc., a phone would simply lag too much if you would have to wait for seconds for a response to your various requests.

Yeah, makes sense. And with over 1 billion iPhone users worldwide, the cloud infrastructure costs would be astronomical.

Over for Google and OpenAI

loyolaxavvierretard

𝕯𝖝𝕯 𝖈𝖗𝖊𝖜 . Alonso

‘A complete accuracy collapse’: Apple throws cold water on the potential of AI reasoning – and it's a huge blow for the likes of OpenAI, Google, and Anthropic

loyolaxavvierretard

𝕯𝖝𝕯 𝖈𝖗𝖊𝖜 . Alonso

loyolaxavvierretard

𝕯𝖝𝕯 𝖈𝖗𝖊𝖜 . Alonso

DirtyBlonde

Prehistoric Lurker

loyolaxavvierretard

𝕯𝖝𝕯 𝖈𝖗𝖊𝖜 . Alonso

2025cel

signalling

loyolaxavvierretard

𝕯𝖝𝕯 𝖈𝖗𝖊𝖜 . Alonso

loyolaxavvierretard

𝕯𝖝𝕯 𝖈𝖗𝖊𝖜 . Alonso

2025cel

signalling

Deleted member 158882

Reality is all that matters.

Gonthar

Emerald

loyolaxavvierretard

𝕯𝖝𝕯 𝖈𝖗𝖊𝖜 . Alonso

Deleted member 91663

Luminary

piec

Luminary

Lord Shadow

Luminary

loyolaxavvierretard

𝕯𝖝𝕯 𝖈𝖗𝖊𝖜 . Alonso

Basedman420

Did Nothing Wrong

Basedman420

Did Nothing Wrong

Bitchwhipper2

WhiteGymmax

loyolaxavvierretard

𝕯𝖝𝕯 𝖈𝖗𝖊𝖜 . Alonso

gpsl

Not your daddy

loyolaxavvierretard

𝕯𝖝𝕯 𝖈𝖗𝖊𝖜 . Alonso

Bitchwhipper2

WhiteGymmax

gpsl

Not your daddy

Gonthar

Emerald

Deleted member 91663

Luminary

Deleted member 91663

Luminary

Gonthar

Emerald

Deleted member 91663

Luminary

Similar threads

Users who are viewing this thread