
loyolaxavvierretard
π―ππ― ππππ . Alonso
- Joined
- Mar 1, 2025
- Posts
- 18,651
- Reputation
- 46,149

βA complete accuracy collapseβ: Apple throws cold water on the potential of AI reasoning β and it's a huge blow for the likes of OpenAI, Google, and Anthropic
Presented with complex logic puzzles, AI reasoning models simply gave up

Apparently, the researchers say that the reasoning models have 0 accuracy as the logical reasoning tests go up in complexity
The ques for the established benchmarks might already have answers baked into the training set of the models so they were inaccurate when assessing a model's accuracy
@Jason Voorhees career extended by 20 years


Link to paper
Last edited: