Clock Reasoning Problems

When AI reasoning goes wrong: Microsoft Research shows more tokens can mean more problems

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now Large language models (LLMs) are ...

Ars Technica

New Apple study challenges whether AI models truly “reason” through problems

In early June, Apple researchers released a study suggesting that simulated reasoning (SR) models, such as OpenAI’s o1 and o3, DeepSeek-R1, and Claude 3.7 Sonnet Thinking, produce outputs consistent ...

Science News

A look under the hood of DeepSeek’s AI models doesn’t provide all the answers

It’s been almost a year since DeepSeek made a major AI splash. In January, the Chinese company reported that one of its large language models rivaled an OpenAI counterpart on math and coding ...

Futurism

Top “Reasoning” AI Models Can be Brought to Their Knees With an Extremely Simple Trick

A team of Apple researchers has found that advanced AI models’ alleged ability to “reason” isn’t all it’s cracked up to be. But marketing aside, there’s no agreed-upon industrywide definition for what ...

Hosted on MSN

Apple research claims popular AI models fail at hard reasoning: Why does it matter?

Over the weekend, Apple released new research that accuses most advanced generative AI models from the likes of OpenAI, Google and Anthropic of failing to handle tough logical reasoning problems.

Some results have been hidden because they may be inaccessible to you

Show inaccessible results