If you’re a hacker you may well have a passing interest in math, and if you have an interest in math you might like to hear about the direction of mathematical research. In a talk on this topic [Kevin ...
Top artificial intelligence systems now ace many textbook-style math questions, yet they still fall apart on genuinely new problems. The gap between polished performance on familiar benchmarks and ...
News that large language models (LLMs) have made major advances in solving Erdős problems – a set of problems formulated by the renowned 20th-century mathematician Paul Erdős – has created an ...
AI math proof verification reached a new frontier as DeepMind’s AlphaProof Nexus solved nine open Erdős research problems with Lean-verified proofs, some unsolved for 56 years. The May 2026 Science Ne ...
Large language models struggle to solve research-level math questions. It takes a human to assess just how poorly they perform. (By Siobhan Roberts.) A few weeks ago, a high school student emailed Martin ...
In a landmark study, OpenAI researchers reveal that large language models will always produce plausible but false outputs, even with perfect data, due to fundamental statistical and computational ...
Over the weekend, Neel Somani, a software engineer, former quant researcher, and startup founder, was testing the math skills of OpenAI’s new model when he made an unexpected discovery. After ...