A Mathematician with early access to XAI Grok 4.20, found a new Bellman function for one of the problems he had been working on with my student N. Alpay. Not an Erdős problem, but original research.
Elon Musk-owned xAI is testing Grok 4.20, a new model update to Grok 4, which already competes with GPT-5 in some benchmarks, such as ARC-AGI 2. GPT-5 is one of the best models for coding, and it ...
Grok 4 by xAI was released on July 9, and it's surged ahead of competitors like DeepSeek and Claude at LMArena, a leaderboard for ranking generative AI models. However, these types of AI rankings ...
There is a common problem for all AI companies for overfitting to benchmarks. XAI Grok 4 has some problems with prompt adherence. XAI could have had overfitting resulted from the reinforcement ...
Artificial Intelligence (AI) is becoming an integral part of daily life, including everyday calculations. But how well do these systems actually handle basic math? And how much should users trust them ...
Is Grok 4.2 the most intelligent coding model we’ve seen yet? With its release in January 2026, this AI powerhouse has already sparked conversations across the tech world. In this comparison, World of ...
What if the key to solving your most complex challenges wasn’t a team of experts, but a single, innovative AI? Enter Grok 4, a model that doesn’t just promise smarter assistance—it delivers it with ...
Elon Musk's xAI has launched its new flagship AI model, Grok-4, which demonstrates leading performance in various academic, reasoning, and coding benchmarks. Elon Musk's xAI today announced Grok 4, ...
During xAI’s launch of Grok 4 on Wednesday night, Elon Musk said — while livestreaming the event on his social media platform, X — that his AI company’s ultimate goal was to develop a “maximally truth ...
Elon Musk's frontier generative AI startup xAI formally opened developer access to its Grok 4.1 Fast models last night and introduced a new Agent Tools API—but the technical milestones were ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results