AI MODELS FAIL AT SOCCER BETTING, GROK WORST

AI DESK ■ 1 MIN READ

SAT, JUN 13, 2026

■ AI-SUMMARIZED FROM 1 SOURCE

Major AI systems from Google, OpenAI, Anthropic, and xAI perform poorly when predicting Premier League match outcomes. xAI's Grok shows particularly weak performance.

A recent evaluation tested leading AI models on their ability to forecast English Premier League results. Google, OpenAI, Anthropic, and xAI systems all demonstrated significant limitations in soccer prediction tasks. xAI's Grok underperformed compared to its competitors, suggesting the model struggles with the complexity of sports analytics.

The findings highlight a gap between general AI capabilities and domain-specific prediction accuracy. Soccer betting requires understanding team form, player injuries, tactical matchups, and contextual variables that extend beyond typical training data. Current AI models lack reliable mechanisms for integrating these real-time factors. The results suggest that despite advances in large language models, predicting sports outcomes remains a challenging problem. Specialized models trained specifically on sports data may outperform generalist AI systems in this domain.

■ SOURCES

► Ars Technica

■ SUMMARY WRITTEN BY AI FROM THE LINKS ABOVE
