:

TOP AI MODELS HALVE PERFORMANCE ON COMPLEX CHARTS

AI DESK2 MIN READ
SUN, APR 19, 2026

■ AI-SUMMARIZED FROM 1 SOURCE ▸ TIMELINE

A new benchmark reveals that even the best AI models struggle significantly with complicated visualizations. The RealChart2Code test shows leading proprietary models lose nearly 50% of their performance when handling complex charts built from real-world data.

Researchers have introduced RealChart2Code, a benchmark designed to measure how well AI models can interpret and process complex data visualizations. The test evaluates 14 leading AI models against charts and graphics constructed from actual datasets, revealing a substantial performance gap compared to simpler chart interpretation tasks. The findings expose a critical weakness in current AI capabilities. While models excel at basic chart analysis, their ability to handle real-world complexity drops dramatically. This performance degradation affects both open-source and proprietary models, with even top-tier commercial systems experiencing roughly 50% accuracy loss. The implications are significant for practical applications. Businesses and organizations often work with intricate visualizations—multi-layered dashboards, overlapping datasets, and complex annotations—that current AI models struggle to parse accurately. This gap between simple and complex chart interpretation could limit AI adoption in data analysis and business intelligence roles. RealChart2Code addresses a notable blind spot in existing benchmarks. Previous tests typically focus on simplified or standardized visualizations that don't reflect the messy reality of production data. The new benchmark uses real-world datasets to create charts that more accurately represent actual use cases. The benchmark's findings suggest that improving AI performance on complex visualizations should be a priority for model developers. Enhanced chart understanding could unlock valuable applications in data science, financial analysis, scientific research, and report generation. These results highlight the gap between AI capabilities on controlled tasks versus real-world scenarios. As organizations increasingly rely on AI for data interpretation, addressing this performance drop will be essential for building trustworthy systems that can handle production environments.

■ SOURCES

The Decoder

■ SUMMARY WRITTEN BY AI FROM THE LINKS ABOVE

■ MORE FROM THE AI DESK

Singapore's Sea Ltd. has established a dedicated team to identify and pursue AI investments, signaling a strategic pivot beyond its e-commerce core business. The move reflects the company's search for new growth opportunities in artificial intelligence.

21H AGOAI Desk

Tech executives are laying off workers based on AI capabilities they may not fully grasp, according to Box founder Aaron Levie. The trend has accelerated dramatically, with 2026 layoffs already approaching 2025's total.

21H AGOAI Desk

AI startup Shift is offering free home cleaning services in New York and plans to expand to London, but the deal requires homeowners to let the company film cleaners performing household chores.

21H AGOIndustry Desk

Bank of England Governor Andrew Bailey revealed that British banks remain unable to access Anthropic's Mythos AI tool. Bailey called for coordinated international efforts to address cybersecurity challenges.

21H AGOAI Desk

■ SUBSCRIBE TO THE DAILY BRIEF

ONE EMAIL, 5 STORIES, 06:00 UTC. UNSUBSCRIBE ANYTIME.