Turing Award winner Richard Sutton argues that conventional generative AI systems cannot perform genuine scientific discovery because they lack built-in evaluation mechanisms to assess their own results.
Sutton calls this a fundamental limitation of current generative AI: a system that cannot evaluate its own outputs cannot sustain scientific breakthroughs. A novel result may surface, he explains, but with nothing to recognize and retain it, it disappears again just as quickly.
According to Sutton, AI systems like AlphaGo and AlphaProof demonstrate what genuine creativity in science requires: integrated feedback loops that let a system test and validate its own findings. This evaluation capacity separates truly creative AI from systems that merely generate plausible-sounding but unverified outputs.
The distinction matters for fields where verification and reproducibility are essential. Generative models excel at pattern recognition and synthesis but lack the iterative testing mechanisms necessary for scientific validation. Sutton's assessment suggests that achieving AI-driven scientific discovery will require architectural changes beyond current generative approaches, incorporating built-in evaluation and refinement capabilities.
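The generate-then-evaluate loop Sutton describes can be illustrated with a toy sketch. This is not how AlphaGo or AlphaProof work internally. It is a minimal example under stated assumptions: `propose`, `verify`, and `discover` are hypothetical names, the "generator" simply guesses random integer triples, and the "evaluator" checks the Pythagorean relation. The point is only that a result is kept when a built-in check confirms it, rather than being produced and then lost.

```python
import random


def propose(rng):
    """Hypothetical generator: guesses a triple of small integers."""
    return tuple(rng.randint(1, 20) for _ in range(3))


def verify(candidate):
    """Hypothetical evaluator: accepts only triples with a^2 + b^2 = c^2."""
    a, b, c = candidate
    return a * a + b * b == c * c


def discover(n_proposals, seed=0):
    """Run the loop: generate candidates, keep only those the evaluator confirms."""
    rng = random.Random(seed)  # seeded so the run is repeatable
    verified = set()
    for _ in range(n_proposals):
        candidate = propose(rng)
        if verify(candidate):  # without this check, nothing would be retained
            verified.add(candidate)
    return verified


found = discover(50_000)
print(sorted(found))
```

Every triple in `found` has passed the check, so the retained set contains only validated results. A generator alone would emit the same correct triples, but mixed in with thousands of wrong ones and with no way to tell them apart.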
Microsoft released MAI-Thinking-1, a new AI model family designed to improve reasoning capabilities. The launch includes seven distinct model variants targeting different use cases.
Perplexity has announced a new Computer feature that divides processing between on-device and cloud-based models, keeping sensitive data private while improving token efficiency.
Microsoft has introduced Scout, an AI agent that operates directly within Teams to automate routine office tasks. The agent functions like a human colleague, handling repetitive work without requiring manual intervention.
Artificial intelligence is shortening the time developers need to build and test product prototypes, fundamentally changing software development workflows. The shift enables faster iteration cycles and reduces time-to-market for new features and products.