:

APPLE SHRINKS GEMINI AI FOR ON-DEVICE SIRI

AI DESK2 MIN READ
FRI, MAY 29, 2026

■ AI-SUMMARIZED FROM 1 SOURCE ▸ TIMELINE

Apple is working to compress Google's Gemini model to run directly on iPhones, though cloud processing will likely remain necessary for full functionality.

Apple is attempting to optimize Google's Gemini AI model for on-device performance on iPhones, according to reports. The effort aims to power an upgraded version of Siri with advanced language capabilities while maintaining user privacy through local processing. The challenge lies in model compression. Gemini's full version requires substantial computational resources typically available only in data centers. Apple's engineering teams are working to reduce the model's size and complexity without significantly degrading performance—a process known as quantization and distillation. Despite these efforts, a hybrid approach appears likely. Certain tasks would run locally on the device for speed and privacy, while more demanding queries would route to Apple's cloud infrastructure. This balances the benefits of on-device AI—faster responses, offline capability, and reduced data transmission—with the superior performance of server-side processing. The move aligns with industry trends toward bringing AI capabilities closer to users. Apple has previously emphasized on-device processing for privacy-sensitive features. However, local language models require significant storage space and processing power, making the trade-offs non-trivial on mobile devices. Implementing Gemini on iPhones would represent a shift from current Siri functionality, which relies heavily on cloud processing. An enhanced on-device model could handle more complex queries, context awareness, and natural language understanding with reduced latency. The partnership leverages Google's AI expertise while allowing Apple to maintain control over the user experience and data handling. Terms of the arrangement remain undisclosed. Success depends on Apple's ability to compress Gemini to a practical size—likely in the gigabytes range—without excessive quality loss. Device storage constraints on base iPhone models could limit which versions receive the full local implementation. No timeline for deployment has been announced. The project remains in development as Apple evaluates technical feasibility and performance benchmarks against its standards.

■ SOURCES

Ars Technica

■ SUMMARY WRITTEN BY AI FROM THE LINKS ABOVE

■ MORE FROM THE AI DESK

Singapore's Sea Ltd. has established a dedicated team to identify and pursue AI investments, signaling a strategic pivot beyond its e-commerce core business. The move reflects the company's search for new growth opportunities in artificial intelligence.

9H AGOAI Desk

Tech executives are laying off workers based on AI capabilities they may not fully grasp, according to Box founder Aaron Levie. The trend has accelerated dramatically, with 2026 layoffs already approaching 2025's total.

9H AGOAI Desk

AI startup Shift is offering free home cleaning services in New York and plans to expand to London, but the deal requires homeowners to let the company film cleaners performing household chores.

9H AGOIndustry Desk

Bank of England Governor Andrew Bailey revealed that British banks remain unable to access Anthropic's Mythos AI tool. Bailey called for coordinated international efforts to address cybersecurity challenges.

9H AGOAI Desk

■ SUBSCRIBE TO THE DAILY BRIEF

ONE EMAIL, 5 STORIES, 06:00 UTC. UNSUBSCRIBE ANYTIME.