DEEPINFRA RAISES $107M SERIES B FOR INFERENCE EXPANSION
AI DESK■ 2 MIN READ
TUE, MAY 5, 2026■ AI-SUMMARIZED FROM 2 SOURCES ▸ TIMELINE
DeepInfra, a dedicated inference cloud platform, secured $107 million in Series B funding co-led by 500 Global and Georges Harik. The startup currently supports over 190 open-source models.
DeepInfra's latest funding round positions the inference cloud startup to expand its global infrastructure capacity. The Series B was co-led by venture capital firm 500 Global and Georges Harik, a prominent investor and former Yahoo executive.
The startup operates a specialized platform designed to run inference workloads for open-source language models and other AI systems. With support for more than 190 open models, DeepInfra serves developers and enterprises seeking cost-effective alternatives to proprietary AI services.
Inference—the computational process of running trained AI models to generate predictions or text—has become a critical bottleneck as demand for large language models grows. DeepInfra focuses specifically on this segment, offering dedicated infrastructure optimized for model serving rather than training.
The company competes in a crowded market that includes platforms like Together, Replicate, and cloud providers offering inference services. DeepInfra distinguishes itself through support for a broad range of open models and API-first infrastructure designed for developers.
The funding amount—$107 million—represents substantial investor confidence in the inference infrastructure category. This reflects broader market recognition that inference workloads will remain computationally intensive and commercially significant as AI adoption expands.
The capital will likely fund server expansion, engineering resources, and go-to-market efforts. DeepInfra's focus on open models positions it to benefit from the ongoing shift toward open-source AI systems, particularly as enterprises seek alternatives to proprietary models from major cloud providers.
The inference infrastructure market remains relatively nascent compared to other AI infrastructure segments, with significant room for consolidation and specialization. DeepInfra's funding validates investor appetite for specialized inference platforms targeting developers building AI applications.
■ MORE FROM THE STARTUPS DESK
Triomics, an AI platform automating data-heavy tasks for oncologists, secured $22M in Series B funding. The raise follows a $15M Series A in 2024.
22H AGO— AI Desk
Xcena secured $135 million in Series B funding at a $570 million valuation for its MX1 chip, which handles data orchestration and KV cache management directly within memory modules.
22H AGO— AI Desk
Pittsburgh-based Gray Swan, which stress-tests AI models for frontier labs, secured $40M in Series A funding at a $200M valuation. The round was co-led by Wing VC and Madrona.
YESTERDAY— AI Desk
H1, a healthcare SaaS startup, secured $40 million in funding from CVS Health. The investment signals continued investor confidence in specialized software platforms despite AI disruption concerns.
YESTERDAY— Industry Desk