Startup Gimlet Labs is solving the AI inference bottleneck in a surprisingly elegant way
By Julie Bort
Published on March 23, 2026.
Stanford adjunct professor and founder Zain Asgar has raised an $80 million Series A for his startup, Gimlet Labs, which aims to solve the AI inference bottleneck more efficiently. The round was led by Menlo Ventures. The company claims to be the first and only “multi-silicon inference cloud,” which lets an AI workload run simultaneously across diverse types of hardware. It can split an AI app's work across traditional CPUs, AI-tuned GPUs, and high-memory systems. The product is designed for the largest AI model labs and data centers. Asgar believes that apps use existing hardware only 15 to 30 percent of the time.