The creators of the open source project vLLM have announced that they transitioned the popular tool into a VC-backed startup, Inferact, raising $150 million in seed funding at an $800 million ...
Shakti P. Singh is a Principal Engineer at Intuit and former OCI model inference lead, specializing in scalable AI systems and LLM inference. Generative models are rapidly making inroads into enterprise ...
AI is inspiring organizations to rethink a fundamental IT concept: the data center. For decades, the data center was a centralized place. It was a handful of large, secure facilities where ...
Distributed inference in which the participants are only machines or electronic devices, such as sensors, has been explored extensively in the signal processing and machine learning literature. However, ...
Microsoft has come out swinging in the battle of custom hyperscale silicon, debuting its “AI inference powerhouse” Maia 200 accelerator. Built on Taiwan Semiconductor Manufacturing Company's (TSMC) 3nm ...