A new technical paper titled “Pushing the Envelope of LLM Inference on AI-PC and Intel GPUs” was published by researchers at ...
Smaller models, lightweight frameworks, specialized hardware, and other innovations are bringing AI out of the cloud and into ...
“Efficient SLM Edge Inference via Outlier-Aware Quantization and Emergent Memories Co-Design” was published by researchers at ...
As AI workloads move from centralised training to distributed inference, the industry’s fibre infrastructure challenge is changing ...
Edge AI addresses high-performance, low-latency requirements by embedding intelligence directly into industrial devices.
Cloudflare’s (NET) AI inference strategy has differed from that of the hyperscalers: instead of renting out server capacity and aiming to earn multiples on hardware costs, as the hyperscalers do, Cloudflare ...
Cloudflare joins MLflow as an active contributor to bridge the gap between model training and deploying inference on the edge SAN FRANCISCO--(BUSINESS WIRE)--Cloudflare, Inc. (NYSE: NET), the leading ...
New service gives companies the ability to realize a 3x improvement in throughput, 60% lower latency, and 86% lower cost than traditional hyperscale infrastructure CAMBRIDGE, Mass., March 27, 2025 ...
Ultralytics, the global leader in open-source vision AI, today announced the launch of Ultralytics YOLO26, the most advanced and deployable YOLO (You Only Look Once) model to date. Engineered from the ...
Moonshot Energy, QumulusAI (QAI Moon), and Connected Nation Internet Exchange Points (IXP.us) collaborated on a nationwide AI ...