The focus of artificial-intelligence spending has shifted from training models to using them. Here’s how to understand the ...
Mistral's Small 4 combines reasoning, multimodal analysis and agentic coding in a single open-source model with configurable ...
AWS partnered with Cerebras. Microsoft licensed Fireworks. Google built Ironwood. One week of announcements reveals who ...
Amazon Web Services says the partnership will allow it to offer lightning-fast inference computing.
Amazon and Cerebras launch a disaggregated AI inference solution on Amazon Bedrock, boosting inference speed 10x.
Intel's Xeon 6 processors have been selected as the host CPU for Nvidia's DGX Rubin NVL8 system — a move announced at GTC ...
The edge inference conversation has been dominated by latency. Read any survey paper, attend any infrastructure conference, and the opening argument is nearly always the same: cloud inference ...
NetApp (NTAP) offers an attractive valuation for a stock that has demonstrated stability and robust profitability, with ...
NVIDIA shifted the focus of GTC 2026 toward deploying AI inference apps across multiple industries, marking a departure from its ...