Cloud

AI inference strains enterprise infrastructure—F5 says telcos can help

By Mitch Wagner May 5, 2026 10:00am

artificial intelligence (AI) F5 inference cloud infrastructure

A telco tower routes AI inference traffic between enterprise buildings and edge devices, showing telcos' opportunity as AI in — Enterprises running multiple AI models in production face a distributed systems challenge. And telcos may be able to help. (Google Gemini)

F5 research finds 78% of enterprises run AI inference, creating distributed system complexity
Telcos are positioned to evolve beyond connectivity into AI infrastructure hosting, offering compute, routing and sovereign cloud services
Identity and intent — what F5 calls "layer eight" — are emerging as the next AI security frontier, beyond what today's tools handle

Enterprises have gone beyond AI pilots and proofs of concept and are deploying AI at scale, creating management complexity greater than most organizations anticipated.

F5's 2026 State of Application Strategy report finds that organizations face challenges routing traffic between models based on application requirements, securing traffic and governing AI in production, Shawn Wormke, F5 SVP of product management, told Fierce.

The report, based on responses from more than 1,100 IT decision-makers globally, found that 78% of organizations now run their own AI inference rather than relying on public AI services, and that enterprises are operating an average of seven AI models simultaneously in production.

The effect is to transform AI into a distributed systems problem. As with hybrid multi-cloud application delivery — a challenge enterprises and their service providers have been navigating for years — managing inference across multiple models demands routing, fallback logic, security, observability and governance at scale. The report found that 52% of organizations are already chaining or orchestrating multiple models, and that multi-model orchestration has outpaced even prompt engineering as the leading adaptation technique, suggesting enterprises are designing systems rather than tuning individual models.

What this means for telcos

For communications service providers, the implications cut both operationally and commercially.

On the operational side, inference is following the same trajectory as general application workloads — spreading from centralized data centers toward edges, devices and distributed environments. As agentic AI frameworks proliferate across multiple systems, the volume and latency sensitivity of AI traffic will place new demands on network infrastructure, Wormke said.

"Service providers are going to have to get away from just being dumb pipes and be able to provide higher-level services to their enterprise customers," he said. He noted that sovereign cloud deployments are already accelerating in Europe and Asia, and that neoclouds are emerging directly from telco companies in Asia as a result.

Layer eight: context, intent and the limits of today's security

The deepest challenge ahead, Wormke said, is what F5 internally calls "layer eight" — context and intent.

Layer seven of the OSI model covers what happens at the application protocol layer. Layer eight addresses the non-deterministic nature of AI: the need to understand the original intent of an agentic action and detect when that intent has drifted during execution.

Because AI systems are non-deterministic, evaluating individual transactions in isolation is insufficient. Layer eight requires assembling context across an entire interaction — agent identity, permissions, the original instruction and the actual outcome — to render a defensible security judgment. That is not a problem today's security tooling is built to solve.

artificial intelligence (AI) F5 inference cloud infrastructure enterprise Fierce Network Research Bulletin Cloud

AI inference strains enterprise infrastructure—F5 says telcos can help

Related

What this means for telcos

Related

Related

Layer eight: context, intent and the limits of today's security

Related