AI workloads require new network build-outs

According to the new AI Networks for AI Workloads report by Dell’Oro Group, spending on switches deployed in AI back-end networks is forecast to expand the Data Center Switch Market by 50 percent.

  • Tuesday, 16th January 2024 Posted 1 year ago in by Phil Alsop

Current data center switch market spending is on front-end networks used primarily to connect general-purpose servers. AI workloads will require a new back-end infrastructure buildout. The competition intensifies between InfiniBand and Ethernet as manufacturers vie for market dominance in AI back-end networks. While InfiniBand is expected to maintain its lead, Ethernet is forecast to make substantial gains, such as 20 revenue-share points by 2027.

“Generative AI applications usher in a new era in the age of AI, standing out for the sheer number of parameters that they have to deal with,” said Sameh Boujelbene, Vice President at Dell’Oro Group. “Several large AI applications currently handle trillions of parameters, with this count increasing tenfold annually. This rapid growth necessitates the deployment of thousands or even hundreds of thousands of accelerated nodes. Connecting these accelerated nodes in large clusters requires a data center-scale fabric, known as the AI back-end network, which differs from the traditional front-end network used mostly to connect general-purpose servers.

“This predicament poses the pivotal question: what is the most suitable fabric that can scale to hundreds of thousands and potentially millions of accelerated nodes while ensuring the lowest Job Completion Time (JCT)? One could argue that Ethernet is one speed generation ahead of InfiniBand. Network speed, however, is not the only factor. Congestion control and adaptive routing mechanisms are also important. We analyzed AI back-end network build-outs by the major Cloud Service Providers (such as Google, Amazon, Microsoft, Meta, Alibaba, Tencent, ByteDance, Baidu, and others) as well as various considerations driving their choices of the back-end fabric to develop our forecast,” continued Boujelbene.

Additional highlights from the AI Networks for AI Workloads Report:

AI networks will accelerate the transition to higher speeds. For example, 800 Gbps is expected to comprise the majority of the ports in AI back-end networks by 2025, within just two years of the latest 800 Gbps product introduction.

While most of the market demand will come from Tier 1 Cloud Service Providers, Tier 2/3 and large enterprises are forecast to be significant, approaching $10 B over the next five years. The latter group will favor Ethernet.

Failure to prioritise testing and integrate generative AI tools raises concerns as agentic AI adds pressure.

CIOs 'overspend' on cloud

Posted 1 day ago by Phil Alsop
43% of CIOs say their CEOs and/or board of directors have concerns about their company’s cloud spend.
Research revealed at Coterie Connect event highlights shifting team structures, evolving skills priorities, and urgent training needed for partner...
Endava has launched its latest research report “AI and the Digital Shift: Reinventing the Business Landscape”.

3,000% surge in enterprise use of AI/ML tools

Posted 1 week ago by Phil Alsop
Zscaler has released the ThreatLabz 2025 AI Security Report, based on insights from more than 536 billion AI transactions processed between February...
Over one in four (28%) British small business owners have used AI tools to help run their business.

Tech fragmentation cited as biggest cyber challenge

Posted 1 week ago by Phil Alsop
New Palo Alto Networks data shows 82% of UK organisations confident in their use of AI, despite AI being identified as biggest cyber risk for 2025.
MIT researchers crafted a new approach that could allow anyone to run operations on encrypted data without decrypting it first.