Juniper Networks introduces Ops4AI Lab

Accelerated time-to-value with assured Networking for AI configurations using Juniper, AMD, Broadcom, Intel, NVIDIA.

  • Wednesday, 17th July 2024 Posted 4 months ago in by Phil Alsop

Juniper Networks has introduced what it says is the first and most comprehensive multivendor lab for validating end-to-end automated AI Data Center solutions and automated operations with switching, routing, storage and compute solutions from leading vendors, as well as new Juniper Validated Designs (JVDs) that accelerate the time-to-value in deploying AI clusters. In addition, Juniper is releasing new key software enhancements that optimize the performance and management of AI workloads over Ethernet. Through these Operations for AI—Ops4AI—initiatives, Juniper is collaborating closely with a broad range of infrastructure ecosystem partners to enable the best AI workload performance via the most flexible and easiest-to-manage data center infrastructures.

As a key element of Juniper’s AI-Native Networking Platform, the existing Networking for AI solution consists of a spine-leaf data center architecture with a foundation of AI-optimized 400G and 800G QFX Series Switches and PTX Series Routers. The solution is secured via high performance firewalls with industry-leading effectiveness, and managed via Juniper Apstra

data center assurance software and the Marvis Virtual Network Assistant (VNA). Juniper Apstra and Marvis provide key Ops4AI capabilities, such as intent-based networking, multivendor switch management, application / flow / workload awareness, AIOps proactive actions and a GenAI conversational interface. With Juniper’s full Networking for AI solution, customers and partners can lower AI training Job Completion Times (JCTs), reduce latency during inferencing and increase GPU utilization while decreasing deployment times by up to 85 percent and reducing operations costs by up to 90 percent in some instances.

To simplify AI clusters and maximize network performance even further, Juniper has added new Ops4AI software enhancements that together offer unique value for customers. The enhancements being announced today include:

Fabric autotuning for AI: Telemetry from routers and switches are used to automatically calculate and configure optimal parameter settings for congestion control in the fabric using closed-loop automation capability in Juniper Apstra to deliver peak AI workload performance.

Global load-balancing: An end-to-end view of congestion hotspots in the network (i.e. local and downstream switches) is used to load-balance AI traffic in real-time, delivering lower latency, better network utilization and reduced JCTs.

End-to-end visibility from network to SmartNICs: Provides an end-to-end holistic view of the network, including SmartNICs from NVIDIA (BlueField and ConnectX), and others.

Industry’s first multivendor Ops4AI lab to collaborate with ecosystem and validate operations

Openness and collaboration are core to Juniper’s networking mission as they are the only way to move AI Data Centers from their current early adopter stage to effective mass market deployments. End-to-end operations for multivendor AI Data Center infrastructure has been difficult, leading to vertically integrated AI Data Center solutions that are vendor-locked and lead-time challenged. As a result, Juniper has launched the industry's first Ops4AI Lab with participation from Juniper’s partner ecosystem including Broadcom, Intel, NVIDIA, WEKA and other industry leaders. The Ops4AI Lab, located at Juniper’s Sunnyvale, CA corporate headquarters, is open for all qualified customers and partners who want to test their own AI workloads using the most advanced GPU compute, storage technologies, Ethernet-based networking fabrics and automated operations. Ops4AI Lab testing using validated Ethernet fabrics delivers comparable performance to InfiniBand-based AI infrastructure.

Users requesting a slot in the Juniper Ops4AI lab should contact their local Juniper Networks sales team.

Juniper Validated Designs to provide assurance

Juniper Validated Designs are detailed implementation documents that give new customers confidence that the solution and topology they have chosen is well characterized, well tested and repeatable, resulting in faster time to successful deployment. All JVDs are proven integrated solutions, tested in best practice designs based on specific platforms and software versions.

Juniper has released the first pre-validated blueprint specifically for AI data centers, built on NVIDIA A100 and H100 compute, storage from Juniper’s ecosystem partners, and Juniper’s portfolio of data center leaf and spine switches. This new Ops4AI JVD complements Juniper’s existing JVDs for automated, secure data centers which include QFX and PTX spines, QFX leaf switching, data center automation, and Juniper’s SRX and vSRX/cSRX solutions for data center security.

Register for Premier Virtual Networking for AI Event on July 23

Organizations are invited to join the CUBE’s Bob Laliberte and Juniper AI experts on July 23 for Juniper’s Seize the AI Moment virtual event, a deep dive into the rapidly evolving AI Data Center ecosystem with AMD, Broadcom, ePlus, Intel, WEKA, and AI Data Center end-users Deutsche Bahn and PayPal. Attendees can learn how these extraordinary industry leaders and customers are creating sustainable, high-performance AI Data Centers purpose-built for today and for the future. 

Guardz expands in EMEA

Posted 4 days ago by Phil Alsop
Through a new partnership with Infinigate Cloud, Guardz will help to secure SMBs and support the MSP community across EMEA.
Data centre operators can now achieve the unparalleled speeds needed for the most demanding Artificial Intelligence (AI) applications, thanks to a...

Dell Technologies boosts AI for enterprises

Posted 4 days ago by Phil Alsop
Dell Technologies continues to make enterprise AI adoption easier with the Dell AI Factory, expanding the world’s broadest AI solutions portfolio....

AMD accelerates Exascale Computing

Posted 4 days ago by Phil Alsop
El Capitan, powered by the AMD Instinct MI300A APU, becomes the second AMD supercomputer to surpass the Exascale barrier, placing #1 on the Top500...
Global system integrator won over by simplicity, security and speed of the Cloudbrink service.
The Seeq platform will be leveraged to maximize production and increase energy efficiency across the largest biorefinery in Europe.
This global service forms part of the recently launched Intelligent Security portfolio and increases Logicalis' proactive threat-hunting capabilities...

Pure Storage invests in CoreWeave

Posted 5 days ago by Phil Alsop
Pure Storage and CoreWeave have announced Pure Storage’s strategic investment in CoreWeave to accelerate AI cloud services innovation. Alongside...