AMD debuts AMD Intuition MI350 Collection accelerator chips with 35X higher inferencing

AMD unveiled its complete end-to-end built-in AI platform imaginative and prescient and launched its open, scalable rack-scale AI infrastructure constructed on {industry} requirements at its annual Advancing AI occasion.

The Santa Clara, California-based chip maker introduced its new AMD Intuition MI350 Collection accelerators, that are 4 occasions sooner on AI compute and 35 occasions sooner on inferencing than prior chips.

AMD and its companions showcased AMD Intuition-based merchandise and the continued development of the AMD ROCm ecosystem. It additionally confirmed its highly effective, new, open rack-scale designs and roadmap that convey management Rack Scale AI efficiency past 2027.

“We are able to now say we’re on the inference inflection level, and it will likely be the driving force,” mentioned Lisa Su, CEO of AMD, in a keynote on the Advancing AI occasion.

In closing, in a jab at Nvidia, she mentioned, “The way forward for AI is not going to be constructed by anybody firm or inside a closed system. Will probably be formed by open collaboration throughout the {industry} with everybody bringing their finest concepts.”

AMD unveiled the Intuition MI350 Collection GPUs, setting a brand new benchmark for efficiency, effectivity and scalability in generative AI and high-performance computing. The MI350 Collection, consisting of each Intuition MI350X and MI355X GPUs and platforms, delivers a 4 occasions generation-on-generation AI compute improve and a 35 occasions generational leap in inferencing, paving the best way for transformative AI options throughout industries.

“We’re tremendously excited concerning the work you might be doing at AMD,” mentioned Sam Altman, CEO of Open AI, on stage with Lisa Su.

He mentioned he couldn’t consider it when he heard concerning the specs for MI350 from AMD, and he was grateful that AMD took his firm’s suggestions.

AMD mentioned its newest Intuition GPUs can beat Nvidia chips.

AMD demonstrated end-to-end, open-standards rack-scale AI infrastructure—already rolling out with AMD Intuition MI350 Collection accelerators, fifth Gen AMD Epyc processors and AMD Pensando Pollara community interface playing cards (NICs) in hyperscaler deployments resembling Oracle Cloud Infrastructure (OCI) and set for broad availability in 2H 2025. AMD additionally previewed its subsequent era AI rack referred to as Helios.

Will probably be constructed on the next-generation AMD Intuition MI400 Collection GPUs, the Zen 6-based AMD Epyc Venice CPUs and AMD Pensando Vulcano NICs.

“I believe they’re focusing on a special kind of buyer than Nvidia,” mentioned Ben Bajarin, analyst at Artistic Methods, in a message to GamesBeat. “Particularly I believe they see the neocloud alternative and a complete host of tier two and tier three clouds and the on-premise enterprise deployments.”

Bajarin added, “We’re bullish on the shift to full rack deployment methods and that’s the place Helios suits through which will align with Rubin timing. However because the market shifts to inference, which we’re simply initially with, AMD is effectively positioned to compete to seize share. I additionally suppose, there are many prospects on the market who will worth AMD’s TCO the place proper now Nvidia could also be overkill for his or her workloads. In order that is space to observe, which once more will get again to who the proper buyer is for AMD and it is perhaps a really totally different buyer profile than the client for Nvidia.”

The newest model of the AMD open-source AI software program stack, ROCm 7, is engineered to satisfy the rising calls for of generative AI and high-performance computing workloads— whereas dramatically enhancing developer expertise throughout the board. (Radeon Open Compute is an open-source software program platform that enables for GPU-accelerated computing on AMD GPUs, significantly for high-performance computing and AI workloads). ROCm 7 options improved help for industry-standard frameworks, expanded {hardware} compatibility, and new improvement instruments, drivers, APIs and libraries to speed up AI improvement and deployment.

In her keynote, Su mentioned, “Opennesss ought to be greater than only a buzz phrase.”

The Intuition MI350 Collection exceeded AMD’s five-year objective to enhance the vitality effectivity of AI coaching and high-performance computing nodes by 30 occasions, finally delivering a 38 occasions enchancment. AMD additionally unveiled a brand new 2030 objective to ship a 20 occasions improve in rack-scale vitality effectivity from a 2024 base yr, enabling a typical AI mannequin that at the moment requires greater than 275 racks to be skilled in fewer than one totally utilized rack by 2030, utilizing 95% much less electrical energy.

AMD additionally introduced the broad availability of the AMD Developer Cloud for the worldwide developer and open-source communities. Objective-built for speedy, high-performance AI improvement, customers may have entry to a totally managed cloud atmosphere with the instruments and suppleness to get began with AI tasks – and develop with out limits. With ROCm 7 and the AMD Developer Cloud, AMD is decreasing boundaries and increasing entry to next-gen compute. Strategic collaborations with leaders like Hugging Face, OpenAI and Grok are proving the ability of co-developed, open options. The announcement bought some cheers from people within the viewers, as the corporate mentioned it might give attendees developer credit.

Broad Companion Ecosystem Showcases AI Progress Powered by AMD

AMD prospects mentioned how they’re utilizing AMD AI options to coach at the moment’s main AI fashions, energy inference at scale and speed up AI exploration and improvement.

Meta detailed the way it has leveraged a number of generations of AMD Intuition and Epyc options throughout its information heart infrastructure, with Intuition MI300X broadly deployed for Llama 3 and Llama 4 inference. Meta continues to collaborate carefully with AMD on AI roadmaps, together with plans to leverage MI350 and MI400 Collection GPUs and platforms.

Oracle Cloud Infrastructure is among the many first {industry} leaders to undertake the AMD open rack-scale AI infrastructure with AMD Intuition MI355X GPUs. OCI leverages AMD CPUs and GPUs to ship balanced, scalable efficiency for AI clusters, and introduced it’s going to provide zettascale AI clusters accelerated by the newest AMD Intuition processors with as much as 131,072 MI355X GPUs to allow prospects to construct, prepare, and inference AI at scale.

AMD says its Intuition GPUs are extra environment friendly than Nvidia’s.

Microsoft introduced Intuition MI300X is now powering each proprietary and open-source fashions in manufacturing on Azure.

HUMAIN mentioned its landmark settlement with AMD to construct open, scalable, resilient and cost-efficient AI infrastructure leveraging the total spectrum of computing platforms solely AMD can present.Cohere shared that its high-performance, scalable Command fashions are deployed on Intuition MI300X, powering enterprise-grade LLM inference with excessive throughput, effectivity and information privateness.

Within the keynote, Pink Hat described how its expanded collaboration with AMD allows production-ready AI environments, with AMD Intuition GPUs on Pink Hat OpenShift AI delivering highly effective, environment friendly AI processing throughout hybrid cloud environments.

“They’ll get probably the most out of the {hardware} they’re utilizing,” mentioned the Pink Hat exec on stage.

Astera Labs highlighted how the open UALink ecosystem accelerates innovation and delivers better worth to prospects and shared plans to supply a complete portfolio of UALink merchandise to help next-generation AI infrastructure.Marvell joined AMD to share the UALink change roadmap, the primary really open interconnect, bringing the last word flexibility for AI infrastructure.