UPDATED 10:44 EDT / APRIL 03 2024

Martin Yip, head of product marketing, EC2 compute and networking services at AWS, discusses Nvidia GPUs and AI AI

The Nvidia and AWS blueprint for a GPU-powered future

Over time, materials such as coal, steel and crude oil have served as the global economy’s backbone. Today, that mantle falls on the silicon chips that power enterprise-critical servers and graphics processing units.

Enter Amazon Web Services Inc. and its Ceiba project. Underpinned by Nvidia GPU tech, it aims to enhance control, security and performance as companies embed AI deeper into their operations.

“We have a great partnership with Nvidia, and we are bringing a lot of their new GPUs to the cloud,” said Martin Yip (pictured), head of product marketing, EC2 compute and networking services at AWS. “One of the big announcements is that we are working with them on something called Project Ceiba, which is building a supercomputer for them in the cloud using their Grace Blackwell architecture [and] bringing that platform to AWS for the use of Nvidia‘s R&D.”

Yip spoke with theCUBE’s chief analyst Dave Vellante at the Nvidia GTC event, during an exclusive broadcast on theCUBE, SiliconANGLE Media’s livestreaming studio. They discussed the hardware/software synergy driving the next wave of AIOps.

Empowering enterprises: The AWS AI stack

Moore’s law is on steroids right now. Microchips are progressing at a quicker rate than ever and spurring the current AI boom. To bring order to the beautiful chaos, AWS envisions a comprehensive AI stack designed to empower developers and enterprises alike. With a focus on choice and flexibility, AWS enables customers to unlock the full potential of AI, according to Yip.

“We’re going to see an explosion in new applications and customers figuring out how generative AI will fit into customer experiences,” Yip said. “Whether it is building new models, making it easier for customers to do something like create an image or summarize a text or something else that we haven’t thought of yet, it will be interesting in the next couple of years.”

Companies understand the nuances of training and inference workloads, ideally with low latency and real-time responsiveness. With specialized instances tailored to distinct requirements, AWS ensures optimal performance and efficiency in AI applications, Yip added.

Here’s the complete video interview, part of SiliconANGLE’s and theCUBE Research’s coverage of the Nvidia GTC event

Photo: SiliconANGLE

A message from John Furrier, co-founder of SiliconANGLE:

Your vote of support is important to us and it helps us keep the content FREE.

One click below supports our mission to provide free, deep, and relevant content.  

Join our community on YouTube

Join the community that includes more than 15,000 #CubeAlumni experts, including Amazon.com CEO Andy Jassy, Dell Technologies founder and CEO Michael Dell, Intel CEO Pat Gelsinger, and many more luminaries and experts.

“TheCUBE is an important partner to the industry. You guys really are a part of our events and we really appreciate you coming and I know people appreciate the content you create as well” – Andy Jassy

THANK YOU