The Independent Traders

Tech News

Nvidia banking on TensorRT to expand generative AI dominance

Illustration by Alex Castro / The Verge

Nvidia looks to build a bigger presence outside GPU sales as it puts its AI-specific software development kit into more applications.

Nvidia announced that it is bringing its TensorRT-LLM SDK to Windows and extending TensorRT support to models like Stable Diffusion. The company said in a blog post that it aims to make large language models (LLMs) and related tools run faster.

TensorRT speeds up inference, the process in which a pretrained model calculates probabilities to come up with a result, such as a newly generated Stable Diffusion image. With this software, Nvidia wants to play a bigger part in the inference side of generative AI.

Its TensorRT-LLM breaks down LLMs and lets them run faster on Nvidia’s H100 GPUs. It works with LLMs like…
