Chat With RTX brings custom local chatbots to Nvidia AI PCs

Are you looking to showcase your brand in front of the gaming industry’s top leaders? Learn more about GamesBeat Summit sponsorship opportunities here.

Personalized AI experience

Nvidia is introducing Chat with RTX to create personalized local AI chatbots on Windows AI PCs.

It’s the latest attempt by Nvidia to turn AI on its graphics processing units (GPUs) into a mainstream tool used by everyone.

The new offering, Chat with RTX, allows users to harness the power of personalized generative AI directly on their local devices, showcasing the potential of retrieval-augmented generation (RAG) and TensorRT-LLM software. At the same time, it doesn’t burn up a lot of data center computing and it helps with local privacy so that users don’t have to worry about their AI chats.

Chatbots have become an integral part of daily interactions for millions globally, typically relying on cloud servers with Nvidia GPUs. However, the Chat with RTX tech demo shifts this paradigm by enabling users to enjoy the benefits of generative AI locally, using the processing power of Nvidia GeForce RTX 30 Series GPUs or higher with a minimum of 8GB of video random access memory (VRAM).

Integration of multimedia content

What sets Chat with RTX apart is its ability to include information from multimedia sources, particularly YouTube videos and playlists, Nvidia said.

Users can integrate knowledge from video content into their chatbot, enabling contextual queries. For instance, users can seek travel recommendations based on their favorite influencer’s videos or obtain quick tutorials and how-tos from educational resources.

The application’s local processing capabilities ensure fast results, and importantly, user data stays on the device. By eliminating the need for cloud-based services, Chat with RTX allows users to handle sensitive data without sharing it with third parties or requiring an internet connection.

System requirements and future possibilities

To experience Chat with RTX, users need a GeForce RTX 30 Series GPU or higher with a minimum of 8GB of VRAM, along with Windows 10 or 11 and the latest Nvidia GPU drivers.

Developers can explore the potential of accelerating large language models (LLMs) with RTX GPUs by referring to the TensorRT-LLM RAG developer reference project available on GitHub. Nvidia encourages developers to participate in the Generative AI on Nvidia RTX developer contest, running until February 23, offering opportunities to win prizes such as a GeForce RTX 4090 GPU and a full, in-person conference pass to Nvidia GTC.

What's Hot

OnePlus 13 cameras are modest, but promise big improvements

NYT Connections: hints and answers for Saturday, October 26

The Hinterlands are the best part of Dragon Age: Inquisition

Chat With RTX brings custom local chatbots to Nvidia AI PCs

Can’t make it to PGC Helsinki? Join us from anywhere in the world with a Virtual Access ticket!

The Asus ROG Zephyrus M16 is on sale — don’t buy the 4090 configuration

Sony explains why PS5 Pro is so expensive

The Asus ROG Zephyrus G16 with Copilot+ is on sale today

OnePlus 13 cameras are modest, but promise big improvements

NYT Connections: hints and answers for Saturday, October 26

The Hinterlands are the best part of Dragon Age: Inquisition

Android 15: everything you need to know

OnePlus 13 cameras are modest, but promise big improvements

NYT Connections: hints and answers for Saturday, October 26

The Hinterlands are the best part of Dragon Age: Inquisition

Subscribe to Updates

What's Hot

Chat With RTX brings custom local chatbots to Nvidia AI PCs

GB Event

Personalized AI experience

Integration of multimedia content

System requirements and future possibilities

Related Posts