
NVIDIA Launches Chat with RTX, a Free, Personalizable Large Language Model Chatbot for GeForce GPUs



NVIDIA has released a free tech demo, Chat with RTX, which lets users run a customized generative artificial intelligence (gen AI) chatbot on their local machine, provided they have an NVIDIA GeForce RTX 30-series GPU or higher with at least 8GB of video RAM (VRAM).

“Chat with RTX uses retrieval-augmented generation (RAG), NVIDIA TensorRT-LLM software, and NVIDIA RTX acceleration to bring generative AI capabilities to local, GeForce-powered Windows PCs,” NVIDIA’s Jesse Clayton explains. “Users can quickly and easily connect local files on a PC as a dataset to an open-source large language model like Mistral or Llama 2, enabling queries for fast, contextually relevant answers.”

NVIDIA wants to put a large language model on your GPU with Chat with RTX, a free tech demo for Windows machines. (📹: NVIDIA)

It is the customization aspect that NVIDIA hopes will make Chat with RTX stand out from the software-as-a-service offerings flooding the market: the chatbot can be linked to stores of local files, from plain text to Microsoft Word documents and PDFs, as well as YouTube videos and playlists, in order to provide knowledge and context missing from its training and improve its ability to formulate useful responses.
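To make the idea concrete, the sketch below shows, in very rough terms, how a folder of plain-text files could be turned into a searchable index for retrieval-augmented generation. It is a generic illustration rather than anything from Chat with RTX itself: the hashed bag-of-words embed() function is a deliberately crude stand-in for the real embedding model an application would use, and only .txt files are handled.

# Generic sketch: indexing local plain-text files for retrieval.
# embed() is a crude hashed bag-of-words stand-in for a real embedding model.
from pathlib import Path
import math

DIM = 256  # toy embedding dimension


def embed(text: str) -> list[float]:
    """Hash each word into a fixed-size, normalized vector (illustrative only)."""
    vec = [0.0] * DIM
    for word in text.lower().split():
        vec[hash(word) % DIM] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


def chunk(text: str, size: int = 1000) -> list[str]:
    """Split a document into fixed-size character chunks."""
    return [text[i:i + size] for i in range(0, len(text), size)]


def build_index(folder: str) -> list[tuple[list[float], str]]:
    """Embed every chunk of every .txt file found under `folder`."""
    index = []
    for path in Path(folder).rglob("*.txt"):
        text = path.read_text(encoding="utf-8", errors="ignore")
        index.extend((embed(c), c) for c in chunk(text))
    return index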

“Since Chat with RTX runs locally on Windows RTX PCs and workstations, the results it provides are fast, and the user’s data stays on the device,” Clayton adds. “Rather than relying on cloud-based LLM services, Chat with RTX lets users process sensitive data on a local PC without the need to share it with a third party or have an internet connection.”

While Chat with RTX is described by the company as a “tech demo” (compatible with Windows 10 or higher, NVIDIA GeForce RTX 30-series GPUs with at least 8GB of VRAM, and the company’s latest graphics card drivers), NVIDIA is hoping it will lead to more.

“Chat with RTX shows the potential of accelerating LLMs with RTX GPUs,” Clayton says. “The app is built from the TensorRT-LLM RAG developer reference project, available on GitHub. Developers can use the reference project to develop and deploy their own RAG-based applications for RTX, accelerated by TensorRT-LLM.”
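Continuing the generic sketch above, the query side of a RAG-based application looks roughly like the following: embed the question, retrieve the best-matching chunks from the index, and hand them to a locally running model as context. The generate() function here is a hypothetical placeholder for whatever local inference call is used (a Mistral or Llama 2 model served on the GPU, for instance); it is not an API from the reference project.

# Generic sketch: answering a question with retrieval-augmented generation.
# Relies on embed() and build_index() from the indexing sketch above.


def generate(prompt: str) -> str:
    """Hypothetical placeholder for a call into a locally served model."""
    raise NotImplementedError("plug in a local LLM inference call here")


def top_k(index: list[tuple[list[float], str]], query_vec: list[float], k: int = 3) -> list[str]:
    """Return the k chunks whose embeddings best match the query vector."""
    scored = [(sum(a * b for a, b in zip(vec, query_vec)), text) for vec, text in index]
    return [text for _, text in sorted(scored, reverse=True)[:k]]


def answer(question: str, index: list[tuple[list[float], str]]) -> str:
    """Embed the question, retrieve matching chunks, and prompt the model with them."""
    context = "\n---\n".join(top_k(index, embed(question)))
    prompt = (
        "Answer the question using only the context below.\n\n"
        f"Context:\n{context}\n\nQuestion: {question}\nAnswer:"
    )
    return generate(prompt)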

Chat with RTX is now available to download from the NVIDIA website.



