Because the world rushes to utilize the most recent wave of AI applied sciences, one piece of high-tech {hardware} has change into a surprisingly scorching commodity: the graphics processing unit, or GPU.
A top-of-the-line GPU can promote for tens of hundreds of {dollars}, and main producer Nvidia has seen its market valuation soar previous $2 trillion as demand for its merchandise surges.
GPUs aren’t simply high-end AI merchandise, both. There are much less highly effective GPUs in telephones, laptops, and gaming consoles, too.
By now you’re most likely questioning: What’s a GPU, actually? And what makes them so particular?
What Is a GPU?
GPUs had been initially designed primarily to rapidly generate and show advanced 3D scenes and objects, resembling these concerned in video video games and computer-aided design software program. Trendy GPUs additionally deal with duties resembling decompressing video streams.
The “mind” of most computer systems is a chip known as a central processing unit (CPU). CPUs can be utilized to generate graphical scenes and decompress movies, however they’re usually far slower and fewer environment friendly at these duties in comparison with GPUs. CPUs are higher fitted to basic computation duties, resembling phrase processing and shopping net pages.
How Are GPUs Totally different From CPUs?
A typical fashionable CPU is made up of between 8 and 16 “cores,” every of which might course of advanced duties in a sequential method.
GPUs, however, have hundreds of comparatively small cores, that are designed to all work on the similar time (“in parallel”) to realize quick total processing. This makes them well-suited for duties that require numerous easy operations which will be executed on the similar time, quite than one after one other.
Conventional GPUs are available two important flavors.
First, there are standalone chips, which regularly are available add-on playing cards for giant desktop computer systems. Second are GPUs mixed with a CPU in the identical chip bundle, which are sometimes present in laptops and sport consoles such because the PlayStation 5. In each circumstances, the CPU controls what the GPU does.
Why Are GPUs So Helpful for AI?
It seems GPUs will be repurposed to do greater than generate graphical scenes.
Most of the machine studying strategies behind synthetic intelligence, resembling deep neural networks, rely closely on numerous types of matrix multiplication.
It is a mathematical operation the place very giant units of numbers are multiplied and summed collectively. These operations are well-suited to parallel processing and therefore will be carried out in a short time by GPUs.
What’s Subsequent for GPUs?
The number-crunching prowess of GPUs is steadily growing because of the rise within the variety of cores and their working speeds. These enhancements are primarily pushed by enhancements in chip manufacturing by corporations resembling TSMC in Taiwan.
The dimensions of particular person transistors—the fundamental parts of any pc chip—is reducing, permitting extra transistors to be positioned in the identical quantity of bodily area.
Nonetheless, that isn’t the complete story. Whereas conventional GPUs are helpful for AI-related computation duties, they don’t seem to be optimum.
Simply as GPUs had been initially designed to speed up computer systems by offering specialised processing for graphics, there are accelerators which might be designed to hurry up machine studying duties. These accelerators are sometimes called knowledge middle GPUs.
A few of the hottest accelerators, made by corporations resembling AMD and Nvidia, began out as conventional GPUs. Over time, their designs developed to higher deal with numerous machine studying duties, for instance by supporting the extra environment friendly “mind float” quantity format.
Different accelerators, resembling Google’s tensor processing items and Tenstorrent’s Tensix cores, had been designed from the bottom on top of things up deep neural networks.
Information middle GPUs and different AI accelerators usually include considerably extra reminiscence than conventional GPU add-on playing cards, which is essential for coaching giant AI fashions. The bigger the AI mannequin, the extra succesful and correct it’s.
To additional velocity up coaching and deal with even bigger AI fashions, resembling ChatGPT, many knowledge middle GPUs will be pooled collectively to kind a supercomputer. This requires extra advanced software program to correctly harness the out there quantity crunching energy. One other method is to create a single very giant accelerator, such because the “wafer-scale processor” produced by Cerebras.
Are Specialised Chips the Future?
CPUs haven’t been standing nonetheless both. Latest CPUs from AMD and Intel have built-in low-level directions that velocity up the number-crunching required by deep neural networks. This extra performance primarily helps with “inference” duties—that’s, utilizing AI fashions which have already been developed elsewhere.
To coach the AI fashions within the first place, giant GPU-like accelerators are nonetheless wanted.
It’s potential to create ever extra specialised accelerators for particular machine studying algorithms. Just lately, for instance, an organization known as Groq has produced a “language processing unit” (LPU) particularly designed for operating giant language fashions alongside the traces of ChatGPT.
Nonetheless, creating these specialised processors takes appreciable engineering sources. Historical past exhibits the utilization and recognition of any given machine studying algorithm tends to peak after which wane—so costly specialised {hardware} could change into rapidly outdated.
For the common client, nonetheless, that’s unlikely to be an issue. The GPUs and different chips within the merchandise you utilize are prone to preserve quietly getting quicker.
This text is republished from The Dialog underneath a Inventive Commons license. Learn the unique article.
Picture Credit score: Nvidia