NVLink: the fast link between GPUs

NVLink is NVIDIA's high-speed interconnect between graphics processors (GPUs), letting cards exchange data far faster than the standard PCIe system bus. It matters when a model is split across several GPUs. A DGX Spark is a single chip with one shared memory pool, so there is no GPU-to-GPU NVLink inside it.

What does NVLink actually do?

NVLink is a direct, high-speed connection between NVIDIA graphics processors (GPUs). By default, two cards in one machine talk over PCIe, the standard system bus, which is comparatively slow. NVLink is a dedicated link that lets them exchange data far faster. That only matters when you have a reason to make two cards work as one: usually a model is too big for a single card, so you split it across two and the pieces have to exchange intermediate results for every token they generate.
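To make the per-token cost concrete, here is a minimal back-of-the-envelope sketch in Python. It is not a benchmark: the function names are hypothetical, the model ignores latency and overlap with compute, and the payload size, transfer count, and both bandwidth figures are made-up round numbers chosen only to show the arithmetic. They are not NVLink or PCIe specifications.

```python
def transfer_ms(payload_bytes: float, bandwidth_gb_per_s: float) -> float:
    """Milliseconds to move payload_bytes over a link (bandwidth only, no latency)."""
    if bandwidth_gb_per_s <= 0:
        raise ValueError("bandwidth must be positive")
    seconds = payload_bytes / (bandwidth_gb_per_s * 1e9)
    return seconds * 1e3


def per_token_overhead_ms(
    payload_bytes: float, transfers_per_token: int, bandwidth_gb_per_s: float
) -> float:
    """Total link time per generated token when the split model syncs repeatedly."""
    return transfers_per_token * transfer_ms(payload_bytes, bandwidth_gb_per_s)


if __name__ == "__main__":
    # All four values below are illustrative assumptions, not measured figures.
    payload = 1e6        # bytes exchanged per sync between the two cards
    syncs = 64           # syncs needed to produce one token
    slow_link = 25.0     # GB/s, stand-in for a slower system bus
    fast_link = 250.0    # GB/s, stand-in for a dedicated GPU-to-GPU link

    for name, bw in (("slow link", slow_link), ("fast link", fast_link)):
        print(f"{name}: {per_token_overhead_ms(payload, syncs, bw):.3f} ms per token")
```

Because the cost is paid on every token, even a small per-sync difference adds up over a long generation, which is why the link speed matters once a model is split.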

Why does it barely apply to a DGX Spark?

A DGX Spark is a single chip. The GPU and the processor (CPU) share one pool of memory, so there is nothing to bridge: no second card, no card-to-card link, no GPU-to-GPU NVLink. The whole reason NVLink exists is to paper over the gap between separate cards, and a Spark has no gap to paper over.

So if you read a multi-GPU guide that leans on NVLink and try to map it onto a Spark, the advice will not transfer. The concept you want there is not NVLink but the shared unified memory pool. NVLink becomes relevant the moment you step up to a box with two or more discrete cards, and not a moment before.
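The rule of thumb above can be written as a tiny decision helper. This is only a sketch of the reasoning, not a detection tool: the `Machine` type, its fields, and the returned labels are all hypothetical names introduced for illustration, and the example machines are described by hand rather than queried from hardware.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Machine:
    """Hand-written description of a box (hypothetical type, not a real API)."""
    name: str
    discrete_gpus: int         # separate GPU cards that would need a link between them
    shared_memory_pool: bool   # True when GPU and CPU share one unified pool


def nvlink_relevant(machine: Machine) -> bool:
    """NVLink only matters once there are at least two discrete cards to connect."""
    return machine.discrete_gpus >= 2


def concept_to_study(machine: Machine) -> str:
    """Which idea a reader should focus on for this machine."""
    if nvlink_relevant(machine):
        return "NVLink"
    if machine.shared_memory_pool:
        return "shared unified memory pool"
    return "single card, no interconnect to tune"


if __name__ == "__main__":
    # Example descriptions are assumptions made for this sketch.
    spark = Machine("DGX Spark", discrete_gpus=0, shared_memory_pool=True)
    dual = Machine("two-card workstation", discrete_gpus=2, shared_memory_pool=False)
    for m in (spark, dual):
        print(f"{m.name}: {concept_to_study(m)}")
```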
