NVIDIA Quadro RTX 4000

Turing GPU
2,304 NVIDIA® CUDA® Cores
288 NVIDIA® Tensor Cores
36 NVIDIA® RT Cores
8GB GDDR6 Memory
Up to 416GB/s Memory Bandwidth
43T RTX-OPS
6.0 Giga Rays/s Rays Cast
7.1 TFLOPS FP32 Performance
14.2 TFLOPS FP16 Performance
28.5 TOPS INT8 Performance
57.0 TFLOPS of Tensor Operation
Max. Power Consumption: 160W
3x DisplayPort 1.4
1x VirtualLink

The Most Advanced Single-slot Professional Graphics Solution

Quadro RTX 4000 combines the NVIDIA Turing GPU architecture with the latest memory and display technologies to deliver the best performance and features in a single-slot PCIe form factor. Enjoy greater fluidity with photorealistic rendering, experience faster performance with AI-enabled applications, and create detailed, lifelike VR experiences more cost-effectively and across a broader range of workstation chassis configurations.

Performance Features

Turing GPU Architecture

Built on a state-of-the-art 12nm FFN (FinFET NVIDIA) high-performance manufacturing process customized for NVIDIA and incorporating 2,304 CUDA cores, the Quadro RTX 4000 GPU is the most powerful computing platform for HPC, AI, VR and graphics workloads on professional desktops in a single-slot form factor. The Turing GPU architecture enables the biggest leap in real-time computer graphics rendering since NVIDIA’s invention of programmable shaders in 2001. It includes 13.6 billion transistors on a die size of 545 mm². Able to deliver more than 7.1 TFLOPS of single-precision (FP32), 14.2 TFLOPS of half-precision (FP16), 28.5 TOPS of integer (INT8), and 57.0 TFLOPS of tensor operation performance, it supports a wide range of compute-intensive workloads.
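
The headline throughput figures can be reproduced from the core count and clock speed. The following is a rough sketch only: it assumes a boost clock of about 1,545 MHz (a value not stated on this page) and per-core rates of 2, 4 and 8 operations per clock for FP32, FP16 and INT8, which are inferred from the ratios between the published numbers rather than taken from a documented rule.

```python
# Back-of-the-envelope check of the peak throughput figures.
# CUDA_CORES comes from the spec list; BOOST_CLOCK_HZ is an assumption.
CUDA_CORES = 2304
BOOST_CLOCK_HZ = 1.545e9  # assumed boost clock, not stated on this page


def peak_tera_ops(cores: int, clock_hz: float, ops_per_core_per_clock: int) -> float:
    """Return peak throughput in tera-operations per second."""
    return cores * ops_per_core_per_clock * clock_hz / 1e12


# One fused multiply-add counts as 2 operations. The FP16 and INT8 rates
# are inferred from the 2x and 4x ratios in the published figures.
fp32_tflops = peak_tera_ops(CUDA_CORES, BOOST_CLOCK_HZ, 2)
fp16_tflops = peak_tera_ops(CUDA_CORES, BOOST_CLOCK_HZ, 4)
int8_tops = peak_tera_ops(CUDA_CORES, BOOST_CLOCK_HZ, 8)

print(f"FP32: {fp32_tflops:.1f} TFLOPS")
print(f"FP16: {fp16_tflops:.1f} TFLOPS")
print(f"INT8: {int8_tops:.1f} TOPS")
```

Under these assumptions the results round to 7.1, 14.2 and 28.5, matching the listed values. Sustained performance depends on the workload and on how long the card holds its boost clock.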

RT Cores

New dedicated hardware-based ray-tracing technology allows the GPU, for the first time, to render film-quality, photorealistic objects and environments in real time with physically accurate shadows, reflections, and refractions. The real-time ray-tracing engine works with NVIDIA OptiX, Microsoft DXR, and Vulkan APIs to deliver a level of realism far beyond what is possible using traditional rendering techniques. RT Cores accelerate Bounding Volume Hierarchy (BVH) traversal and ray casting functions, using a low number of rays cast through each pixel.

Enhanced Tensor Cores

New mixed-precision cores purpose-built for deep learning matrix arithmetic, delivering 8x the TFLOPS for training compared to the previous generation. Quadro RTX 4000 utilizes 288 Tensor Cores; each Tensor Core performs 64 floating-point fused multiply-add (FMA) operations per clock, and each SM performs a total of 1,024 individual floating-point operations per clock. In addition to supporting FP16/FP32 matrix operations, the new Tensor Cores add INT8 (2,048 integer operations per clock) and experimental INT4 and INT1 (binary) precision modes for matrix operations.
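
The per-SM figures in this section can be cross-checked with simple arithmetic. The sketch below assumes 36 SMs (inferred from the 36 RT Cores listed for this card, at one RT Core per SM) and, for the final step only, a boost clock of roughly 1,545 MHz that this page does not state.

```python
# Cross-check of the Tensor Core arithmetic quoted in the text.
TENSOR_CORES = 288
SM_COUNT = 36             # assumption: one RT Core per SM, 36 RT Cores
FMA_PER_TENSOR_CORE = 64  # FMA operations per Tensor Core per clock
BOOST_CLOCK_HZ = 1.545e9  # assumed boost clock, not stated on this page

# Tensor Cores are spread evenly across the SMs.
tensor_cores_per_sm = TENSOR_CORES // SM_COUNT

# Each FMA is one multiply plus one add, so it counts as 2 operations.
fp16_ops_per_sm = tensor_cores_per_sm * FMA_PER_TENSOR_CORE * 2

# The text quotes an INT8 rate that is double the floating-point rate.
int8_ops_per_sm = fp16_ops_per_sm * 2

# Whole-chip peak tensor throughput at the assumed clock.
tensor_tflops = TENSOR_CORES * FMA_PER_TENSOR_CORE * 2 * BOOST_CLOCK_HZ / 1e12

print(tensor_cores_per_sm, fp16_ops_per_sm, int8_ops_per_sm)
print(f"{tensor_tflops:.1f} TFLOPS")
```

With these inputs the per-SM values come out to 1,024 floating-point and 2,048 INT8 operations per clock, and the chip-level figure rounds to the 57.0 TFLOPS listed in the specifications.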

Advanced Shading Technologies

Mesh Shading: Compute-based geometry pipeline to speed geometry processing and culling on geometrically complex models and scenes. Mesh shading provides up to 2x performance improvement on geometry-bound workloads.

Variable Rate Shading (VRS): Gain rendering efficiency by varying the shading rate based on scene content, direction of gaze, and motion. Variable rate shading provides similar image quality with a 50% reduction in shaded pixels.

Texture Space Shading: Object/texture space shading to improve the performance of pixel shader-heavy workloads such as depth-of-field and motion blur. Texture space shading provides greater throughput with increased fidelity by reusing pre-shaded texels for pixel shader-heavy VR workloads.

High Performance GDDR6 Memory

Built with Turing’s vastly optimized 8GB GDDR6 memory subsystem for the industry’s fastest graphics memory (416 GB/s peak bandwidth), Quadro RTX 4000 is the ideal platform for latency-sensitive applications handling large datasets. Quadro RTX 4000 delivers more than 70% more memory bandwidth than the previous generation.
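
Peak memory bandwidth is the bus width multiplied by the per-pin data rate. The sketch below uses the 256-bit memory interface from the specification table and assumes a GDDR6 data rate of 13 Gbps per pin, a figure this page does not state; the function name is illustrative.

```python
# Peak memory bandwidth from bus width and per-pin data rate.
BUS_WIDTH_BITS = 256   # memory interface width from the spec table
DATA_RATE_GBPS = 13.0  # assumed per-pin GDDR6 data rate, not stated here


def peak_bandwidth_gb_s(bus_width_bits: int, data_rate_gbps: float) -> float:
    """Return peak bandwidth in GB/s (bits per transfer / 8 * Gbps per pin)."""
    return bus_width_bits / 8 * data_rate_gbps


bandwidth = peak_bandwidth_gb_s(BUS_WIDTH_BITS, DATA_RATE_GBPS)

# "More than 70% more" implies the previous generation sat below this value.
implied_previous_max = bandwidth / 1.7

print(f"Peak bandwidth: {bandwidth:.0f} GB/s")
print(f"Implied previous-generation ceiling: {implied_previous_max:.0f} GB/s")
```

The first result is 416 GB/s, consistent with the quoted peak. The second is only an upper bound derived from the 70% claim, not a published specification.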

Single Instruction, Multiple Thread (SIMT)

New independent thread scheduling capability enables finer-grain synchronization and cooperation between parallel threads by sharing resources among small jobs.

Advanced Streaming Multiprocessor (SM) Architecture

Combined shared memory and L1 cache improve performance significantly, while simplifying programming and reducing the tuning required to attain the best application performance. Each SM contains 96 KB of L1/shared memory, which can be configured for various capacities depending on the compute or graphics workload. For compute workloads, up to 64 KB can be allocated to either the L1 cache or shared memory, while graphics workloads can allocate up to 48 KB for shared memory, 32 KB for L1 and 16 KB for texture units. Combining the L1 data cache with the shared memory reduces latency and provides higher bandwidth.
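
The partitions described above can be laid out as a small table and checked against the 96 KB total. This is an illustrative sketch: the two compute splits are one reading of "up to 64 KB", assuming the remaining 32 KB goes to the other role, and the dictionary keys are labels chosen here rather than official mode names.

```python
# Per-SM L1/shared memory partitions, in KB.
SM_L1_SHARED_KB = 96

# The compute entries assume whatever is not given to one role goes to the other.
CONFIGS = {
    "compute, shared-heavy": {"shared": 64, "l1": 32},
    "compute, L1-heavy": {"shared": 32, "l1": 64},
    "graphics": {"shared": 48, "l1": 32, "texture": 16},
}


def total_kb(config: dict) -> int:
    """Sum the capacity assigned to every role in one configuration."""
    return sum(config.values())


for name, config in CONFIGS.items():
    # Every partition must account for the full 96 KB in the SM.
    assert total_kb(config) == SM_L1_SHARED_KB, name
    print(f"{name}: {config}")
```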

Mixed-Precision Computing

Double the throughput and reduce storage requirements with 16-bit floating point precision computing to enable the training and deployment of larger neural networks. With independent parallel integer and floating-point data paths, the Turing SM is also much more efficient on workloads with a mix of computation and addressing calculations.

Graphics Preemption

Pixel-level preemption provides more granular control to better support time-sensitive tasks such as VR motion tracking.

Compute Preemption

Preemption at the instruction level provides finer-grained control over compute tasks to prevent long-running applications from either monopolizing system resources or timing out.

H.264 and HEVC Encode/Decode Engines

Deliver faster than real-time performance for transcoding, video editing, and other encoding applications with two dedicated H.264 and HEVC encode engines and a dedicated decode engine that are independent of the 3D/compute pipeline.

NVIDIA GPU BOOST 4.0

Automatically maximize application performance without exceeding the power and thermal envelope of the card. Allows applications to stay in the boost clock state longer, under a higher temperature threshold, before dropping to the base clock at a secondary temperature setting.

Image Quality

Full-Scene Antialiasing (FSAA)

Dramatically reduce visual aliasing artifacts or "jaggies" with up to 64X FSAA (128X with SLI) for unparalleled image quality and highly realistic scenes.

32K Texture and Render Processing

Texture from and render to 32K x 32K surfaces to support applications that demand the highest resolution and quality image processing.

Display Features

VirtualLink™

New open industry-standard connectivity for next-generation VR headsets, delivering four high-speed HBR3 DisplayPort lanes, a USB 3.1 data channel and up to 27 watts of power. This USB-C alternate mode is optimized for latency and bandwidth demands to deliver increased display resolution and incorporate high-bandwidth cameras for tracking and augmented reality with a VR headset.

Multi-View

Ability to render four separate views in a single pass to dramatically reduce the graphics pipeline workload and improve realism. With 2X the projection centers of the previous generation, the Simultaneous Multi-Projection (SMP) engine performs up to 2X the geometry rendering workload. This allows for more flexibility with position-independent views to produce more creative scenes.

DisplayPort 1.4

Support up to four 5K monitors @ 60Hz, or dual 8K displays per card. Quadro RTX 4000 supports HDR color for 4K @ 120Hz for 10/12b HEVC decode and up to 4K @ 60Hz for 10b HEVC encode. Each DisplayPort connector is capable of driving ultra-high resolutions of 4096x2160 @ 120 Hz with 30-bit color.

NVIDIA® Mosaic™ Technology

Transparently scale the desktop and applications across up to 4 GPUs and 16 displays from a single workstation while delivering full performance and image quality.

NVIDIA® Quadro Sync II

Synchronize the display and image output of up to 32 displays from 8 GPUs (connected through two Sync II boards) in a single system, reducing the number of machines needed to create an advanced video visualization environment.

NVIDIA® nView® Desktop Software

Gain unprecedented end-user control of the desktop experience for increased productivity in single large display or multi-display environments.

Frame Lock Connector Latch

Each frame lock connector is designed with a self-locking retention mechanism to secure its connection with the frame lock cable to provide robust connectivity and maximum productivity.

OpenGL Quad Buffered Stereo Support

Provide a smooth and immersive 3D Stereo experience for professional applications.

Ultra High Resolution Desktop Support

Get more Mosaic topology choices with high-resolution display devices and a 32K maximum desktop size.

Professional 3D Stereo Synchronization

Robust control of stereo effects through a dedicated connection to directly synchronize 3D stereo hardware to a Quadro graphics card.

Software Support

Turing Optimized Software

Deep learning frameworks such as Caffe2, MXNet, CNTK, TensorFlow, and others deliver dramatically faster training times and higher multi-node training performance. GPU-accelerated libraries such as cuDNN, cuBLAS, and TensorRT deliver higher performance for both deep learning inference and High-Performance Computing (HPC) applications.

NVIDIA® CUDA® Parallel Computing Platform

Natively execute standard programming languages like C/C++ and Fortran, and APIs such as OpenCL, OpenACC and DirectCompute, to accelerate techniques such as ray tracing, video and image processing, and computational fluid dynamics.

Unified Memory

A single, seamless 49-bit virtual address space allows for the transparent migration of data between the full allocation of CPU and GPU memory.
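
To put the 49-bit figure in perspective, the short sketch below converts it to a capacity and compares it with the card's 8 GB of on-board memory. It is plain arithmetic and treats the 8 GB frame buffer as 8 GiB for simplicity, which is an assumption.

```python
# Size of a 49-bit virtual address space.
VIRTUAL_ADDRESS_BITS = 49
GPU_MEMORY_GIB = 8  # on-board memory, treated as GiB for this comparison

# Each address names one byte, so 49 bits cover 2**49 bytes.
address_space_bytes = 2 ** VIRTUAL_ADDRESS_BITS
address_space_tib = address_space_bytes // 2 ** 40

# How many times the on-board memory fits into the address space.
ratio_to_gpu_memory = address_space_bytes // (GPU_MEMORY_GIB * 2 ** 30)

print(f"{address_space_tib} TiB addressable")
print(f"{ratio_to_gpu_memory}x the on-board memory")
```

The address space works out to 512 TiB, far more than the physical memory of the card and a typical host combined, which leaves room for CPU and GPU allocations to share one range of addresses.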

NVIDIA® GPUDirect for Video

GPUDirect for Video speeds communication between the GPU and video I/O devices by avoiding unnecessary system memory copies and CPU overhead.

NVIDIA Enterprise-Management Tools

Maximize system uptime, seamlessly manage wide-scale deployments and remotely control graphics and display settings for efficient operations.

NVIDIA Packaging and Accessories

NVIDIA Quadro RTX 4000
Quadro RTX Quick Start Guide
Quadro Support Guide
1 DisplayPort to DVI Adapter
1 DisplayPort to HDMI Adapter
1 USB-C to DisplayPort Adapter

CUDA Parallel Processing cores	2304
NVIDIA Tensor Cores	288
NVIDIA RT Cores	36
Frame Buffer Memory	8 GB GDDR6
RTX-OPS	43T
Rays Cast	6 Giga Rays/Sec
Peak Single Precision (FP32) Performance	7.1 TFLOPS
Peak Half Precision (FP16) Performance	14.2 TFLOPS
Peak Integer Operation (INT8) Performance	28.5 TOPS
Deep Learning TeraFLOPS¹	57.0 TFLOPS
Memory Interface	256-bit
Memory Bandwidth	Up to 416 GB/s
Max Power Consumption	160 W
Graphics Bus	PCI Express 3.0 x 16
Display Connectors	DP 1.4 (3) + VirtualLink (1)
Form Factor	4.4” H x 9.5” L
Product Weight	479 g
Thermal Solution	Active
NVIDIA® 3D Vision® and 3D Vision Pro	Support via 3-pin mini-DIN
Frame Lock	Compatible (with Quadro Sync II)
NVLink Interconnect	N/A
Power Connector	8-pin PCIe

¹ FP16 matrix multiply with FP16 or FP32 accumulate

Supported Platforms

Microsoft Windows 10 (64-bit)
Microsoft Windows 8 and 8.1 (64-bit)
Microsoft Windows 7 (64-bit)
Linux® - Full OpenGL implementation, complete with NVIDIA and ARB extensions (64-bit)

3D Graphics Architecture

Scalable geometry architecture
Hardware tessellation engine
NVIDIA® GigaThread™ engine with 3 async copy engines
Shader Model 5.1 (OpenGL 4.5 and DirectX 12)
Up to 32K x 32K texture and render processing
Transparent multisampling and super sampling
16x angle independent anisotropic filtering
32-bit per-component floating point texture filtering and blending
64x full scene antialiasing (FSAA)/128x FSAA in SLI Mode
Decode acceleration for MPEG-2, MPEG-4 Part 2 Advanced Simple Profile, H.264, HEVC, MVC, VC1, DivX (version 3.11 and later), and Flash (10.1 and later)
Dedicated H.264 & HEVC Encoder²
Blu-ray dual-stream hardware acceleration (supporting HD picture-in-picture playback)
NVIDIA GPU Boost (Automatically improves GPU engine throughput to maximize application performance)

² This feature requires implementation by software applications and is not a stand-alone utility. Please contact quadrohelp@nvidia.com for details on availability.

NVIDIA CUDA Parallel Processing Architecture

New RT (Ray Tracing) Core per SM
Turing SM Architecture (streaming multi-processor design that delivers greater processing efficiency)
Dynamic Parallelism (GPU dynamically spawns new threads without going back to the CPU)
Mixed-precision (1-, 4-, 8-, 16-, 32- and 64-bit) computing
API support includes: CUDA C, CUDA C++, DirectCompute 5.0, OpenCL, Java, Python, and Fortran
Configurable up to 96 KB of RAM (dedicated shared memory size per SM)

Advanced Display Features

Support for any combination of four connected displays
Four DisplayPort 1.4 outputs (supporting resolutions such as 3840 x 2160 @ 120 Hz, 5120x2880 @ 60Hz and 7680 x 4320 @ 60Hz)
DisplayPort to VGA, DisplayPort to DVI (single-link and dual-link) and DisplayPort to HDMI cables (resolution support based on dongle specifications)
HDR support over DisplayPort 1.4 (SMPTE 2084/2086, BT. 2020) (4K @ 60Hz 10b/12b HEVC Decode, 4K @ 60Hz 10b HEVC Encode)
HDCP 2.2 support over DisplayPort & HDMI connectors
12-bit internal display pipeline (hardware support for 12-bit scanout on supported panels, applications and connection)
NVIDIA® 3D Vision™ technology, 3D DLP, Interleaved, and other 3D stereo format support
Full OpenGL quad buffered stereo support
Underscan/overscan compensation and hardware scaling
NVIDIA® nView® multi-display technology
Support for large-scale, ultra-high resolution visualization using the NVIDIA® SVS platform, which includes NVIDIA® Mosaic, NVIDIA® Sync and NVIDIA® Warp/Blend technologies

DisplayPort and HDMI Digital Audio

Support for the following audio modes: Dolby Digital (AC3), DTS 5.1, Multi-channel (7.1) LPCM, Dolby Digital Plus (DD+), and MPEG-2/MPEG-4 AAC
DisplayPort data rates of 48 kHz
Word sizes of 16-bit, 20-bit, and 24-bit
HDMI digital audio data rates of 44.1 kHz, 48 kHz, 88.2 kHz, 96 kHz, 176.4 kHz, and 192 kHz

Product specifications and information on this web page may be revised without notice; the printing on the product color box shows the actual product information.
The above product specifications are for reference only; actual specifications depend on the real product, and Leadtek reserves the right to alter them. Products may differ by sales region; please contact your supplier to confirm the actual product information.
The adapters, cables and software listed on this web page are for reference only; Leadtek reserves the right to alter them, and revisions may be made without notice.
The above brand names and product names are trademarks of their respective companies.
NVIDIA RTX PRO, NVIDIA RTX Workstation GPU and NVIDIA Data Center GPU are designed, built and tested by NVIDIA.
