The NVIDIA A40 GPU is an evolutionary leap in performance and multi-workload capabilities from the data center, combining best-in-class professional graphics with powerful compute and AI acceleration to meet today’s design, creative, and scientific challenges. Driving the next generation of virtual workstations and server-based workloads, NVIDIA A40 brings state-of-the-art features for ray-traced rendering, simulation, virtual production, and more to professionals anytime, anywhere.

View NVIDIA A40 Data Sheet

Powered by the NVIDIA Ampere Architecture

NVIDIA Ampere Architecture CUDA® Cores

Double-speed processing for single-precision floating point (FP32) operations and improved power efficiency provide significant performance improvements for graphics and simulation workflows, such as complex 3D computer-aided design (CAD) and computer-aided engineering (CAE).

Second-Generation RT Cores

With up to 2X the throughput over the previous generation and the ability to concurrently run ray tracing with either shading or denoising capabilities, second-generation RT Cores deliver massive speedups for workloads like photorealistic rendering of movie content, architectural design evaluations, and virtual prototyping of product designs. This technology also speeds up the rendering of ray-traced motion blur for faster results with greater visual accuracy.

Third-Generation Tensor Cores

New Tensor Float 32 (TF32) precision provides up to 5X the training throughput over the previous generation to accelerate AI and data science model training without requiring any code changes. Hardware support for structural sparsity doubles the throughput for inferencing. Tensor Cores also bring AI to graphics with capabilities like DLSS, AI denoising, and enhanced editing for select applications.

48GB of GPU Memory

Ultra-fast GDDR6 memory, scalable up to 96GB with NVLink, gives data scientists, engineers, and creative professionals the large memory necessary to work with massive datasets and workloads like data science and simulation.

Third-Generation NVIDIA NVLink®

Connect two A40 GPUs together to scale from 48GB of GPU memory to 96GB. Increased GPU-to-GPU interconnect bandwidth provides a single scalable memory to accelerate graphics and compute workloads and tackle larger datasets. A new, more compact NVLink connector enables functionality in a wider range of servers.

Virtualization-Ready

Next-generation improvements with NVIDIA virtual GPU (vGPU) software allow for larger, more powerful virtual workstation instances for remote users, enabling high-end remote design, AI, and compute workloads.

PCI Express Gen 4

PCI Express Gen 4 doubles the bandwidth of PCIe Gen 3, improving data-transfer speeds from CPU memory for data-intensive tasks like AI, data science, and 3D design. Faster PCIe performance also accelerates GPU direct memory access (DMA) transfers, providing faster I/O communication of video data between the GPU and GPUDirect® for Video-enabled devices delivering a powerful solution for live broadcast. A40 is backwards compatible with PCI Express Gen 3 for deployment flexibility.

Data Center Efficiency and Security

Featuring a dual-slot, power efficient design, NVIDIA A40 is up to 2X as power efficient as the previous generation that is validated with a wide range of NVIDIA-Certified systems from worldwide OEMs. The NVIDIA A40 also provides secure and measured boot with hardware root of trust capability, ensuring that firmware has not been tampered with or corrupted.

Key applications

Virtual Desktop Infrastructure (VDI)

The NVIDIA A40 GPU represents a significant advancement in Virtual Desktop Infrastructure (VDI), offering robust capabilities tailored to the demands of modern remote work environments. Built on NVIDIA’s Ampere architecture, the A40 integrates a large number of CUDA cores and advanced graphical features to deliver exceptional performance and visual fidelity for virtual desktops. It supports multiple concurrent users, easily accessing high-resolution graphics and compute-intensive applications, making it an ideal solution for industries such as finance, healthcare, and design, where seamless remote collaboration and secure data processing are crucial.
The efficient encoding and decoding capabilities of the A40 ensure smooth streaming and user responsiveness, enhancing the productivity and flexibility of remote workers. Integration with NVIDIA virtualization technologies, including NVIDIA GRID and VMware vSphere, enables effective resource allocation and management, optimizing IT infrastructure and reducing operational costs. Overall, the NVIDIA A40 GPU sets a new standard for VDI solutions, allowing organizations to deliver high-performance virtual desktops that can compete with traditional workstation setups while maintaining security and scalability in distributed work environments.

High-Performance Computing (HPC)

The NVIDIA A40 GPU is a powerful solution designed to elevate high-performance computing (HPC) environments to new levels of performance and efficiency. Built on NVIDIA’s Ampere architecture, the A40 features a significant number of CUDA cores and Tensor cores optimized for handling complex scientific simulations, computational fluid dynamics, molecular modeling, and other data-intensive HPC workloads with unmatched speed and accuracy.
High memory bandwidth and support for NVLink technology enable seamless data communication and efficient scaling across multiple GPUs, allowing researchers and scientists to effectively tackle larger and more complex computational problems. The A40’s ability to perform mixed-precision calculations enhances computational performance without sacrificing accuracy, making it an ideal choice for accelerating AI and machine learning tasks within HPC workflows.
Integrated with NVIDIA’s parallel computing platform and CUDA libraries, the A40 simplifies the development and deployment of optimized software solutions, enabling organizations to make breakthroughs in scientific research, engineering simulations, and beyond. In summary, the NVIDIA A40 GPU sets a new standard for performance and scalability in HPC, providing the computational power necessary to drive innovation and discoveries across various fields of science and industry.

Deep Learning and AI (Artificial Intelligence)

The NVIDIA A40 GPU excels in accelerating deep learning and artificial intelligence applications, leveraging the cutting-edge Ampere architecture to deliver unparalleled performance and efficiency. Specifically designed for AI workloads, the A40 boasts a high density of CUDA cores and Tensor cores, enabling the processing of vast amounts of data and complex neural network models with exceptional speed and precision.
This makes it ideal for training large-scale AI models, performing inference tasks, and conducting groundbreaking research in areas such as natural language processing, computer vision, and autonomous systems. The A40’s support for mixed-precision computations further optimizes performance, balancing calculation accuracy with efficiency to expedite the training and inference processes.
Integrated with NVIDIA’s comprehensive software ecosystem, including CUDA, cuDNN, and TensorRT, the A40 streamlines the development and deployment of AI applications, reducing time to insight and enhancing the productivity of researchers and data analysts. Overall, the NVIDIA A40 GPU sets a new standard for deep learning and artificial intelligence acceleration, enabling organizations to unlock new opportunities for innovation and discovery.

High-End Rendering

The NVIDIA A40 graphics processor is a powerful solution for advanced rendering applications, leveraging the cutting-edge Ampere architecture to deliver exceptional performance and efficiency. Designed specifically for demanding rendering workloads in industries such as media and entertainment, architecture, and automotive design, the A40 features a robust array of CUDA Cores and Tensor Cores. This configuration enables the handling of complex 3D rendering tasks, real-time ray tracing, and AI-assisted graphics with remarkable speed and precision.
The A40 supports NVIDIA’s RTX technology, allowing for photorealistic rendering and real-time simulation of lighting, shadows, and reflections, streamlining creative workflows and reducing time-to-market for digital content creators and designers. Its high memory bandwidth ensures smooth handling of large datasets and intricate visual details, while compatibility with professional NVIDIA software tools, such as RTX Renderer and Omniverse, simplifies integration with existing pipelines.
Overall, the NVIDIA A40 graphics processor redefines high-end rendering capabilities, offering unmatched performance and fidelity, empowering professionals to create stunning visual experiences and push the boundaries of digital content creation.

Specifications

GPU Memory48 GB GDDR6 with error-correcting code (ECC)
GPU Memory Bandwidth696 GB/s
InterconnectącznośćNVIDIA NVLink 112.5 GB/s (bidirectional)
PCIe Gen4: 64GB/s
NVLink2-way low profile (2-slot)
Display Ports3x DisplayPort 1.4*
Max Power Consumption300 W
Form Factor4.4″ (H) x 10.5″ (L) Dual Slot
ThermalPassive
vGPU Software SupportNVIDIA Virtual PC, NVIDIA Virtual Applications, NVIDIA RTX Virtual Workstation, NVIDIA Virtual Compute Server, NVIDIA AI Enterprise
vGPU Profiles SupportedSee the Virtual GPU Licensing Guide
NVENC | NVDEC1x | 2x (includes AV1 decode)
Secure and Measured Boot with Hardware Root of TrustYes (optional)ak (opcjonalnie)
NEBS ReadyLevel 3
Power Connector8-pin CPU

* A40 is configured for virtualization by default with physical display connectors disabled. The display outputs can be enabled via management software tools.

Dmensions of the NVIDIA A40 GPU:

Professional Features

NVIDIA A40 combines the performance and features necessary for large-scale display experiences, VR, broadcast-grade streaming, and more.

Multi-Display Technology

Drive massive cave automatic virtual environments (CAVEs), video walls, virtual sets and broadcast, and location-based entertainment deployments with support for multiple 8K monitors, NVIDIA Mosaic multi-display technology with bezel correction, and NVIDIA’s Warp and Blend SDK.

Quadro Sync

Synchronize multiple NVIDIA A40 GPUs with displays or projectors to create large-scale visualizations with NVIDIA Quadro Sync technology.

Video Encode and Decode

With dedicated video encoder (NVENC) and decoder engines (NVDEC), access the performance needed to work with multiple streams simultaneously, export video faster, and use multi-stream video applications for broadcast, security, and video serving.

Immersive VR

Power the most immersive augmented reality (AR) and virtual reality (VR) experiences on the highest-resolution head-mounted displays (HMDs) with accelerated graphics and increased display bandwidth. Four-way VR SLI enables peak performance, assigning 2 NVLink connected GPUs to each eye.

Enterprise Drivers

Virtual workstations powered by Quadro Virtual Data Center Workstation (Quadro vDWS) software leverage the same Quadro platform as physical workstations, benefiting from extensive testing across a broad range of industry applications and certifications from over 100 independent software vendors (ISVs) to ensure optimal performance and stability.

EGX Platform for Professional Visualization

The NVIDIA A40, along with NVIDIA vGPU software, is at the heart of the next-generation NVIDIA EGX™ platform certified by NVIDIA for professional visualization, delivering the performance and features that can power professional graphics and computing anywhere.

LEARN MORE

The NVIDIA A40 is Available Now

Contact the Sales Department