Author
Updated
15 Sep 2024Form Number
LP1773PDF size
19 pages, 602 KBAbstract
The NVIDIA A40 is a powerful data center GPU for visual computing, delivering high performance and capabilities to professionals for graphics-based workloads such as ray traced rendering, high-performance virtual workstations, simulation, 3D design, VR, and virtual production. The A40 GPU is a graphics-based virtualization solution for designers, engineers, scientists, and creatives that need this performance from anywhere in the world.
This product guide provides essential presales information to understand the NVIDIA A40 GPU and its key features, specifications, and compatibility. This guide is intended for technical specialists, sales specialists, sales engineers, IT architects, and other IT professionals who want to learn more about the NVIDIA A40 GPU and consider its use in IT solutions.
Change History
Changes in the September 15, 2023 update:
- Added the Controlled status column to Table 1 - Part number information section
Introduction
The NVIDIA A40 is a powerful data center GPU for visual computing, delivering high performance and capabilities to professionals for graphics-based workloads such as ray traced rendering, high-performance virtual workstations, simulation, 3D design, VR, and virtual production. The A40 GPU is a graphics-based virtualization solution for designers, engineers, scientists, and creatives that need this performance from anywhere in the world.
The third-generation Tensor Core technology supports a broad range of math precisions providing a unified workload accelerator for data analytics, AI training, AI inference, and HPC. Accelerating both scale-up and scale-out workloads on one platform enables elastic data centers that can dynamically adjust to shifting application workload demands. This simultaneously boosts throughput and drives down the cost of data centers.
Did you know?
The NVIDIA A40 GPU is a high-performance GPU that offers local DisplayPort video. It supports up to four 5K monitors at 60Hz, or dual 8K displays at 60Hz per card, using Display Stream Compression (DSC). The NVIDIA A40 supports HDR color for 4K at 60Hz for 10/12b HEVC decode and up to 4K at 60Hz for 10b HEVC encode. Each DisplayPort connector can drive ultra-high resolutions of 4096x2160 at 120 Hz with 30-bit color.
Part number information
The following table shows the part numbers for the NVIDIA A40 GPU.
The NVIDIA A40 GPU is Controlled which means the GPU is not offered in certain markets, as determined by the US Government.
The PCIe option part numbers includes the following:
- One NVIDIA A40 GPU with full-height (3U) adapter bracket attached
- Documentation
GPUs without a CEC chip: The NVIDIA A40 GPU is offered without a CEC chip (look for "w/o CEC" in the name). The CEC is a secondary Hardware Root of Trust (RoT) module that provides an additional layer of security, which can be used by customers who have high regulatory requirements or high security standards. NVIDIA uses a multi-layered security model and hence the protection offered by the primary Root of Trust embedded in the GPU is expected to be sufficient for most customers. The CEC defeatured products still offer Secure Boot, Secure Firmware Update, Firmware Rollback Protection, and In-Band Firmware Update Disable. Specifically, without the CEC chip, the GPU does not support Key Revocation or Firmware Attestation. CEC and non-CEC GPUs of the same type of GPU can be mixed in field upgrades.
Features
The ThinkSystem NVIDIA A40 48GB PCIe Gen4 Passive GPU offers the following features:
- NVIDIA Ampere Architecture
NVIDIA A40 is the world's most powerful data center GPU for visual computing, offering high performance real-time ray tracing, AI-accelerated compute, and professional graphics rendering. Building upon the major enhancements from the Turing GPU, the NVIDIA Ampere architecture enhances ray tracing operations, tensor matrix operations, and concurrent executions of FP32 and INT32 operations.
- CUDA Cores
The NVIDIA Ampere architecture’s CUDA cores bring up to 2X the single-precision floating point (FP32) throughput compared to the previous generation, providing significant performance improvements for graphics workflows such as 3D model development and compute for workloads such as desktop simulation for computer-aided engineering (CAE).
- 2nd Generation RT Cores
Incorporating 2nd generation ray tracing engines, the NVIDIA Ampere architecture provides incredible ray traced rendering performance. A single NVIDIA A40 board can render complex professional models with physically accurate shadows, reflections, and refractions to empower users with instant insight.
Working in concert with applications leveraging APIs such as NVIDIA OptiX, Microsoft DXR and Vulkan ray tracing, servers based on NVIDIA A40 will power truly interactive design workflows to provide immediate feedback for unprecedented levels of productivity. NVIDIA A40 is up to 2x faster in ray tracing compared to the previous generation. This technology also speeds up the rendering of ray-traced motion blur for faster results with greater visual accuracy.
- 3rd Generation Tensor Cores
Purpose-built for deep learning matrix arithmetic at the heart of neural network training and inferencing functions, the NVIDIA A40 includes enhanced Tensor Cores that accelerate more datatypes (TF32 and BF16) and includes a new Fine-Grained Structured Sparsity feature that delivers up to 2X throughput for tensor matrix operations compared to the previous generation.
- High Speed GDDR6 Memory
Built with 48GB GDDR6 memory, the NVIDIA A40 provides the industry’s largest graphics memory footprint to address the largest datasets and models in latency-sensitive professional applications.
- Error Correcting Code (ECC) on Graphics Memory
Meet strict data integrity requirements for mission critical applications with uncompromised computing accuracy and reliability.
- 5th Generation NVDEC Engine
NVDEC, implemented in concert with software applications, is well suited for transcoding and video playback applications for real-time decoding. The following video codecs are supported for hardware-accelerated decoding: MPEG-2, VC-1, H.264 (AVCHD), H.265 (HEVC), VP8, VP9, and AV1. Pairing this technology with NVIDIA Ampere architecture- based Tensor Cores, the A40 can quickly apply AI and inferencing to real-time video.
- 7th Generation NVENC Engine
NVENC can take on the most demanding 4K or 8K video encoding tasks to free up the graphics engine and the CPU for other operations. NVENC also enables virtual workstations to stream up to 8K content for high fidelity design and rendering workloads. In addition, the NVIDIA A40 provides better encoding quality than software-based x264 encoders.
- Preemption
Preemption at the instruction-level provides finer grain control over compute and graphics tasks to prevent longer-running applications from either monopolizing system resources or timing out.
- Multi-GPU technology - 3rd Generation NVIDIA NVLink
Connect two NVIDIA A40 cards with NVLink to double the effective memory footprint and scale application performance by enabling GPU-to-GPU data transfers at rates up to 112.5 GB/s (total bandwidth).
- DisplayPort 1.4a
Supports up to four 5K monitors at 60Hz, or dual 8K displays at 60Hz per card. The NVIDIA A40 supports HDR color for 4K at 60Hz for 10/12b HEVC decode and up to 4K at 60Hz for 10b HEVC encode. Each DisplayPort connector can drive ultra-high resolutions of 4096x2160 at 120 Hz with 30-bit color.
A40 is configured for virtualization by default with physical display connectors disabled. The display outputs can be enabled via management software tools. See Lenovo support tip HT512536 for details: https://datacentersupport.lenovo.com/us/en/solution/ht512536
DisplayPort 1.4a supports 8K at 60Hz over a single cable using Display Stream Compression (DSC). The A40 has 3 DisplayPort 1.4a connectors, DSC capable 8K displays are required for the A40 to drive two 8K displays.
- NVIDIA Quadro® Mosaic Technology
Transparently scale the desktop and applications across up to 12 displays from 4 GPUs while delivering full performance and image quality.
- NVIDIA Quadro Sync II
Synchronize the display and image output of up to 24 displays from 8 GPUs (connected through two Sync II boards) in a single system, reducing the number of machines needed to create an advanced video visualization environment. For more information, see the NVIDIA Quadro Sync page, https://www.nvidia.com/en-us/design-visualization/solutions/quadro-sync/
- Frame Lock Connector Latch
Each frame lock connector is designed with a self-locking retention mechanism to secure its connection with the frame lock cable to provide robust connectivity and maximum productivity.
Technical specifications
The following table lists the NVIDIA A40 GPU specifications.
* With structural sparsity enabled
** To enable the DisplayPort ports, see https://datacentersupport.lenovo.com/us/en/solution/ht512536
Server support
The following tables list the ThinkSystem servers that are compatible.
- DisplayPort ports not supported and are disabled
- Double-wide GPUs are only supported in the SE450 with the 360mm chassis; not supported in the 300mm chassis
- DisplayPort ports not supported and are disabled
- DisplayPort ports not supported and are disabled.
- Only available via Lenovo Scalable Infrastructure (LeSI). Select "AI & HPC – LeSI Solutions" in the DCSC configurator. See the LeSI product guide for details.
Operating system support
The following table lists the supported operating systems:
Tip: These tables are automatically generated based on data from Lenovo ServerProven.
1 Ubuntu 22.04.3 LTS/Ubuntu 22.04.4 LTS
2 ISG will not sell/preload this OS, but compatibility and cert only.
3 The OS is not supported with EPYC 7003 processors.
NVIDIA GPU software
This section lists the NVIDIA software that is available from Lenovo.
NVIDIA vGPU Software (vApps, vPC, RTX vWS)
Lenovo offers the following virtualization software for NVIDIA GPUs:
- Virtual Applications (vApps)
For organizations deploying Citrix XenApp, VMware Horizon RDSH or other RDSH solutions. Designed to deliver PC Windows applications at full performance. NVIDIA Virtual Applications allows users to access any Windows application at full performance on any device, anywhere. This edition is suited for users who would like to virtualize applications using XenApp or other RDSH solutions. Windows Server hosted RDSH desktops are also supported by vApps.
- Virtual PC (vPC)
This product is ideal for users who want a virtual desktop but need great user experience leveraging PC Windows® applications, browsers and high-definition video. NVIDIA Virtual PC delivers a native experience to users in a virtual environment, allowing them to run all their PC applications at full performance.
- NVIDIA RTX Virtual Workstation (RTX vWS)
NVIDIA RTX vWS is the only virtual workstation that supports NVIDIA RTX technology, bringing advanced features like ray tracing, AI-denoising, and Deep Learning Super Sampling (DLSS) to a virtual environment. Supporting the latest generation of NVIDIA GPUs unlocks the best performance possible, so designers and engineers can create their best work faster. IT can virtualize any application from the data center with an experience that is indistinguishable from a physical workstation — enabling workstation performance from any device.
The following license types are offered:
- Perpetual license
A non-expiring, permanent software license that can be used on a perpetual basis without the need to renew. Each Lenovo part number includes a fixed number of years of Support, Upgrade and Maintenance (SUMS).
- Annual subscription
A software license that is active for a fixed period as defined by the terms of the subscription license, typically yearly. The subscription includes Support, Upgrade and Maintenance (SUMS) for the duration of the license term.
- Concurrent User (CCU)
A method of counting licenses based on active user VMs. If the VM is active and the NVIDIA vGPU software is running, then this counts as one CCU. A vGPU CCU is independent of the connection to the VM.
The following table lists the ordering part numbers and feature codes.
NVIDIA Omniverse Software (OVE)
NVIDIA Omniverse™ Enterprise is an end-to-end collaboration and simulation platform that fundamentally transforms complex design workflows, creating a more harmonious environment for creative teams.
NVIDIA and Lenovo offer a robust, scalable solution for deploying Omniverse Enterprise, accommodating a wide range of professional needs. This document details the critical components, deployment options, and support available, ensuring an efficient and effective Omniverse experience.
Deployment options cater to varying team sizes and workloads. Using Lenovo NVIDIA-Certified Systems™ and Lenovo OVX nodes which are meticulously designed to manage scale and complexity, ensures optimal performance for Omniverse tasks.
Deployment options include:
- Workstations: NVIDIA-Certified Workstations with RTX 6000 Ada GPUs for desktop environments.
- Data Center Solutions: Deployment with Lenovo OVX nodes or NVIDIA-Certified Servers equipped with L40, L40S or A40 GPUs for centralized, high-capacity needs.
NVIDIA Omniverse Enterprise includes the following components and features:
- Platform Components: Kit, Connect, Nucleus, Simulation, RTX Renderer.
- Foundation Applications: USD Composer, USD Presenter.
- Omniverse Extensions: Connect Sample & SDK.
- Integrated Development Environment (IDE)
- Nucleus Configuration: Workstation, Enterprise Nucleus Server (supports up to 8 editors per scene); Self-Service Public Cloud Hosting using Containers.
- Omniverse Farm: Supports batch workloads up to 8 GPUs.
- Enterprise Services: Authentication (SSO/SSL), Navigator Microservice, Large File Transfer, User Accounts SAML/Account Directory.
- User Interface: Workstation & IT Managed Launcher.
- Support: NVIDIA Enterprise Support.
- Deployment Scenarios: Desktop to Data Center: Workstation deployment for building and designing, with options for physical or virtual desktops. For batch tasks, rendering, and SDG workloads that require headless compute, Lenovo OVX nodes are recommended.
The following part numbers are for a subscription license which is active for a fixed period as noted in the description. The license is for a named user which means the license is for named authorized users who may not re-assign or share the license with any other person.
NVIDIA AI Enterprise Software
Lenovo offers the NVIDIA AI Enterprise (NVAIE) cloud-native enterprise software. NVIDIA AI Enterprise is an end-to-end, cloud-native suite of AI and data analytics software, optimized, certified, and supported by NVIDIA to run on VMware vSphere and bare-metal with NVIDIA-Certified Systems™. It includes key enabling technologies from NVIDIA for rapid deployment, management, and scaling of AI workloads in the modern hybrid cloud.
NVIDIA AI Enterprise is licensed on a per-GPU basis. NVIDIA AI Enterprise products can be purchased as either a perpetual license with support services, or as an annual or multi-year subscription.
- The perpetual license provides the right to use the NVIDIA AI Enterprise software indefinitely, with no expiration. NVIDIA AI Enterprise with perpetual licenses must be purchased in conjunction with one-year, three-year, or five-year support services. A one-year support service is also available for renewals.
- The subscription offerings are an affordable option to allow IT departments to better manage the flexibility of license volumes. NVIDIA AI Enterprise software products with subscription includes support services for the duration of the software’s subscription license
The features of NVIDIA AI Enterprise Software are listed in the following table.
Note: Maximum 10 concurrent VMs per product license
The following table lists the ordering part numbers and feature codes.
Find more information in the NVIDIA AI Enterprise Sizing Guide.
NVIDIA HPC Compiler Software
Auxiliary power cables
The A40 option part number does not ship with auxiliary power cables. Cables are server-specific due to length requirements. For CTO orders, auxiliary power cables are derived by the configurator. For field upgrades, cables will need to be ordered separately as listed in the table below.
Regulatory approvals
The NVIDIA A40 GPU has the following regulatory approvals:
- RCM
- BSMI
- CE
- FCC
- ICES
- KCC
- cUL, UL
- VCCI
Operating environment
The NVIDIA A40 GPU has the following operating characteristics:
- Ambient temperature
- Operational: 0°C to 50°C (-5°C to 55°C for short term*)
- Storage: -40°C to 75°C
- Relative humidity:
- Operational: 5-85% (5-93% short term*)
- Storage: 5-95%
* A period not more than 96 hours consecutive, not to exceed 15 days per year.
Warranty
One year limited warranty. When installed in a Lenovo server, the GPU assumes the server’s base warranty and any warranty upgrades.
Related publications
For more information, refer to these documents:
- ThinkSystem and ThinkAgile GPU Summary:
https://lenovopress.lenovo.com/lp0768-thinksystem-thinkagile-gpu-summary - ServerProven compatibility:
https://serverproven.lenovo.com/ - NVIDIA A40 product page:
https://www.nvidia.com/en-us/data-center/a40/ - NVIDIA Ampere Architecture page
https://www.nvidia.com/en-us/data-center/ampere-architecture/
Trademarks
Lenovo and the Lenovo logo are trademarks or registered trademarks of Lenovo in the United States, other countries, or both. A current list of Lenovo trademarks is available on the Web at https://www.lenovo.com/us/en/legal/copytrade/.
The following terms are trademarks of Lenovo in the United States, other countries, or both:
Lenovo®
ServerProven®
ThinkAgile®
ThinkSystem®
The following terms are trademarks of other companies:
AMD is a trademark of Advanced Micro Devices, Inc.
Intel® and Xeon® are trademarks of Intel Corporation or its subsidiaries.
Linux® is the trademark of Linus Torvalds in the U.S. and other countries.
Microsoft®, DirectX®, Windows Server®, and Windows® are trademarks of Microsoft Corporation in the United States, other countries, or both.
Other company, product, or service names may be trademarks or service marks of others.
Configure and Buy
Full Change History
Changes in the September 15, 2023 update:
- Added the Controlled status column to Table 1 - Part number information section
Announced: January 19, 2021
Course Detail
Employees Only Content
The content in this document with a is only visible to employees who are logged in. Logon using your Lenovo ITcode and password via Lenovo single-signon (SSO).
The author of the document has determined that this content is classified as Lenovo Internal and should not be normally be made available to people who are not employees or contractors. This includes partners, customers, and competitors. The reasons may vary and you should reach out to the authors of the document for clarification, if needed. Be cautious about sharing this content with others as it may contain sensitive information.
Any visitor to the Lenovo Press web site who is not logged on will not be able to see this employee-only content. This content is excluded from search engine indexes and will not appear in any search results.
For all users, including logged-in employees, this employee-only content does not appear in the PDF version of this document.
This functionality is cookie based. The web site will normally remember your login state between browser sessions, however, if you clear cookies at the end of a session or work in an Incognito/Private browser window, then you will need to log in each time.
If you have any questions about this feature of the Lenovo Press web, please email David Watts at [email protected].