Cisco NVIDIA L40S Graphic Card - 48 GB GDDR6 - Full-height
Cisco NVIDIA L40S Graphic Card - 48 GB GDDR6 - Full-height
Unparalleled AI and graphics performance for the data center.
Generative AI is fueling transformative change, unlocking a new frontier of opportunities for enterprises across every industry. To transform with AI, enterprises need more compute resources, greater scale, and a broad set of capabilities to meet the demands of an ever-increasing set of diverse and complex workloads.
The NVIDIA L40S GPU is the most powerful universal GPU for the data center, delivering end-to-end acceleration for the next generation of AI-enabled applications-from gen AI, LLM inference, small-model training and fine-tuning to 3D graphics, rendering, and video applications.
Powered by the NVIDIA Ada Lovelace Architecture | Fourth-Generation Tensor Cores
Hardware support for structural sparsity and optimized TF32 format provides out-of-the-box performance gains for faster AI and data science model training. Accelerate AI-enhanced graphics capabilities with DLSS to upscale resolution with better performance in select applications.
Third-Generation RT Cores
Enhanced throughput and concurrent ray-tracing and shading capabilities improve ray-tracing performance, accelerating renders for product design and architecture, engineering, and construction workflows. See lifelike designs in action with hardware-accelerated motion blur and stunning real-time animations.
Transformer Engine
Transformer Engine dramatically accelerates AI performance and improves memory utilization for both training and inference. Harnessing the power of the Ada Lovelace fourth-generation Tensor Cores, Transformer Engine intelligently scans the layers of transformer architecture neural networks and automatically recasts between FP8 and FP16 precisions to deliver faster AI performance and accelerate training and inference.
Data Center Ready
The L40S GPU is optimized for 24/7 enterprise data center operations and designed, built, tested, and supported by NVIDIA to ensure maximum performance, durability, and uptime. The L40S GPU meets the latest data center standards, is Network Equipment-Building System (NEBS) Level 3 ready, and features secure boot with root of trust technology, providing an additional layer of security for data centers.
- Generative AI
- LLM inference
- LLM fine-tuning and small-model training
- NVIDIA Omniverse™ Enterprise
- Rendering and 3D graphics
- Streaming and video content
Product Information
Product Information
Shipping & Returns
Shipping & Returns

Cisco NVIDIA L40S Graphic Card - 48 GB GDDR6 - Full-height
Cisco NVIDIA L40S Graphic Card - 48 GB GDDR6 - Full-height
Cisco NVIDIA L40S Graphic Card - 48 GB GDDR6 - Full-height
Unparalleled AI and graphics performance for the data center.
Generative AI is fueling transformative change, unlocking a new frontier of opportunities for enterprises across every industry. To transform with AI, enterprises need more compute resources, greater scale, and a broad set of capabilities to meet the demands of an ever-increasing set of diverse and complex workloads.
The NVIDIA L40S GPU is the most powerful universal GPU for the data center, delivering end-to-end acceleration for the next generation of AI-enabled applications-from gen AI, LLM inference, small-model training and fine-tuning to 3D graphics, rendering, and video applications.
Powered by the NVIDIA Ada Lovelace Architecture | Fourth-Generation Tensor Cores
Hardware support for structural sparsity and optimized TF32 format provides out-of-the-box performance gains for faster AI and data science model training. Accelerate AI-enhanced graphics capabilities with DLSS to upscale resolution with better performance in select applications.
Third-Generation RT Cores
Enhanced throughput and concurrent ray-tracing and shading capabilities improve ray-tracing performance, accelerating renders for product design and architecture, engineering, and construction workflows. See lifelike designs in action with hardware-accelerated motion blur and stunning real-time animations.
Transformer Engine
Transformer Engine dramatically accelerates AI performance and improves memory utilization for both training and inference. Harnessing the power of the Ada Lovelace fourth-generation Tensor Cores, Transformer Engine intelligently scans the layers of transformer architecture neural networks and automatically recasts between FP8 and FP16 precisions to deliver faster AI performance and accelerate training and inference.
Data Center Ready
The L40S GPU is optimized for 24/7 enterprise data center operations and designed, built, tested, and supported by NVIDIA to ensure maximum performance, durability, and uptime. The L40S GPU meets the latest data center standards, is Network Equipment-Building System (NEBS) Level 3 ready, and features secure boot with root of trust technology, providing an additional layer of security for data centers.
- Generative AI
- LLM inference
- LLM fine-tuning and small-model training
- NVIDIA Omniverse™ Enterprise
- Rendering and 3D graphics
- Streaming and video content
Original: $25,262.99
-70%$25,262.99
$7,578.90Product Information
Product Information
Shipping & Returns
Shipping & Returns
Description
Cisco NVIDIA L40S Graphic Card - 48 GB GDDR6 - Full-height
Unparalleled AI and graphics performance for the data center.
Generative AI is fueling transformative change, unlocking a new frontier of opportunities for enterprises across every industry. To transform with AI, enterprises need more compute resources, greater scale, and a broad set of capabilities to meet the demands of an ever-increasing set of diverse and complex workloads.
The NVIDIA L40S GPU is the most powerful universal GPU for the data center, delivering end-to-end acceleration for the next generation of AI-enabled applications-from gen AI, LLM inference, small-model training and fine-tuning to 3D graphics, rendering, and video applications.
Powered by the NVIDIA Ada Lovelace Architecture | Fourth-Generation Tensor Cores
Hardware support for structural sparsity and optimized TF32 format provides out-of-the-box performance gains for faster AI and data science model training. Accelerate AI-enhanced graphics capabilities with DLSS to upscale resolution with better performance in select applications.
Third-Generation RT Cores
Enhanced throughput and concurrent ray-tracing and shading capabilities improve ray-tracing performance, accelerating renders for product design and architecture, engineering, and construction workflows. See lifelike designs in action with hardware-accelerated motion blur and stunning real-time animations.
Transformer Engine
Transformer Engine dramatically accelerates AI performance and improves memory utilization for both training and inference. Harnessing the power of the Ada Lovelace fourth-generation Tensor Cores, Transformer Engine intelligently scans the layers of transformer architecture neural networks and automatically recasts between FP8 and FP16 precisions to deliver faster AI performance and accelerate training and inference.
Data Center Ready
The L40S GPU is optimized for 24/7 enterprise data center operations and designed, built, tested, and supported by NVIDIA to ensure maximum performance, durability, and uptime. The L40S GPU meets the latest data center standards, is Network Equipment-Building System (NEBS) Level 3 ready, and features secure boot with root of trust technology, providing an additional layer of security for data centers.
- Generative AI
- LLM inference
- LLM fine-tuning and small-model training
- NVIDIA Omniverse™ Enterprise
- Rendering and 3D graphics
- Streaming and video content























