Product images are provided for reference and may not represent the exact model, configuration, or included components.

Overview

SKU: 4X67A90669
UPC: 889488712168
Condition: New

Lenovo 4X67A90669 L40S 48G PCIE GEN4 PAS

$57,085.99
Ships same business day
In stock

Compatibility guidance available for your deployment
Senior specialists for pre and post-sales support
Authorized sourcing and documentation support
Shipping and lead-time confirmation before install

Laura Bennett, IPSD Senior Specialist

200+ hrs training • U.S.-based

Senior Specialist • 877-277-7147

No Bots, Just Experts

Questions about this product? Free pre-sales support from a senior specialist — product questions, compatibility checks, BOM quotes, price confirmation — typically answered within one business day. Need camera placement or system design work? Engineering time is $175 per hour (qty 1 = 1 hour). Hardware buyers get up to one hour ($175) credited back on their order.

Description

Lenovo 4X67A90669 NVIDIA L40S 48GB PCIe Gen4 GPU Accelerator

Overview

The Lenovo 4X67A90669 is a professional-grade NVIDIA L40S GPU accelerator with 48 GB of on-board GDDR6 memory in a full-height, full-length, passively cooled form factor, built for dense rack deployments where airflow is managed at the enclosure level rather than per card. If you are provisioning compute for AI inference, video transcoding at scale, or GPU-accelerated workloads inside a Lenovo ThinkSystem server, 48 GB of memory relieves the capacity limit that forces batching compromises on smaller-memory cards. The card connects via a PCIe Gen4 x16 interface, which provides the bus bandwidth needed to keep the GPU fed under sustained throughput loads. Check your server's PCIe slot assignment before ordering: a slot wired for Gen4 x16 is required to realize full host-interface throughput. This Lenovo GPU option fits into the broader Lenovo server and compute catalog for integrators already standardized on ThinkSystem infrastructure.

Key Features

  • 48 GB GDDR6 with 864 GB/s Bandwidth: At 864 GB/s memory bandwidth, the L40S handles large model inference and multi-stream video analytics without throttling at the memory bus — a direct constraint for LLM inference and high-channel AI video workloads where smaller-VRAM cards force pipeline compromises.
  • PCIe Gen4 x16 Interface: Gen4 x16 doubles the host interface bandwidth versus Gen3, keeping the CPU-to-GPU transfer path from becoming the bottleneck during data-intensive batch inference or transcoding. Verify your server's slot configuration supports Gen4 electrical signaling — the card will operate at Gen3 speeds in a Gen3 slot but you lose measurable throughput.
  • Passive Cooling: No onboard fans means no fan-failure point and no per-card acoustic contribution. Passive cooling is standard for dense AI server deployments where chassis airflow (front-to-rear) handles thermal dissipation. This is not suitable for an open-air workstation or workbench — it requires a properly configured 1U/2U server chassis with adequate CFM across the card.
  • Full-Height / Full-Length, Dual-Slot Form Factor: The FH/FL 2-slot form factor is standard for enterprise server configurations. Confirm your chassis has double-wide PCIe slot clearance and that neighboring cards won't be displaced — particularly relevant in high-density 4-GPU or 8-GPU configurations.
  • Four DisplayPort 1.4a Outputs: Four DP 1.4a outputs support high-resolution display walls or multi-monitor visualization workstations up to 8K per output. For pure compute deployments these ports go unused, but they add flexibility if the card is ever repurposed for visualization or simulation environments.
  • DirectX 12 Ultimate, Shader Model 6.6, OpenGL 4.6, OpenCL 3.0: Full modern API coverage means the card integrates with simulation, visualization, and compute frameworks without compatibility gaps. OpenCL 3.0 support is relevant for cross-vendor compute pipelines; DirectX 12 Ultimate and Shader Model 6.6 cover graphics-side workloads if the deployment spans both rendering and AI inference.
  • CUDA Support: CUDA enablement is the baseline requirement for NVIDIA-ecosystem AI frameworks — PyTorch, TensorFlow, TensorRT, and the NVIDIA AI Enterprise stack all depend on it. Without CUDA, the GPU acceleration story for most enterprise AI workloads collapses entirely.
  • Multi-Certification Coverage: The 4X67A90669 carries RCM, BSMI, CE, FCC, ICES, KCC, VCCI, and cUL/UL certifications, covering North America, Europe, Australia, Taiwan, Japan, and Korea. Enterprise procurement teams dealing with multi-region deployments can clear regulatory approval without sourcing region-specific SKUs.

Integration and Compatibility

This GPU card is designed for installation in Lenovo ThinkSystem servers that support PCIe Gen4 x16 full-height, full-length double-slot adapters. Before specifying this card into a server bill of materials, confirm three things: the target server's PCIe slot count and physical dimensions, the chassis airflow specification (passive cooling requires sufficient CFM across the card surface), and the server's total GPU slot power budget. As a PCIe GPU accelerator, the 4X67A90669 integrates with NVIDIA's CUDA ecosystem and supports frameworks commonly used in AI inference, professional visualization, and high-performance compute environments. The DisplayPort 1.4a outputs support connection to high-resolution displays for visualization-adjacent deployments. Review your server component compatibility matrix before finalizing the configuration — Lenovo's ServerProven program is the authoritative reference for validated server/GPU combinations. For storage and memory pairing in AI inference racks, also review enterprise storage options to ensure the data pipeline keeps pace with GPU throughput.
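
One way to confirm after installation that a card actually negotiated a Gen4 x16 link is to query the driver. The sketch below is a minimal example, assuming the NVIDIA driver and its `nvidia-smi` utility are present on the host. The helper names are illustrative, and the `SAMPLE` text is hypothetical rather than captured from real hardware.

```python
import csv
import io
import subprocess

# Query fields listed by `nvidia-smi --help-query-gpu`.
QUERY = (
    "name,pcie.link.gen.current,pcie.link.gen.max,"
    "pcie.link.width.current,pcie.link.width.max"
)


def read_link_report() -> str:
    """Run nvidia-smi on the host and return its CSV output (needs the NVIDIA driver)."""
    result = subprocess.run(
        ["nvidia-smi", f"--query-gpu={QUERY}", "--format=csv,noheader,nounits"],
        check=True,
        capture_output=True,
        text=True,
    )
    return result.stdout


def parse_link_report(text: str) -> list[dict]:
    """Convert each CSV row to a dict and flag links running below Gen4 x16."""
    rows = []
    for fields in csv.reader(io.StringIO(text), skipinitialspace=True):
        if not fields:
            continue  # skip blank lines
        name, gen_cur, gen_max, width_cur, width_max = (f.strip() for f in fields)
        rows.append({
            "name": name,
            "gen_current": int(gen_cur),
            "gen_max": int(gen_max),
            "width_current": int(width_cur),
            "width_max": int(width_max),
            "full_gen4_x16": int(gen_cur) >= 4 and int(width_cur) >= 16,
        })
    return rows


# Hypothetical sample: one card at Gen4 x16, one that trained down to Gen3.
SAMPLE = "NVIDIA L40S, 4, 4, 16, 16\nNVIDIA L40S, 3, 4, 16, 16\n"

if __name__ == "__main__":
    # On a real host, replace SAMPLE with read_link_report().
    for row in parse_link_report(SAMPLE):
        print(row)
```

An idle GPU can report a lower current link generation because of link power management, so it is worth reading these values while the card is under load.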

Frequently Asked Questions

Q: What server platforms is the Lenovo 4X67A90669 compatible with?

A: The 4X67A90669 is designed for Lenovo ThinkSystem servers that support a PCIe Gen4 x16 full-height, full-length double-slot GPU adapter. Consult Lenovo's ServerProven compatibility database with your specific server model to confirm support before ordering.

Q: Does the L40S require active cooling from the server chassis?

A: Yes. The 4X67A90669 uses passive cooling — there are no onboard fans. The card relies entirely on chassis airflow (front-to-rear server ventilation) for thermal management. It is not suitable for open-air or workstation enclosures that lack directed airflow across the PCIe slot area.

Q: Will a PCIe Gen3 slot work with this card?

A: The card is backward compatible with PCIe Gen3 slots, but the link will run at Gen3 speed, roughly half the host-interface bandwidth of Gen4. On-card memory bandwidth is unaffected, so the penalty falls on workloads that move large volumes of data between host and GPU. Gen4 x16 is the recommended configuration.
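
The size of the Gen3 penalty can be estimated from the published PCIe signaling rates. The following is a back-of-the-envelope sketch: it computes theoretical one-direction bandwidth only, ignores protocol overhead, and uses illustrative function names. Real transfer rates will be lower.

```python
# Per-lane signaling rate in GT/s for PCIe 3.0 and 4.0.
GT_PER_LANE = {3: 8.0, 4: 16.0}
ENCODING = 128 / 130  # 128b/130b line encoding used by both generations


def pcie_gb_per_s(gen: int, lanes: int = 16) -> float:
    """Theoretical one-direction bandwidth in GB/s, before protocol overhead."""
    return GT_PER_LANE[gen] * ENCODING * lanes / 8  # divide by 8 bits per byte


def transfer_seconds(size_gb: float, gen: int, lanes: int = 16) -> float:
    """Best-case time to move size_gb across the link in one direction."""
    return size_gb / pcie_gb_per_s(gen, lanes)


if __name__ == "__main__":
    for gen in (3, 4):
        print(f"Gen{gen} x16: {pcie_gb_per_s(gen):.2f} GB/s, "
              f"48 GB load takes at least {transfer_seconds(48, gen):.2f} s")
```

Under these assumptions a Gen3 x16 link tops out near 15.75 GB/s and a Gen4 x16 link near 31.5 GB/s, so filling the full 48 GB takes at least about 3 seconds on Gen3 versus about 1.5 seconds on Gen4.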

Q: What certifications does the 4X67A90669 carry?

A: The card is certified for RCM (Australia), BSMI (Taiwan), CE (Europe), FCC and ICES (North America), KCC (Korea), VCCI (Japan), and cUL/UL. This covers most major enterprise procurement regions without requiring a region-specific SKU variant.

Q: Is the NVIDIA L40S suitable for AI inference workloads?

A: The L40S with 48 GB GDDR6 and 864 GB/s memory bandwidth is a strong fit for large-model AI inference, multi-stream video analytics, and GPU-accelerated compute. CUDA support is confirmed, which is the baseline requirement for NVIDIA-ecosystem AI frameworks including TensorRT and the NVIDIA AI Enterprise stack.
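
Whether a given model fits in 48 GB can be roughed out from its parameter count and numeric precision. This sketch assumes weights dominate memory use and reserves a hypothetical 20% of memory for KV cache and activations. That headroom figure is an illustrative assumption, not a Lenovo or NVIDIA number, and the function names are made up for the example.

```python
VRAM_GB = 48.0  # memory capacity from the spec sheet

# Bytes needed to store one parameter at each precision.
BYTES_PER_PARAM = {"fp16": 2.0, "int8": 1.0, "int4": 0.5}


def weights_gb(params_billion: float, precision: str) -> float:
    """Approximate weight footprint in GB (1e9 params times bytes per param)."""
    return params_billion * BYTES_PER_PARAM[precision]


def fits_on_card(params_billion: float, precision: str,
                 headroom_fraction: float = 0.2) -> bool:
    """True if the weights fit after reserving headroom (assumed, not measured)."""
    budget_gb = VRAM_GB * (1.0 - headroom_fraction)
    return weights_gb(params_billion, precision) <= budget_gb


if __name__ == "__main__":
    for params, precision in [(13, "fp16"), (70, "fp16"), (70, "int8"), (70, "int4")]:
        verdict = "fits" if fits_on_card(params, precision) else "does not fit"
        print(f"{params}B @ {precision}: {weights_gb(params, precision):.0f} GB, {verdict}")
```

By this estimate a 13B model fits at FP16, while a 70B model needs 4-bit quantization or more than one card.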

Q: How many display outputs does this card provide?

A: The card provides four DisplayPort 1.4a outputs, supporting high-resolution display configurations up to 8K per output. In pure compute deployments these outputs are typically unused, but they add flexibility for visualization or simulation use cases.

James Everett

The 4X67A90669 is a card I'd spec specifically into rack-dense AI inference builds where you need maximum VRAM per slot and can't afford fan-failure risk on a production inference node — the 48 GB GDDR6 capacity and passive cooling profile make it the right answer in that narrow but important scenario. The 864 GB/s memory bandwidth is what separates the L40S from lower-tier data center GPUs when you're running large transformer models that don't fit in 24 GB cards and need to sustain throughput across concurrent inference requests.

Technical Highlights:

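  • 864 GB/s Memory Bandwidth: At this bandwidth ceiling, the card is less prone to the memory-bus stalls that bottleneck lower-bandwidth cards on large-batch inference. The difference is measurable in tokens per second on large models. Note that a 70B-parameter model needs about 140 GB for FP16 weights, so it fits in 48 GB only when quantized or split across multiple cards.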
  • PCIe Gen4 x16 Interface: Gen4 doubles the host-to-device transfer bandwidth versus Gen3 — relevant when streaming large datasets to the GPU for training-adjacent fine-tuning workloads, less critical for static-model inference but still the right baseline.
  • Passive Cooling, Dual-Slot FH/FL: No fan failure modes and no per-card acoustic load — in a 4-GPU or 8-GPU server bay, that adds up. The trade-off is absolute dependency on chassis airflow being correctly configured and maintained.
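
The link between memory bandwidth and tokens per second can be made concrete with a simple ceiling estimate. The sketch below assumes single-stream decoding in which every generated token requires reading all model weights from GPU memory once. It ignores KV-cache traffic, compute time, and kernel efficiency, so it gives an upper bound rather than a prediction. The function name is illustrative.

```python
MEM_BW_GB_S = 864.0  # peak memory bandwidth from the spec sheet


def decode_tokens_per_s_ceiling(weights_gb: float,
                                bw_gb_s: float = MEM_BW_GB_S) -> float:
    """Upper bound on single-stream decode rate when weight reads dominate."""
    return bw_gb_s / weights_gb


if __name__ == "__main__":
    # Weight sizes in GB: 13B at FP16 is about 26 GB, 70B at 4-bit is about 35 GB.
    for label, size_gb in [("13B fp16", 26.0), ("70B int4", 35.0)]:
        print(f"{label}: at most {decode_tokens_per_s_ceiling(size_gb):.1f} tokens/s")
```

Batching several requests together amortizes each weight read across more tokens, which is why aggregate throughput on concurrent inference can exceed this single-stream ceiling.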

Deployment Considerations:

  • Passive cooling is a hard requirement on the chassis side — verify your server's CFM specification and airflow path before installing. A server chassis running at reduced fan speed or with blocked airflow will thermal-throttle this card under sustained load.
  • The FH/FL dual-slot form factor means it physically displaces two PCIe slot positions. In servers with 4 or more GPU bays, confirm physical slot spacing accommodates this before finalizing the BOM.

For AI inference racks running large language models or multi-stream GPU video analytics — particularly in Lenovo ThinkSystem environments where chassis airflow is engineered for passive GPU cards — the 4X67A90669 is the right specification choice over active-cooled alternatives that introduce fan-failure risk into production nodes.

Specifications
Weight: 2.00 lb
Interface: PCIe
UNSPSC Code: 43201401
CUDA: Yes
Graphics processor family: NVIDIA
Graphics processor: L40S
Discrete graphics card memory: 48 GB
Graphics card memory type: GDDR6
Memory bandwidth (max): 864 GB/s
Interface type: PCI Express x16 4.0
DisplayPorts quantity: 4
DisplayPort version: 1.4a
DirectX version: 12 Ultimate
Shader model version: 6.6
OpenGL version: 4.6
OpenCL version: 3.0
Cooling type: Passive
Form factor: Full-Height/Full-Length (FH/FL)
Bracket height: Full-Height (FH)
Number of slots: 2
Certification: RCM, BSMI, CE, FCC, ICES, KCC, cUL/UL, VCCI