Lenovo
SKU: 4X67B09287
Overview
Manufacturer-verified compatible cameras, recorders, mounts, accessories, and licenses for this product. Adjust quantities and add the entire bundle to your cart in one click.
Overview
Questions about this product? Free pre-sales support from a senior specialist — product questions, compatibility checks, BOM quotes, price confirmation — typically answered within one business day. Need camera placement or system design work? Engineering time is $175 per hour (qty 1 = 1 hour). Hardware buyers get up to one hour ($175) credited back on their order.
The HPE S6A73C is a full-fat professional GPU based on NVIDIA's Blackwell architecture — 96 GB of GDDR7 memory, 24,064 CUDA parallel processing cores, and a 512-bit memory interface delivering 1,597 GB/s of bandwidth. This is not a workstation card trimmed for cost; it is the RTX PRO 6000, positioned for demanding AI inferencing, large-model training runs, simulation, and 3D rendering workloads where GPU memory capacity and raw FP32 throughput are the binding constraints. If you are sizing a server for LLM inference, a visualization node, or a GPU-dense AI appliance and 96 GB of on-card memory matters, the S6A73C is the configuration to evaluate.
The S6A73C installs via PCIe and is positioned within the HPE server GPU portfolio for high-memory AI and professional visualization use cases. It carries an UNSPSC code of 43201401 (graphics cards/accelerators), consistent with procurement classification in enterprise hardware catalogs. Country of origin is listed as CN, TW, VN — relevant for procurement teams with TAA or NDAA supply-chain requirements who need to verify current HPE certification status independently through official channels.
For deployments targeting AI inference at scale, pair this card with sufficient system memory and a high-core-count CPU that can feed the PCIe bus without creating a host-side bottleneck. NVMe storage bandwidth is also worth sizing carefully when model weights exceed GPU memory and paging is unavoidable.
Buyers integrating this card into an AI compute cluster or GPU server build should also evaluate high-throughput networking to keep the data pipeline matched to the GPU's throughput ceiling. For reference on broader HPE compute and GPU options, see the full HPE catalog. If you are building an AI inferencing rack rather than a single-node workstation, compare this against other GPU accelerator configurations for total memory-per-rack economics.
Q: What is the GPU memory configuration on the HPE S6A73C?
A: The S6A73C is equipped with 96 GB of GDDR7 memory on a 512-bit memory interface, delivering 1,597 GB/s of memory bandwidth. This is the full RTX PRO 6000 memory configuration — not a reduced-capacity variant.
Q: What PCIe slot does the S6A73C require?
A: The S6A73C uses a standard PCIe interface. Verify slot physical length, auxiliary power connector availability, and chassis airflow clearance against your target server platform before ordering — a card of this class typically requires a full-length slot and dedicated auxiliary power.
Q: Is the S6A73C suitable for large language model inference?
A: The 96 GB GDDR7 frame buffer is a primary reason buyers specify this card for LLM inference. Larger on-card memory allows bigger models (or larger batches) to reside entirely on the GPU, avoiding the latency penalty of weight paging. The 1,597 GB/s bandwidth supports high token throughput on quantized workloads at FP4 (4 PFLOPS) or FP8 (2 PFLOPS) precision.
Q: Does the S6A73C support hardware ray tracing?
A: Yes. The card includes 188 fourth-generation RT Cores with peak ray tracing performance of 355 TFLOPS, suitable for real-time photorealistic rendering in architectural visualization, digital twin, and virtual production workflows.
Q: What is the weight of the S6A73C and are there chassis installation considerations?
A: The card weighs 2.87 lb. In server deployments, especially rack-mounted 1U or 2U chassis, verify GPU retention bracket and riser arm load ratings. Card sag or inadequate retention can cause PCIe contact issues over time in high-vibration data center environments.
Q: What compute precision tiers does the S6A73C support?
A: The S6A73C supports FP4 (4 PFLOPS), FP8 (2 PFLOPS), FP16/BF16 (1 PFLOP), TF32 (234 TFLOPS), and FP32 (120 TFLOPS) via its Tensor Core and CUDA core stack. Choose precision based on your workload tolerance for numerical accuracy vs. throughput — FP4 maximizes inference speed; FP32 is for workloads requiring full single-precision compute.

The number that defines the S6A73C for production AI deployments is 1,597 GB/s — that is the memory bandwidth this card delivers via its 512-bit GDDR7 interface, and it is the figure that separates GPU platforms that can sustain high-throughput inference batches from those that stall waiting on memory reads. When I am specifying a GPU for a 70B-class language model running at FP8 precision, memory bandwidth is the first number I check, and 1,597 GB/s is competitive at the top of the current PCIe GPU tier.
Technical Highlights:
Deployment Considerations:
The S6A73C is the right specification for an AI inference node or multi-GPU visualization server where 96 GB of on-card memory is the architectural requirement — not a luxury. If your workload fits comfortably in 48 GB or less, a lower-memory configuration in the same family will deliver better price-per-TFLOP. But if you are running large foundation models at production batch sizes without quantization, this is the card that removes memory capacity as a constraint.
Manufacturer-verified compatible cameras, recorders, mounts, accessories, and licenses for this product. Adjust quantities and add the entire bundle to your cart in one click.
Looking for more HPE products? Shop the full HPE catalog →
Support services and planning resources for commercial surveillance, access control, and infrastructure deployments.
Fixed scope • Fixed price