Product images are provided for reference and may not represent the exact model, configuration, or included components.

Overview

SKU: VCNRTXPRO6000BQ-PB
UPC: 751492796208
Condition: New
Write a Review

PNY VCNRTXPRO6000BQ-PB NVIDIA Blackwell Architecture 24 064 Cuda Cores 752 NVIDIA Tensor Cores 188

PNY VCNRTXPRO6000BQ-PB NVIDIA Blackwell RTX Pro 6000 GPU Overview The PNY VCNRTXPRO6000BQ-PB is an NVIDIA Blackwell-architecture professional GPU en…

$15,395.99
Ships same business day
In stock

Quantity:

Adding to cart… The item has been added
Compatibility guidance available for your deployment
Senior specialists for pre and post-sales support
Authorized sourcing and documentation support
Shipping and lead-time confirmation before install

Laura Bennett, IPSD Senior Specialist

Talk to Laura

200+ hrs training • U.S - based

Senior Specialist • 877-277-7147

PNY VCNRTXPRO6000BQ-PB NVIDIA Blackwell Architecture 24 064 Cuda Cores 752 NVIDIA Tensor Cores 188

$15,395.99

Overview

SKU: VCNRTXPRO6000BQ-PB
UPC: 751492796208
Condition: New

No Bots, Just Experts

Questions about this product? Free pre-sales support from a senior specialist — product questions, compatibility checks, BOM quotes, price confirmation — typically answered within one business day. Need camera placement or system design work? Engineering time is $175 per hour (qty 1 = 1 hour). Hardware buyers get up to one hour ($175) credited back on their order.

Description

PNY VCNRTXPRO6000BQ-PB NVIDIA Blackwell RTX Pro 6000 GPU

Overview

The PNY VCNRTXPRO6000BQ-PB is an NVIDIA Blackwell-architecture professional GPU engineered for real-time surveillance analytics, AI inference at the edge, and compute-intensive security applications. This is not a gaming card — it's a dual-slot, passively cooled workstation accelerator that delivers 120 TFLOPS of single-precision performance and 4 PFLOPS of peak AI throughput in configurations where thermal headroom and power budget matter.

Built on fifth-generation Tensor Cores (752 total) and fourth-generation RT Cores (188 total), the VCNRTXPRO6000BQ-PB handles real-time video decoding, multi-stream transcoding, and deep-learning inference across multiple concurrent tasks — exactly what you need when you're running object detection across dozens of camera feeds on a single node.

Key Features

  • 96 GB GDDR7 with ECC: Enough memory for simultaneous multi-stream decoding and model inference without model quantization or frequent reloads. Enterprise-grade ECC prevents silent data corruption — critical when your models are making decisions that affect physical security.
  • 120 TFLOPS FP32 Performance: Handles real-time H.264/H.265 transcoding at 4K resolution across 12–16 concurrent streams without frame drops. Measurable speedup over CPU-only pipelines in surveillance VMS architectures.
  • 4 PFLOPS Peak FP4 AI Performance: New low-precision tensor math accelerates inference workloads — object detection, face recognition, crowd counting — at 10x the throughput of equivalent FP32 operations. Practical benefit: inference latency drops from 80ms to 8ms per frame on quantized models.
  • 4x NVENC + 4x NVDEC + 4x JPEG Encode/Decode: Hardware video codec engines allow independent encoding and decoding streams. Deploy one VCNRTXPRO6000BQ-PB to ingest raw video from 4 separate cameras, re-encode for archive, and simultaneously stream to 4 different VMS instances — all without stealing CPU cycles.
  • 1597 GB/s Memory Bandwidth via 512-bit Interface: Saturated bandwidth means your inference models and video pipelines never wait on memory fetch latency. In practice, this eliminates the performance cliff you hit when moving from 1–2 concurrent streams to 8+.
  • Passive Thermal Solution: Zero-RPM cooling means silent operation in server rooms and branch office closets. No fan failure risk, no intake filter maintenance. Trade-off: requires adequate case airflow (typical in rackmount, problematic in fanless enclosures).
  • Up to 4 MIG (Multi-Instance GPU) Partitions @ 24GB each: Carve the card into 4 independent 24GB GPUs, each running separate tenant workloads or inference models. Eliminates card-level contention; useful in multi-tenant cloud or managed-security-service deployments.
  • PCI Express 5.0 x16: PCIe Gen 5 doubles bandwidth versus Gen 4 (64 GB/s upstream and downstream). Meaningful when pushing high-bitrate encoded video or frequent model weights to the GPU — reduces PCIe bottleneck in edge appliances.
  • 600W Maximum Power Consumption: Single PCIe CEM5 16-pin connector handles all power delivery. Fits standard dual-socket x86 workstations and 2U/4U rackmount servers with standard 750W+ PSUs — no exotic power distribution required.
  • 4x DisplayPort 2.1 Outputs: Native video display for engineering workstations, real-time monitoring dashboards, or live VMS console deployments. Useful in security operations centers running AI-augmented video walls.

Integration & Compatibility

The VCNRTXPRO6000BQ-PB integrates into any x86 Linux or Windows server with a free dual-slot PCIe x16 slot. NVIDIA CUDA 12.x SDK and cuDNN libraries enable integration with major surveillance and analytics platforms: DeepStream (NVIDIA's native video pipeline), VMS applications with NVIDIA GPU acceleration plugins (Milestone XProtect with Nvidia AI plugins, Genetec, Hanwha, etc.), and custom Python/C++ inference frameworks.

Passive cooling means the card operates reliably in temperature-controlled server rooms and climate-monitored branch offices. Do not deploy in direct sunlight, next to heating equipment, or in unventilated enclosures — case convection and ambient airflow are non-negotiable.

What's in the Box

PNY supplies the GPU module only. No additional accessories, mounting hardware, or power adapters are included. You provision the card into an existing server chassis or workstation with available PCIe x16 bandwidth and adequate 12V auxiliary power.

Frequently Asked Questions

Q: What's the warranty on the VCNRTXPRO6000BQ-PB?

A: NVIDIA professional GPUs typically ship with a manufacturer warranty covering defects in material and workmanship. Contact your reseller or NVIDIA support for the specific term applicable to your purchase date and region.

Q: Does the VCNRTXPRO6000BQ-PB support NVIDIA NVDEC for H.265 (HEVC) decoding?

A: Yes. The VCNRTXPRO6000BQ-PB includes 4x independent NVDEC engines capable of hardware H.264 and H.265 decoding. This is essential for surveillance applications where incoming camera streams arrive in H.265 to save bandwidth, and you need to re-encode or analyze in real time.

Q: Can I partition the VCNRTXPRO6000BQ-PB into separate instances for multi-tenant workloads?

A: Yes, via NVIDIA MIG (Multi-Instance GPU). You can create up to 4 independent GPU partitions, each with 24GB of memory and corresponding compute and codec resources. This is useful in managed security services or cloud environments where you want to isolate customer inference workloads.

Q: What's the power draw under sustained video transcoding?

A: Peak consumption is 600W. Under typical surveillance transcoding workloads (4–6 concurrent H.265 streams), expect 400–500W sustained draw. Always provision your server PSU for the 600W maximum to avoid thermal throttling under peak inference load.

Q: Does the VCNRTXPRO6000BQ-PB work in 1U rackmount servers?

A: The card is 10.5 inches long and occupies 2 slot widths. Standard 1U/2U rackmount servers with dual-slot PCIe clearance will accommodate it. Check your chassis specifications for PCIe slot layout and thermal exhaust routing — passive cooling requires case fans to move air across the heatsink.

Q: Is there an 850nm infrared (IR) module on the VCNRTXPRO6000BQ-PB itself?

A: No. The 850nm IR specification in your product metadata likely refers to a companion document or platform context. The VCNRTXPRO6000BQ-PB is a pure GPU — it does not emit or detect light. IR capability comes from camera hardware upstream; the GPU processes the incoming video stream.

Jerry Tildsen
Jerry Tildsen

I specify the VCNRTXPRO6000BQ-PB when a single-node surveillance analytics cluster needs to handle 12–20 concurrent camera feeds with real-time AI inference — object detection, person counting, anomaly detection — without spawning a farm of smaller GPUs. The 96GB unified memory pool and 4x NVDEC engines are the clinchers: you're not swapping models to disk or queuing inference tasks. This card was built for exactly this workload.

Technical Highlights:

  • 4 PFLOPS FP4 Peak AI Throughput: Quantized inference models (INT8, FP4) run at roughly 10x the throughput of FP32 operations. Real deployment impact: an object detection model that takes 80ms in FP32 drops to 8ms in quantized mode — that's the difference between frame-level responsiveness and obvious latency in a video stream.
  • 4x Independent NVDEC Engines: Decode four separate H.265 streams in parallel without GPU compute contention. Pair this with 4x NVENC and you can ingest raw video, re-encode for archive at lower bitrate, and ship analytics outputs to a VMS — all on a single card. No CPU bottleneck.
  • 1597 GB/s Memory Bandwidth: The 512-bit interface feeds your models fast enough that you rarely see memory as the limiting factor. When you scale from 4 concurrent inferences to 16, performance doesn't cliff — it scales linearly until GPU compute itself becomes saturated.
  • 96GB GDDR7 with ECC: Enough headroom to keep multiple high-resolution AI models resident simultaneously. ECC prevents silent bit flips — non-negotiable when your inference engine is flagging people for further investigation or triggering automated responses.

Deployment Considerations:

  • Passive cooling is silent and eliminates fan failure as a failure mode, but it demands adequate case airflow. If your server doesn't have case fans or is tucked in a hot closet, you'll thermal-throttle within minutes. Measure case intake/exhaust temperature before deployment.
  • The 600W power envelope is real. Don't assume a standard 750W datacenter PSU has enough headroom if you're also running dual CPUs and NVMe storage. Size the PSU for 800W+ to avoid brownout transients when the GPU ramps up under load.
  • PCIe x16 Gen 5 is the interface, but older servers with Gen 4 slots will still work — you'll just see reduced upstream/downstream bandwidth. Not a deal-breaker for surveillance, but it becomes visible if you're pushing encoded video at 500+ Mbps or doing frequent model weight updates.

Deploy the VCNRTXPRO6000BQ-PB into branch-office or edge appliances running Nvidia DeepStream, custom Python inference pipelines, or VMS platforms with native GPU plugins. If you're building a 10–20 camera AI analytics node with single-card simplicity, this is the accelerator that justifies the power and cooling investment.

Specifications
Ir Lowlight: 850nm
Upc: 3536403403638
Tensor Cores: 752 (fifth-generation)
Rt Cores: 188 (fourth-generation)
Single Precision Performance: 120 TFLOPS (FP32)
Peak FP4 AI Performance: 4 PFLOPS
Rt Core Performance: 355 TFLOPS
Gpu Memory: 96 GB GDDR7 with ECC
Memory Interface: 512-bit
Memory Bandwidth: 1597 GB/s
Power Consumption: Up to 600W
Multi Instance Gpu: Up to 4 MIGs @ 24GB
Nvenc Nvdec Jpeg: 4x | 4x | 4x
Graphics Bus: PCI Express 5.0 x16
Display Connectors: 4x DisplayPort 2.1
Form Factor: 4.4" (H) x 10.5" (L), dual slot
Thermal Solution: Passive
Power Connector: 1x PCIe CEM5 16-pin
Q&A
Reviews
Have Questions?

RELATED PRODUCTS

System Design, Deployment & Technical Support

Support services and planning resources for commercial surveillance, access control, and infrastructure deployments.

Fixed scope • Fixed price

System Design Assistance

  • Get help validating product compatibility
  • Coverage requirements
  • Storage planning and deployment architecture before you buy.
Request Design Help

Deployment & Configuration Support

  • Access fixed-scope support for rollout planning
  • User setup guidance
  • Migration and system standardization across single-site or multi-site deployments
View Support Services

Guides, Tools & Calculators

  • PoE requirements
  • Storage retention
  • Camera selection and deployment methodology
Open Technical Resources