NVIDIA HGX B300

Built for the Age of AI Reasoning

NVIDIA HGX B300 Instances

The NVIDIA HGX™ B300 platform is purpose-built for the next wave of AI reasoning. With enhanced compute, expanded memory, and ultra-fast networking, the B300 delivers breakthrough performance for the most complex AI workloads.

The NVIDIA HGX B300 is now available in the Cirrascale AI Innovation Cloud. Experience the platform optimized for reasoning and test-time scaling.

Order Your Instance Now

NVIDIA Partner Network Cloud Provider

Cirrascale Offers the HGX B300 for Unmatched Performance

Cirrascale offers the HGX B300 in its AI Innovation Cloud as an 8x NVIDIA Blackwell Ultra SXM configuration giving you full GPU-to-GPU bandwidth through NVIDIA NVLink™ Switch. As a premier accelerated scaleup x86 platform with up to 11X faster real-time inference performance on large language models and 7X more AI compute performance than Hopper.

The NVIDIA HGX B300 integrates NVIDIA Blackwell Ultra GPUs with high-speed interconnects to propel the data center into a new era of accelerated computing and generative AI. As a premier accelerated scale-up platform with up to 11x more inference performance than the previous generation, NVIDIA Blackwell-based HGX systems are designed for the most demanding generative AI, data analytics, and HPC workloads.

Real-Time Large Language Model Inference

HGX B300 achieves up to 11x higher inference performance over the previous NVIDIA Hopper™ generation for models such as Llama 3.1 405B.

The second-generation Transformer Engine uses custom Blackwell Tensor Core technology combined with TensorRT™-LLM innovations to accelerate inference for large language models (LLMs).

Next-Level Training Performance

The second-generation Transformer Engine, featuring 8-bit floating point (FP8) and new precisions, enables a remarkable 4x faster training for large language models like Llama 3.1 405B.

This breakthrough is complemented by fifth-generation NVLink with 1.8 TB/s of GPU-to-GPU interconnect, InfiniBand networking, and NVIDIA Magnum IO™ software. Together, these ensure efficient scalability for enterprises and extensive GPU computing clusters.

Why Cirrascale?

We're proud of the fact that we have worked with cloud pioneers from the very start. We were the trusted cloud backbone that helped OpenAI meet their cloud compute needs early on, and we continue to engage with today's bleeding edge AI companies, like yours.

Access to the Latest AI Accelerators

The Cirrascale AI Innovation Cloud contains today's latest accelerators including the NVIDIA HGX™ B300 and B200 GPUs all interconnected with NVIDIA Quantum InfiniBand networking.

Specialized Cloud and Managed Services

Work with us to tailor the right solution for you with our wide range of system configurations, optimized for your specific workload requirements.

Transparent, Budget Friendly Pricing

With our no-surprises billing, long-term discounts, and no data transfer fees; Cirrascale offers unmatched pricing that’s built around your needs.

Get Your HGX B300 Instances

Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.

Ready To Get Started?

Ready to take advantage of our flat-rate monthly billing, no ingress/egress data fees, and fast multi-tiered storage?

Get Started