Close Menu
CatchTheBullCatchTheBull
  • Home
  • Crypto News
  • Bitcoin
  • Altcoin
  • Blockchain
  • Airdrops News
  • NFT News
What's Hot

Ripple banking partner powers Elon Musk’s X Money rollout to users

June 26, 2026

Latin America’s Banks Invest Big in Digital Assets for 2026

June 26, 2026

$82K Breakout or $48K Drop

June 26, 2026
Facebook X (Twitter) Instagram
Facebook X (Twitter) Instagram
CatchTheBullCatchTheBull
  • Home
  • Crypto News
  • Bitcoin
  • Altcoin
  • Blockchain
  • Airdrops News
  • NFT News
CatchTheBullCatchTheBull
Blockchain

NVIDIA Blackwell Ultra GPUs Crush MLPerf Benchmarks with 2.7x Performance Gains

By WebDeskApril 1, 20263 Mins Read
NVIDIA Blackwell Ultra GPUs Crush MLPerf Benchmarks with 2.7x Performance Gains
Share
Facebook Twitter LinkedIn Pinterest Email


Iris Coleman
Apr 01, 2026 15:38

NVIDIA’s Blackwell Ultra GPUs set new MLPerf Inference records with 2.7x faster DeepSeek-R1 processing, hitting 2.5 million tokens per second across 288 GPUs.





NVIDIA’s Blackwell Ultra GPUs have delivered record-breaking performance in the latest MLPerf Inference v6.0 benchmarks, achieving up to 2.7x faster token throughput compared to submissions just six months ago. The results, published April 1, 2026, push NVIDIA’s cumulative MLPerf wins to 291—nine times more than all other submitters combined since 2018.

The standout figure: four GB300 NVL72 systems running 288 Blackwell Ultra GPUs processed 2.49 million tokens per second on DeepSeek-R1 in offline mode. That’s the largest GPU configuration ever submitted to any MLPerf Inference benchmark.

Software Optimization Drives Massive Gains

What’s particularly striking isn’t just raw hardware muscle—it’s how much performance NVIDIA extracted from the same silicon through software improvements. The GB300 NVL72 delivered 8,064 tokens per second per GPU on DeepSeek-R1’s server scenario, up from 2,907 tokens six months prior. Same chips, 2.77x more output.

The performance jump came from several TensorRT-LLM enhancements: faster fused kernels, optimized attention data parallel processing, and better load balancing across ranks. For the new DeepSeek-R1 Interactive scenario—which demands 5x faster minimum token rates than standard server deployments—NVIDIA deployed disaggregated serving, Wide Expert Parallel sharding, and multi-token prediction to hit 250,634 tokens per second.

Partner Nebius achieved the 2.7x speedup, demonstrating how NVIDIA’s open software stack enables ecosystem optimization. The practical implication? Token production costs dropped by over 60% on existing infrastructure.

First and Only Across New Benchmarks

MLPerf v6.0 introduced several demanding new tests, and NVIDIA was the sole platform to submit results across all of them:

  • Qwen3-VL-235B-A22B: The first multimodal vision-language model in MLPerf, hitting 79 samples/sec offline
  • GPT-OSS-120B: OpenAI’s 120B-parameter MoE reasoning model, achieving 1.05 million tokens/sec offline
  • WAN-2.2-T2V-A14B: Text-to-video generation at 21 seconds latency in single-stream mode
  • DLRMv3: Transformer-based recommendation benchmark at 104,637 samples/sec

The multimodal Qwen3-VL submission used the vLLM open-source framework, while video generation ran on TensorRT-LLM VisualGen—both indicating how quickly the open-source ecosystem is building optimized pipelines for next-generation workloads.

Partner Ecosystem Shows Depth

Fourteen partners submitted results on the NVIDIA platform this round—the largest partner participation for any single platform in MLPerf history. ASUS, Cisco, CoreWeave, Dell, Google Cloud, HPE, Lenovo, and Supermicro all delivered competitive performance numbers, suggesting the Blackwell architecture has matured enough for broad enterprise deployment.

This breadth matters for AI infrastructure buyers evaluating vendor lock-in risk. The results arrived the same week NVIDIA announced a $2 billion strategic investment in Marvell Technology to expand AI infrastructure options, signaling the company’s push to position itself as the foundational layer for AI computing rather than a single-vendor solution.

What Comes Next

NVIDIA is leading development of MLPerf Endpoints, a new benchmark designed to measure real-world API performance under production traffic conditions. Current chip-level benchmarks can’t capture latency spikes, queuing behavior, or throughput degradation under sustained load—metrics that actually determine AI service economics.

For data center operators running inference at scale, the message from these results is clear: software optimization on existing Blackwell hardware may deliver more cost reduction than waiting for next-generation silicon. A 60% reduction in per-token costs changes the economics of deploying reasoning models like DeepSeek-R1 in production.

Image source: Shutterstock


Credit: Source link

Previous ArticleMORPHO Price Jumps 15% on pyUSD Vault Launch, But Resistance Looms
Next Article Crypto-Revenge ‘On Demand’ – Why Are Rogue Groups Taking Justice On Their Own Hands?

Related Posts

Latin America’s Banks Invest Big in Digital Assets for 2026

June 26, 2026

May inflation hits 4.1% as Polymarket sees 79% odds of zero Fed cuts in 2026

June 26, 2026

AAVE Price Prediction: Bulls Are Running Out of Road Below $89 Resistance

June 26, 2026
Add A Comment
Leave A Reply Cancel Reply

Top Posts

Ripple banking partner powers Elon Musk’s X Money rollout to users

June 26, 2026

Latin America’s Banks Invest Big in Digital Assets for 2026

June 26, 2026

$82K Breakout or $48K Drop

June 26, 2026

Subscribe to Updates

Get the latest Crypto, Blockchain and Airdrop News from us to Catch The Bull.

Advertisement Banner

Welcome to CatchTheBull, your trusted source for the latest Crypto News and Airdrops. We bring you real-time updates, expert insights, and opportunities to stay ahead in the crypto world. Discover trending projects, market analyses, and airdrop details all in one place.

Join us on this journey to navigate the ever-evolving blockchain universe!

Facebook X (Twitter) Instagram YouTube
Top Insights

Why is AAVE Price Rising While Bitcoin Stays Below $60K? Key Reasons Aave Could Hit $100 Soon

Aave Founder Kulechov Dismisses Rumors of Selling AAVE at a 70% Discount, Teases Aavenomics 3.0

BitGo Implements 15% Workforce Reduction In Shift To AI Infrastructure

Get Informed

Subscribe to Updates

Get the latest Crypto, Blockchain and Airdrop News from us to Catch The Bull.

© 2026 CatchTheBull. All Rights Are Reserved.
  • Contact Us
  • Privacy Policy
  • Terms of Use
  • DMCA

Type above and press Enter to search. Press Esc to cancel.

  • bitcoinBitcoin(BTC)$59,618.000.54%
  • ethereumEthereum(ETH)$1,566.990.73%
  • tetherTether(USDT)$1.000.02%
  • binancecoinBNB(BNB)$564.591.94%
  • usd-coinUSDC(USDC)$1.000.01%
  • rippleXRP(XRP)$1.041.22%
  • solanaSolana(SOL)$72.158.94%
  • tronTRON(TRX)$0.319479-1.22%
  • Figure HelocFigure Heloc(FIGR_HELOC)$1.02-1.90%
  • HyperliquidHyperliquid(HYPE)$63.851.02%
  • dogecoinDogecoin(DOGE)$0.0752722.32%
  • RainRain(RAIN)$0.015643-0.66%
  • USDSUSDS(USDS)$1.000.00%
  • leo-tokenLEO Token(LEO)$9.20-2.12%
  • zcashZcash(ZEC)$418.844.13%
  • stellarStellar(XLM)$0.1790342.84%
  • moneroMonero(XMR)$318.233.94%
  • LABLAB(LAB)$19.025.50%
  • CantonCanton(CC)$0.1520992.67%
  • whitebitWhiteBIT Coin(WBT)$48.180.11%
  • cardanoCardano(ADA)$0.1472214.15%
  • chainlinkChainlink(LINK)$7.311.66%
  • USD1USD1(USD1)$1.000.00%
  • daiDai(DAI)$1.000.00%
  • Ethena USDeEthena USDe(USDE)$1.00-0.01%
  • the-open-networkGram (prev. Toncoin)(GRAM)$1.560.40%
  • bitcoin-cashBitcoin Cash(BCH)$197.435.31%
  • litecoinLitecoin(LTC)$41.782.59%
  • hedera-hashgraphHedera(HBAR)$0.071559-1.85%
  • Circle USYCCircle USYC(USYC)$1.130.00%
  • Global DollarGlobal Dollar(USDG)$1.000.02%
  • suiSui(SUI)$0.703.67%
  • avalanche-2Avalanche(AVAX)$6.353.22%
  • paypal-usdPayPal USD(PYUSD)$1.00-0.02%
  • crypto-com-chainCronos(CRO)$0.0546220.28%
  • tether-goldTether Gold(XAUT)$4,057.141.11%
  • shiba-inuShiba Inu(SHIB)$0.0000040.61%
  • BlackRock USD Institutional Digital Liquidity FundBlackRock USD Institutional Digital Liquidity Fund(BUIDL)$1.000.00%
  • nearNEAR Protocol(NEAR)$1.80-0.28%
  • Ondo US Dollar YieldOndo US Dollar Yield(USDY)$1.140.25%
  • BittensorBittensor(TAO)$212.801.56%
  • uniswapUniswap(UNI)$2.964.12%
  • pax-goldPAX Gold(PAXG)$4,061.211.10%
  • World Liberty FinancialWorld Liberty Financial(WLFI)$0.057743-1.50%
  • AsterAster(ASTER)$0.632.10%
  • worldcoin-wldWorldcoin(WLD)$0.467429-3.86%
  • okbOKB(OKB)$75.000.07%
  • Ripple USDRipple USD(RLUSD)$1.000.00%
  • OndoOndo(ONDO)$0.3158172.58%
  • HTX DAOHTX DAO(HTX)$0.000002-0.83%