NVIDIA Blackwell Ultra GPUs Crush MLPerf Benchmarks with 2.7x Performance Gains

By WebDesk, April 1, 2026 (3 Mins Read)

Iris Coleman
Apr 01, 2026 15:38

NVIDIA’s Blackwell Ultra GPUs set new MLPerf Inference records with 2.7x faster DeepSeek-R1 processing, hitting nearly 2.5 million tokens per second across 288 GPUs.





NVIDIA’s Blackwell Ultra GPUs have delivered record-breaking performance in the latest MLPerf Inference v6.0 benchmarks, achieving up to 2.7x faster token throughput compared to submissions just six months ago. The results, published April 1, 2026, push NVIDIA’s cumulative MLPerf wins to 291—nine times more than all other submitters combined since 2018.

The standout figure: four GB300 NVL72 systems running 288 Blackwell Ultra GPUs processed 2.49 million tokens per second on DeepSeek-R1 in offline mode. That’s the largest GPU configuration ever submitted to any MLPerf Inference benchmark.
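As a rough sanity check on the headline number, the per-GPU offline rate can be derived from the figures quoted above. This is a back-of-the-envelope sketch rather than anything taken from the MLPerf submission: the variable names are illustrative, and the 72-GPUs-per-system value is inferred from 288 GPUs spread across four systems.

```python
# Figures quoted in the article (offline scenario, DeepSeek-R1).
SYSTEMS = 4
GPUS_PER_SYSTEM = 72            # inferred: 288 GPUs / 4 systems
TOTAL_TOKENS_PER_SEC = 2_490_000


def per_gpu_throughput(total_tokens_per_sec: float, gpu_count: int) -> float:
    """Average tokens per second per GPU, assuming an even split across GPUs."""
    return total_tokens_per_sec / gpu_count


total_gpus = SYSTEMS * GPUS_PER_SYSTEM
offline_per_gpu = per_gpu_throughput(TOTAL_TOKENS_PER_SEC, total_gpus)

print(total_gpus)              # 288
print(round(offline_per_gpu))  # 8646
```

The result, roughly 8,646 tokens per second per GPU, is an average over the whole configuration and says nothing about how evenly the load was actually distributed.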

Software Optimization Drives Massive Gains

What stands out is not just raw hardware muscle but how much performance NVIDIA extracted from the same silicon through software improvements. The GB300 NVL72 delivered 8,064 tokens per second per GPU on DeepSeek-R1’s server scenario, up from 2,907 tokens per second per GPU six months prior: same chips, 2.77x more output.
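The 2.77x figure follows directly from the two per-GPU rates. A minimal check, with a hypothetical helper name chosen for illustration:

```python
# Per-GPU server-scenario rates quoted in the article (tokens per second).
PREVIOUS_RATE = 2_907
CURRENT_RATE = 8_064


def speedup(new_rate: float, old_rate: float) -> float:
    """Ratio of new throughput to old throughput on the same hardware."""
    if old_rate <= 0:
        raise ValueError("old_rate must be positive")
    return new_rate / old_rate


ratio = speedup(CURRENT_RATE, PREVIOUS_RATE)
print(f"{ratio:.2f}x")  # 2.77x
```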

The performance jump came from several TensorRT-LLM enhancements: faster fused kernels, optimized attention data parallelism, and better load balancing across ranks. For the new DeepSeek-R1 Interactive scenario, which demands 5x faster minimum token rates than standard server deployments, NVIDIA deployed disaggregated serving, Wide Expert Parallel sharding, and multi-token prediction to hit 250,634 tokens per second.

Partner Nebius achieved the 2.7x speedup, demonstrating how NVIDIA’s open software stack enables ecosystem optimization. The practical implication? Token production costs dropped by over 60% on existing infrastructure.
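The article does not spell out how the 60% figure is derived. One plausible reading is sketched below, under the assumption that the cost of running a GPU for an hour is unchanged, so that cost per token scales inversely with throughput. Both that assumption and the function name are illustrative, not taken from NVIDIA or MLPerf.

```python
def cost_reduction(old_rate: float, new_rate: float) -> float:
    """Fractional drop in per-token cost when throughput rises from
    old_rate to new_rate and the cost per GPU-hour stays fixed.

    Cost per token is proportional to 1 / throughput, so the new cost
    is old_rate / new_rate of the old cost.
    """
    if old_rate <= 0 or new_rate <= 0:
        raise ValueError("rates must be positive")
    return 1.0 - old_rate / new_rate


# Per-GPU server-scenario rates quoted in the article (tokens per second).
reduction = cost_reduction(old_rate=2_907, new_rate=8_064)
print(f"{reduction:.1%}")  # 64.0%
```

Under this simple model the saving comes out at about 64%, consistent with the "over 60%" claim. Real-world savings would also depend on power, utilization, and pricing, none of which are covered by the benchmark figures.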

First and Only Across New Benchmarks

MLPerf v6.0 introduced several demanding new tests, and NVIDIA was the sole platform to submit results across all of them:

  • Qwen3-VL-235B-A22B: The first multimodal vision-language model in MLPerf, hitting 79 samples/sec offline
  • GPT-OSS-120B: OpenAI’s 120B-parameter MoE reasoning model, achieving 1.05 million tokens/sec offline
  • WAN-2.2-T2V-A14B: Text-to-video generation at 21 seconds latency in single-stream mode
  • DLRMv3: Transformer-based recommendation benchmark at 104,637 samples/sec

The multimodal Qwen3-VL submission used the vLLM open-source framework, while video generation ran on TensorRT-LLM VisualGen—both indicating how quickly the open-source ecosystem is building optimized pipelines for next-generation workloads.

Partner Ecosystem Shows Depth

Fourteen partners submitted results on the NVIDIA platform this round—the largest partner participation for any single platform in MLPerf history. ASUS, Cisco, CoreWeave, Dell, Google Cloud, HPE, Lenovo, and Supermicro all delivered competitive performance numbers, suggesting the Blackwell architecture has matured enough for broad enterprise deployment.

This breadth matters for AI infrastructure buyers evaluating vendor lock-in risk. The results arrived the same week NVIDIA announced a $2 billion strategic investment in Marvell Technology to expand AI infrastructure options, signaling the company’s push to position itself as the foundational layer for AI computing rather than a single-vendor solution.

What Comes Next

NVIDIA is leading development of MLPerf Endpoints, a new benchmark designed to measure real-world API performance under production traffic conditions. Current chip-level benchmarks can’t capture latency spikes, queuing behavior, or throughput degradation under sustained load—metrics that actually determine AI service economics.
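The methodology of MLPerf Endpoints is not described here, so the following is only a toy illustration of why sustained load matters. It simulates a single-server, first-come-first-served queue with a fixed service time and random arrivals, then compares tail latency at moderate and heavy load. The model, the function names, and every parameter value are assumptions made for this sketch.

```python
import random


def simulate_latencies(arrival_rate, service_time, n_requests, seed=0):
    """Return per-request latency (queue wait plus service) for a
    single-server FIFO queue with exponential inter-arrival times."""
    rng = random.Random(seed)
    arrival = 0.0          # arrival time of the current request
    server_free_at = 0.0   # time at which the server next becomes idle
    latencies = []
    for _ in range(n_requests):
        arrival += rng.expovariate(arrival_rate)
        start = max(arrival, server_free_at)   # wait if the server is busy
        server_free_at = start + service_time
        latencies.append(server_free_at - arrival)
    return latencies


def percentile(values, pct):
    """Nearest-rank style percentile; adequate for a rough comparison."""
    ordered = sorted(values)
    index = min(len(ordered) - 1, int(pct / 100 * len(ordered)))
    return ordered[index]


SERVICE_TIME = 0.01  # hypothetical: 10 ms per request, so capacity is 100 req/s

for rate in (50.0, 95.0):  # 50% and 95% of capacity
    lat = simulate_latencies(rate, SERVICE_TIME, n_requests=20_000, seed=1)
    print(f"load {rate / 100:.0%}: "
          f"p50={percentile(lat, 50) * 1000:.1f} ms, "
          f"p99={percentile(lat, 99) * 1000:.1f} ms")
```

Even though the server's peak throughput is identical in both runs, the tail latency grows sharply as load approaches capacity, which is the kind of effect a peak-throughput number alone does not show.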

For data center operators running inference at scale, the message from these results is clear: software optimization on existing Blackwell hardware may deliver more cost reduction than waiting for next-generation silicon. A 60% reduction in per-token costs changes the economics of deploying reasoning models like DeepSeek-R1 in production.


