Close Menu
CatchTheBullCatchTheBull
  • Home
  • Crypto News
  • Bitcoin
  • Altcoin
  • Blockchain
  • Airdrops News
  • NFT News
What's Hot

THORChain Trading Resumes After Exploit Halt, But Confidence Test Remains

June 24, 2026

Bitcoin Price Craters To $59,000. The Worst Might Be Coming

June 24, 2026

Binance OTC Services See Accelerated Growth in 2026

June 24, 2026
Facebook X (Twitter) Instagram
Facebook X (Twitter) Instagram
CatchTheBullCatchTheBull
  • Home
  • Crypto News
  • Bitcoin
  • Altcoin
  • Blockchain
  • Airdrops News
  • NFT News
CatchTheBullCatchTheBull
Blockchain

Notion Slashes AI Embedding Costs 80% After Ditching Spark for Ray

By WebDeskApril 9, 20263 Mins Read
Notion Slashes AI Embedding Costs 80% After Ditching Spark for Ray
Share
Facebook Twitter LinkedIn Pinterest Email


James Ding
Apr 09, 2026 16:48

Notion migrated from Spark on EMR to Ray, cutting embedding costs 80% and improving query latency 10x. Uber and Salesforce shared similar AI infrastructure wins.





Notion has slashed its AI embedding pipeline costs by more than 80% after migrating from Apache Spark to Ray, the distributed computing framework backed by Anyscale. The productivity software company also achieved 10x improvements in query latency while consolidating three separate jobs per region into one.

The migration details emerged at Ray Day Seattle on April 9, 2026, where ML engineers from Notion, Uber, Salesforce, and Apple shared hard-won lessons about scaling AI infrastructure.

What Notion Actually Changed

Mickey Liu, a software engineer on Notion’s search platform team, walked through the overhaul. Their original setup used a three-step Spark pipeline running on Amazon EMR: data chunking, third-party API calls for embedding generation, and writes to a vector store.

The pain points were predictable but severe. Double compute costs. Third-party API rate limits throttling throughput. Debugging nightmares when failures occurred across tools—driver and executor logs weren’t even persisted in YARN.

The new architecture streams Kafka data directly into a Ray cluster handling CPU chunking, GPU embedding generation, and vector store writes in a single pipeline. No intermediate S3 handoffs. What started as the backend for a Q&A feature in 2023 now powers all of Notion AI and custom agents.

Uber and Salesforce Report Similar Gains

Uber’s Peng Zhang detailed how their Michelangelo ML platform evolved from TensorFlow/Horovod to Ray with PyTorch. The standout move: separating CPU data-loading nodes from GPU training nodes in a heterogeneous cluster design. Result? GPU utilization jumped 20%, and training time dropped roughly 50% in select pipelines.

Salesforce tackled a different beast—summarizing documents up to 200,000 tokens long (roughly a short novel) with P95 latency under 15 seconds. Their team used Ray to chunk documents and run parallel inference across a distributed actor pool with vLLM, then merge results. They landed on 1-2 GPU data parallelism as the sweet spot after running scaling experiments directly on Ray.

Why This Matters Beyond These Companies

Robert Nishihara, Ray’s co-creator and Anyscale co-founder, opened the event by framing the core problem: AI infrastructure keeps getting harder. Multimodal data processing, reinforcement learning workloads, and multi-node LLM inference are pushing existing tools past their limits.

Every speaker landed on the same conclusion from different angles—their previous tooling ran out of road.

Apple engineers Charlie Chen and Haocheng Bian highlighted foundation model training challenges: massive unstructured data, billion-plus parameters, and sparse architectures like Mixture of Experts. Traditional engines fail because data pipelines and training frameworks run in separate environments with no shared context.

What’s Next

Ray Day Seattle kicked off Anyscale’s 2026 “Ray on the Road” tour—eight cities across three countries. The company is also running invite-only customer roundtables at each stop to preview their product roadmap.

For teams hitting similar walls with Spark or other distributed frameworks, Notion’s full technical writeup is available on their engineering blog under “Two Years of Vector Search at Notion.” The 80% cost reduction and 10x latency improvement offer a concrete benchmark for anyone evaluating similar migrations.

Image source: Shutterstock


Credit: Source link

Previous ArticleSEC, Treasury Officials Urge Congress To Pass Crypto Market Bill
Next Article Skygen.AI Unveils Autonomous Computer Operator: Why This Computer Use Release is a Game-Changer for the Web3 Era

Related Posts

Binance OTC Services See Accelerated Growth in 2026

June 24, 2026

Bitcoin Drops 2.3% to $61,053 Amid Macro Pressures

June 24, 2026

AAVE Price Prediction: Dead Cat Bounce or Real Base — $75 Is Make-or-Break Right Now

June 24, 2026
Add A Comment
Leave A Reply Cancel Reply

Top Posts

THORChain Trading Resumes After Exploit Halt, But Confidence Test Remains

June 24, 2026

Bitcoin Price Craters To $59,000. The Worst Might Be Coming

June 24, 2026

Binance OTC Services See Accelerated Growth in 2026

June 24, 2026

Subscribe to Updates

Get the latest Crypto, Blockchain and Airdrop News from us to Catch The Bull.

Advertisement Banner

Welcome to CatchTheBull, your trusted source for the latest Crypto News and Airdrops. We bring you real-time updates, expert insights, and opportunities to stay ahead in the crypto world. Discover trending projects, market analyses, and airdrop details all in one place.

Join us on this journey to navigate the ever-evolving blockchain universe!

Facebook X (Twitter) Instagram YouTube
Top Insights

SecondFi Fixes Cardano Wallet Flaw That Led to 16M ADA Theft

4 Risks Investors Should Watch

SecondFi Exploit Warning Puts Cardano DeFi Security Back Under Pressure

Get Informed

Subscribe to Updates

Get the latest Crypto, Blockchain and Airdrop News from us to Catch The Bull.

© 2026 CatchTheBull. All Rights Are Reserved.
  • Contact Us
  • Privacy Policy
  • Terms of Use
  • DMCA

Type above and press Enter to search. Press Esc to cancel.

  • bitcoinBitcoin(BTC)$60,776.00-2.76%
  • ethereumEthereum(ETH)$1,612.62-2.91%
  • tetherTether(USDT)$1.00-0.03%
  • binancecoinBNB(BNB)$562.56-2.47%
  • usd-coinUSDC(USDC)$1.00-0.01%
  • rippleXRP(XRP)$1.07-3.36%
  • solanaSolana(SOL)$67.73-2.47%
  • tronTRON(TRX)$0.327165-0.58%
  • Figure HelocFigure Heloc(FIGR_HELOC)$1.03-0.64%
  • HyperliquidHyperliquid(HYPE)$63.231.73%
  • dogecoinDogecoin(DOGE)$0.075765-3.91%
  • USDSUSDS(USDS)$1.00-0.03%
  • RainRain(RAIN)$0.0158521.37%
  • leo-tokenLEO Token(LEO)$9.43-0.90%
  • zcashZcash(ZEC)$412.44-1.15%
  • stellarStellar(XLM)$0.185390-4.55%
  • moneroMonero(XMR)$316.84-0.17%
  • CantonCanton(CC)$0.151061-0.09%
  • whitebitWhiteBIT Coin(WBT)$49.39-2.95%
  • chainlinkChainlink(LINK)$7.40-2.68%
  • cardanoCardano(ADA)$0.146997-2.83%
  • LABLAB(LAB)$16.1810.41%
  • USD1USD1(USD1)$1.00-0.03%
  • daiDai(DAI)$1.00-0.01%
  • Ethena USDeEthena USDe(USDE)$1.00-0.05%
  • the-open-networkGram (prev. Toncoin)(GRAM)$1.580.40%
  • bitcoin-cashBitcoin Cash(BCH)$188.60-2.66%
  • MemeCoreMemeCore(M)$2.69-5.21%
  • hedera-hashgraphHedera(HBAR)$0.075485-2.66%
  • litecoinLitecoin(LTC)$40.91-2.47%
  • Circle USYCCircle USYC(USYC)$1.13-0.01%
  • Global DollarGlobal Dollar(USDG)$1.00-0.03%
  • avalanche-2Avalanche(AVAX)$6.41-0.71%
  • suiSui(SUI)$0.68-2.48%
  • paypal-usdPayPal USD(PYUSD)$1.00-0.02%
  • crypto-com-chainCronos(CRO)$0.0566270.15%
  • shiba-inuShiba Inu(SHIB)$0.000004-4.43%
  • nearNEAR Protocol(NEAR)$1.95-1.34%
  • tether-goldTether Gold(XAUT)$3,992.06-2.44%
  • BlackRock USD Institutional Digital Liquidity FundBlackRock USD Institutional Digital Liquidity Fund(BUIDL)$1.000.00%
  • Ondo US Dollar YieldOndo US Dollar Yield(USDY)$1.14-0.19%
  • BittensorBittensor(TAO)$217.74-1.01%
  • worldcoin-wldWorldcoin(WLD)$0.53-0.69%
  • World Liberty FinancialWorld Liberty Financial(WLFI)$0.057654-2.60%
  • pax-goldPAX Gold(PAXG)$3,992.86-2.48%
  • uniswapUniswap(UNI)$2.89-0.78%
  • mantleMantle(MNT)$0.50-2.48%
  • AsterAster(ASTER)$0.61-3.70%
  • Ripple USDRipple USD(RLUSD)$1.000.04%
  • okbOKB(OKB)$75.00-2.23%