LangSmith Enhances LLM Evaluations with Pytest and Vitest Integrations

By WebDesk | January 25, 2025 | 3 Mins Read


Caroline Bishop
Jan 25, 2025 04:44

LangSmith introduces Pytest and Vitest integrations to enhance LLM application evaluations, offering improved testing frameworks for developers.





LangSmith has unveiled new integrations with Pytest and Vitest, aiming to streamline the evaluation process of Large Language Model (LLM) applications. These integrations, now in beta with version 0.3.0 of the LangSmith Python and TypeScript SDKs, provide developers with enhanced testing capabilities, according to LangChain’s blog.

Enhanced Testing Frameworks for LLM Evaluations

LLM evaluations (evals) are crucial for maintaining the reliability and quality of applications. By integrating with Pytest and Vitest, developers familiar with these frameworks can now leverage LangSmith’s advanced features, such as observability and sharing capabilities, without compromising on the developer experience they are accustomed to.

The integrations allow developers to debug tests more effectively, log detailed metrics beyond simple pass/fail results, and share results effortlessly across teams. The non-deterministic nature of LLMs adds complexity to debugging, which LangSmith addresses by saving inputs, outputs, and stack traces from test cases.

Utilizing Built-in Evaluation Functions

LangSmith provides built-in evaluation functions, such as expect.edit_distance(), which computes the string distance between a test output and a reference output. This is useful for developers who want to measure how far an application's output drifts from the expected answer, rather than relying on a simple pass/fail check. Detailed insights into these functions can be found in LangSmith’s API reference.
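To make the idea of a string distance concrete, here is a minimal sketch of a normalized Levenshtein distance in plain Python. It is an illustration only, not LangSmith's implementation: the function names `levenshtein` and `normalized_distance` are hypothetical, and the metric and normalization behind expect.edit_distance() may differ.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character insertions, deletions, and substitutions
    needed to turn `a` into `b` (classic dynamic-programming approach)."""
    # Keep the shorter string in `b` so each row stays as small as possible.
    if len(a) < len(b):
        a, b = b, a
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(
                min(
                    previous[j] + 1,         # delete ch_a
                    current[j - 1] + 1,      # insert ch_b
                    previous[j - 1] + cost,  # substitute (or match)
                )
            )
        previous = current
    return previous[-1]


def normalized_distance(prediction: str, reference: str) -> float:
    """Scale the distance to [0, 1]: 0.0 means identical strings."""
    longest = max(len(prediction), len(reference))
    if longest == 0:
        return 0.0
    return levenshtein(prediction, reference) / longest
```

A test could then assert that the score stays under a chosen threshold, for example `assert normalized_distance(output, reference) < 0.2`, where the threshold is a project-specific assumption.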

Getting Started with Pytest and Vitest

To integrate with Pytest, developers need to add the @pytest.mark.langsmith decorator to their test cases. This setup logs all test case results, application traces, and feedback traces to LangSmith, providing a comprehensive view of the application’s performance.
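As a rough sketch of what this looks like in practice, the test below applies the marker to an ordinary Pytest test. The function `generate_greeting` is a hypothetical stand-in for an LLM-backed call. The example assumes the LangSmith SDK (version 0.3.0 or later) is installed and configured so that its Pytest plugin picks up the marker; without it, the test still runs as a normal Pytest test.

```python
import pytest


def generate_greeting(name: str) -> str:
    """Hypothetical stand-in for a function that would call an LLM."""
    return f"Hello, {name}!"


@pytest.mark.langsmith  # marker named in the article; logging needs the LangSmith SDK
def test_generate_greeting() -> None:
    output = generate_greeting("Ada")
    # An ordinary assertion still decides whether the test passes or fails.
    assert "Ada" in output
```

The test is run the usual way, for example with `pytest` on the command line, so existing test suites need only the extra decorator.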

Similarly, Vitest users can wrap their test cases in an ls.describe() block to achieve the same level of integration and logging. Both frameworks offer real-time feedback and can be seamlessly integrated into continuous integration (CI) pipelines, helping developers catch regressions early.

Advantages Over Traditional Evaluation Methods

Traditional evaluation methods often require predefined datasets and evaluation functions, which can be limiting. LangSmith’s new integrations offer flexibility by allowing developers to define specific test cases and evaluation logic, tailored to their application’s needs. This approach is particularly beneficial for applications that require testing across multiple tools or models with varying evaluation criteria.
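One way to express per-case evaluation logic is to pair each test case with its own check function, as in this minimal sketch. The names `answer`, `exact_match`, `contains`, and `CASES` are hypothetical, and the canned responses stand in for a real application. Combining the marker with Pytest's parametrize is assumed to work as it does for any other marker.

```python
from typing import Callable

import pytest


def answer(question: str) -> str:
    """Hypothetical stand-in for the application under test."""
    canned = {
        "What is 2 + 2?": "4",
        "Capital of France?": "The capital of France is Paris.",
    }
    return canned.get(question, "")


def exact_match(output: str, expected: str) -> bool:
    """Strict criterion: the output must equal the expected string."""
    return output.strip() == expected


def contains(output: str, expected: str) -> bool:
    """Looser criterion: the expected string must appear in the output."""
    return expected.lower() in output.lower()


# Each case carries its own evaluation function.
CASES: list[tuple[str, str, Callable[[str, str], bool]]] = [
    ("What is 2 + 2?", "4", exact_match),
    ("Capital of France?", "Paris", contains),
]


@pytest.mark.langsmith
@pytest.mark.parametrize("question, expected, check", CASES)
def test_answer(
    question: str, expected: str, check: Callable[[str, str], bool]
) -> None:
    assert check(answer(question), expected)
```

Keeping the criterion next to the case makes it easy to mix strict and lenient checks in one suite without maintaining a separate dataset.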

The real-time feedback provided by these testing frameworks facilitates rapid iteration and local development, making it easier for developers to refine their applications quickly. Additionally, the integration with CI pipelines ensures that any potential regressions are identified and addressed early in the development process.

For more information on how to utilize these integrations, developers can refer to LangSmith’s comprehensive tutorials and how-to guides available on their documentation site.

