• bitcoinBitcoin(BTC)$77,121.000.63%
  • ethereumEthereum(ETH)$2,103.70-0.52%
  • tetherTether(USDT)$1.000.01%
  • binancecoinBNB(BNB)$658.420.38%
  • rippleXRP(XRP)$1.35-0.48%
  • usd-coinUSDC(USDC)$1.000.00%
  • solanaSolana(SOL)$85.35-0.35%
  • tronTRON(TRX)$0.3654280.97%
  • Figure HelocFigure Heloc(FIGR_HELOC)$1.030.00%
  • dogecoinDogecoin(DOGE)$0.102587-0.36%
  • HyperliquidHyperliquid(HYPE)$62.085.18%
  • USDSUSDS(USDS)$1.00-0.02%
  • zcashZcash(ZEC)$658.795.22%
  • leo-tokenLEO Token(LEO)$10.05-0.20%
  • cardanoCardano(ADA)$0.242543-1.35%
  • moneroMonero(XMR)$395.242.09%
  • bitcoin-cashBitcoin Cash(BCH)$347.15-2.32%
  • chainlinkChainlink(LINK)$9.46-0.86%
  • whitebitWhiteBIT Coin(WBT)$56.790.49%
  • CantonCanton(CC)$0.1660622.34%
  • stellarStellar(XLM)$0.1477600.08%
  • USD1USD1(USD1)$1.00-0.03%
  • the-open-networkToncoin(TON)$1.77-2.48%
  • Ethena USDeEthena USDe(USDE)$1.00-0.04%
  • daiDai(DAI)$1.00-0.01%
  • suiSui(SUI)$1.04-2.77%
  • litecoinLitecoin(LTC)$52.84-1.00%
  • avalanche-2Avalanche(AVAX)$9.22-1.74%
  • hedera-hashgraphHedera(HBAR)$0.088458-1.39%
  • MemeCoreMemeCore(M)$2.870.03%
  • RainRain(RAIN)$0.0075930.22%
  • paypal-usdPayPal USD(PYUSD)$1.00-0.03%
  • shiba-inuShiba Inu(SHIB)$0.000006-1.23%
  • nearNEAR Protocol(NEAR)$2.39-0.15%
  • crypto-com-chainCronos(CRO)$0.068976-0.66%
  • Circle USYCCircle USYC(USYC)$1.130.00%
  • Global DollarGlobal Dollar(USDG)$1.000.00%
  • tether-goldTether Gold(XAUT)$4,551.140.99%
  • BittensorBittensor(TAO)$274.10-1.64%
  • BlackRock USD Institutional Digital Liquidity FundBlackRock USD Institutional Digital Liquidity Fund(BUIDL)$1.000.00%
  • uniswapUniswap(UNI)$3.40-1.52%
  • mantleMantle(MNT)$0.65-2.10%
  • pax-goldPAX Gold(PAXG)$4,560.500.78%
  • OndoOndo(ONDO)$0.4414795.02%
  • Ondo US Dollar YieldOndo US Dollar Yield(USDY)$1.130.05%
  • polkadotPolkadot(DOT)$1.25-2.98%
  • World Liberty FinancialWorld Liberty Financial(WLFI)$0.060947-2.13%
  • HTX DAOHTX DAO(HTX)$0.0000021.00%
  • AsterAster(ASTER)$0.703.41%
  • Falcon USDFalcon USD(USDF)$1.00-0.08%
TradePoint.io
  • Main
  • AI & Technology
  • Stock Charts
  • Market & News
  • Business
  • Finance Tips
  • Trade Tube
  • Blog
  • Shop
No Result
View All Result
TradePoint.io
No Result
View All Result

Build a Complete Langfuse Observability and Evaluation Pipeline for Tracing, Prompt Management, Scoring, and Experiments

May 24, 2026
in AI & Technology
Reading Time: 1 min read
A A
Build a Complete Langfuse Observability and Evaluation Pipeline for Tracing, Prompt Management, Scoring, and Experiments
ShareShareShareShareShare

YOU MAY ALSO LIKE

Need to Fill US Manufacturing Gap Long-Term, Says Amca CEO

SoftBank Founder’s Starstruck Bet on OpenAI Raises Concern

print("\nPART 5 ── Datasets & experiments --------------------------------------")
DATASET = "capital-cities-tutorial"
langfuse.create_dataset(name=DATASET, description="Capital-city QA benchmark")
_items = [
   ("What is the capital of France?",  "Paris"),
   ("What is the capital of Germany?", "Berlin"),
   ("What is the capital of Japan?",   "Tokyo"),
   ("What is the capital of Italy?",   "Rome"),
]
for i, (q, a) in enumerate(_items):
   langfuse.create_dataset_item(dataset_name=DATASET, id=f"cap-{i}",
                                input={"question": q}, expected_output=a)
def capital_task(*, item, **kwargs):
   question = item.input["question"] if isinstance(item.input, dict) else item.input
   return llm_chat([{"role": "user", "content": question}], name="experiment-answer")
def accuracy(*, input, output, expected_output, metadata=None, **kwargs):
   hit = bool(expected_output) and expected_output.lower() in (output or "").lower()
   return Evaluation(name="accuracy", value=1.0 if hit else 0.0,
                     comment="exact-match contains check")
def conciseness(*, input, output, **kwargs):
   return Evaluation(name="char_length", value=float(len(output or "")))
def mean_accuracy(*, item_results, **kwargs):
   vals = [e.value for r in item_results for e in r.evaluations if e.name == "accuracy"]
   avg = sum(vals) / len(vals) if vals else 0.0
   return Evaluation(name="mean_accuracy", value=avg, comment=f"{avg:.0%} correct")
dataset = langfuse.get_dataset(DATASET)
result = dataset.run_experiment(
   name="capitals-baseline",
   description="Baseline run from the Colab tutorial",
   task=capital_task,
   evaluators=[accuracy, conciseness],
   run_evaluators=[mean_accuracy],
   max_concurrency=4,
)
print(result.format())

Credit: Source link

ShareTweetSendSharePin

Related Posts

Need to Fill US Manufacturing Gap Long-Term, Says Amca CEO
AI & Technology

Need to Fill US Manufacturing Gap Long-Term, Says Amca CEO

May 25, 2026
SoftBank Founder’s Starstruck Bet on OpenAI Raises Concern
AI & Technology

SoftBank Founder’s Starstruck Bet on OpenAI Raises Concern

May 25, 2026
Major Chatbots Miss the Mark on News: Forum AI Study
AI & Technology

Major Chatbots Miss the Mark on News: Forum AI Study

May 24, 2026
Nvidia Earnings in Focus; SpaceX Eyes Historic IPO | Bloomberg Tech 5/20/2026
AI & Technology

Nvidia Earnings in Focus; SpaceX Eyes Historic IPO | Bloomberg Tech 5/20/2026

May 24, 2026
Next Post
Braemar Plc 2026 Q4 – Results – Earnings Call Presentation (OTCMKTS:BSEAF) 2026-05-24

Braemar Plc 2026 Q4 - Results - Earnings Call Presentation (OTCMKTS:BSEAF) 2026-05-24

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Search

No Result
View All Result
James Murdoch, Intent on ‘Thoughtful Journalism,’ Buys Half of Vox Media – The New York Times

James Murdoch, Intent on ‘Thoughtful Journalism,’ Buys Half of Vox Media – The New York Times

May 20, 2026
LIRR riders experiencing hours-long commutes to NYC as train strike extends into third day – CBS News

LIRR riders experiencing hours-long commutes to NYC as train strike extends into third day – CBS News

May 18, 2026
Outdoor retailer to drop huge deals in California amid liquidation

Outdoor retailer to drop huge deals in California amid liquidation

May 22, 2026

About

Learn more

Our Services

Legal

Privacy Policy

Terms of Use

Bloggers

Learn more

Article Links

Contact

Advertise

Ask us anything

©2020- TradePoint.io - All rights reserved!

Tradepoint.io, being just a publishing and technology platform, is not a registered broker-dealer or investment adviser. So we do not provide investment advice. Rather, brokerage services are provided to clients of Tradepoint.io by independent SEC-registered broker-dealers and members of FINRA/SIPC. Every form of investing carries some risk and past performance is not a guarantee of future results. “Tradepoint.io“, “Instant Investing” and “My Trading Tools” are registered trademarks of Apperbuild, LLC.

This website is operated by Apperbuild, LLC. We have no link to any brokerage firm and we do not provide investment advice. Every information and resource we provide is solely for the education of our readers. © 2020 Apperbuild, LLC. All rights reserved.

No Result
View All Result
  • Main
  • AI & Technology
  • Stock Charts
  • Market & News
  • Business
  • Finance Tips
  • Trade Tube
  • Blog
  • Shop

© 2023 - TradePoint.io - All Rights Reserved!