• bitcoinBitcoin(BTC)$73,448.000.26%
  • ethereumEthereum(ETH)$2,014.730.67%
  • tetherTether(USDT)$1.000.02%
  • binancecoinBNB(BNB)$658.543.69%
  • rippleXRP(XRP)$1.353.06%
  • usd-coinUSDC(USDC)$1.000.01%
  • solanaSolana(SOL)$82.641.35%
  • tronTRON(TRX)$0.342852-2.02%
  • Figure HelocFigure Heloc(FIGR_HELOC)$1.030.22%
  • dogecoinDogecoin(DOGE)$0.1011152.18%
  • HyperliquidHyperliquid(HYPE)$65.116.02%
  • USDSUSDS(USDS)$1.000.00%
  • stellarStellar(XLM)$0.29255940.87%
  • leo-tokenLEO Token(LEO)$10.000.57%
  • RainRain(RAIN)$0.0144211.68%
  • cardanoCardano(ADA)$0.2364301.42%
  • zcashZcash(ZEC)$524.71-1.49%
  • moneroMonero(XMR)$405.8014.17%
  • chainlinkChainlink(LINK)$9.243.46%
  • whitebitWhiteBIT Coin(WBT)$53.960.32%
  • CantonCanton(CC)$0.1569921.87%
  • bitcoin-cashBitcoin Cash(BCH)$302.760.06%
  • the-open-networkToncoin(TON)$1.75-0.30%
  • USD1USD1(USD1)$1.000.04%
  • Ethena USDeEthena USDe(USDE)$1.000.01%
  • hedera-hashgraphHedera(HBAR)$0.1005699.95%
  • daiDai(DAI)$1.000.00%
  • litecoinLitecoin(LTC)$52.401.57%
  • avalanche-2Avalanche(AVAX)$8.971.13%
  • MemeCoreMemeCore(M)$2.91-2.67%
  • suiSui(SUI)$0.91-1.32%
  • shiba-inuShiba Inu(SHIB)$0.0000063.05%
  • crypto-com-chainCronos(CRO)$0.0685352.23%
  • nearNEAR Protocol(NEAR)$2.36-4.05%
  • paypal-usdPayPal USD(PYUSD)$1.000.00%
  • Circle USYCCircle USYC(USYC)$1.130.02%
  • tether-goldTether Gold(XAUT)$4,512.730.67%
  • Global DollarGlobal Dollar(USDG)$1.000.02%
  • BlackRock USD Institutional Digital Liquidity FundBlackRock USD Institutional Digital Liquidity Fund(BUIDL)$1.000.00%
  • BittensorBittensor(TAO)$250.02-2.54%
  • mantleMantle(MNT)$0.653.55%
  • Ondo US Dollar YieldOndo US Dollar Yield(USDY)$1.140.79%
  • pax-goldPAX Gold(PAXG)$4,527.010.82%
  • polkadotPolkadot(DOT)$1.210.54%
  • uniswapUniswap(UNI)$3.061.18%
  • World Liberty FinancialWorld Liberty Financial(WLFI)$0.0587670.26%
  • okbOKB(OKB)$87.02-0.42%
  • AsterAster(ASTER)$0.681.51%
  • OndoOndo(ONDO)$0.349198-4.16%
  • Ripple USDRipple USD(RLUSD)$1.000.00%
TradePoint.io
  • Main
  • AI & Technology
  • Stock Charts
  • Market & News
  • Business
  • Finance Tips
  • Trade Tube
  • Blog
  • Shop
No Result
View All Result
TradePoint.io
No Result
View All Result

How to Use AgentTrove: Streaming 1.7M Agentic Traces and Building a Clean ShareGPT SFT Dataset in Python

May 30, 2026
in AI & Technology
Reading Time: 2 mins read
A A
How to Use AgentTrove: Streaming 1.7M Agentic Traces and Building a Clean ShareGPT SFT Dataset in Python
ShareShareShareShareShare

YOU MAY ALSO LIKE

Blue Origin’s New Glenn Rocket Explodes on Launchpad

The $5 Trillion Race for the Humanoid Future

def is_success(row):
   res = (row.get("result") or "").lower()
   if res in ("resolved", "success", "pass", "passed", "correct"):
       return True
   rw = row.get("reward")
   try:
       return float(rw) >= 1.0
   except (TypeError, ValueError):
       return False
out_path = "agenttrove_clean_sft.jsonl"
kept, scanned, SCAN, KEEP = 0, 0, 1500, 200
print(f"\n⏳ Scanning up to {SCAN} rows, keeping up to {KEEP} successful traces…")
with open(out_path, "w") as f:
   for row in itertools.islice(load_dataset(REPO, split="train", streaming=True), SCAN):
       scanned += 1
       if not is_success(row):
           continue
       turns = normalize_turns(row[TRACE_KEY])
       conv = [{"from": r, "value": c} for r, c in turns if c.strip()]
       if len(conv) < 2:
           continue
       f.write(json.dumps({
           "conversations": conv,
           "source": row.get("original_source"),
           "teacher": row.get("original_teacher"),
       }) + "\n")
       kept += 1
       if kept >= KEEP:
           break
print(f"✅ Scanned {scanned} rows → wrote {kept} clean traces to '{out_path}'")
def search_traces(keyword=None, source=None, limit=3, scan=3000):
   """Stream the dataset and yield-print traces matching filters."""
   hits = 0
   for row in itertools.islice(load_dataset(REPO, split="train", streaming=True), scan):
       if source and row.get("original_source") != source:
           continue
       if keyword:
           blob = " ".join(c for _, c in normalize_turns(row[TRACE_KEY]))
           if keyword.lower() not in blob.lower():
               continue
       render_trace(row, max_chars=300)
       hits += 1
       if hits >= limit:
           break
   if hits == 0:
       print("No matches in the scanned window — try increasing `scan`.")
print("\n🔍 Searching for 'nl2bash' source traces:")
search_traces(source="nl2bash", limit=2, scan=4000)
print("\n🎉 Tutorial complete! Next ideas:")
print("   • Increase N / SCAN for bigger analyses.")
print("   • Filter by original_source (swesmith, codeforces, r2egym…) for a domain SFT set.")
print("   • Feed agenttrove_clean_sft.jsonl into Axolotl / LLaMA-Factory for fine-tuning.")

Credit: Source link

ShareTweetSendSharePin

Related Posts

Blue Origin’s New Glenn Rocket Explodes on Launchpad
AI & Technology

Blue Origin’s New Glenn Rocket Explodes on Launchpad

May 30, 2026
The  Trillion Race for the Humanoid Future
AI & Technology

The $5 Trillion Race for the Humanoid Future

May 30, 2026
SpaceX Said to Lower Target for IPO Valuation to .8 Trillion
AI & Technology

SpaceX Said to Lower Target for IPO Valuation to $1.8 Trillion

May 30, 2026
Anthropic Valuation of 5 Billion Passes OpenAI
AI & Technology

Anthropic Valuation of $965 Billion Passes OpenAI

May 30, 2026
Next Post
Over half of Great American State Fair performers drop out over politics, threats – The Washington Post

Over half of Great American State Fair performers drop out over politics, threats - The Washington Post

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Search

No Result
View All Result
Citizens Financial Group, Inc. (CFG) Presents at Bernstein 42nd Annual Strategic Decisions Conference Transcript

Citizens Financial Group, Inc. (CFG) Presents at Bernstein 42nd Annual Strategic Decisions Conference Transcript

May 29, 2026
SpaceX Files for IPO on Nasdaq Under SPCX Symbol

SpaceX Files for IPO on Nasdaq Under SPCX Symbol

May 24, 2026
A Coding Guide to Implement a pgvector-Powered Semantic, Hybrid, Sparse, and Quantized Vector Search System

A Coding Guide to Implement a pgvector-Powered Semantic, Hybrid, Sparse, and Quantized Vector Search System

May 28, 2026

About

Learn more

Our Services

Legal

Privacy Policy

Terms of Use

Bloggers

Learn more

Article Links

Contact

Advertise

Ask us anything

©2020- TradePoint.io - All rights reserved!

Tradepoint.io, being just a publishing and technology platform, is not a registered broker-dealer or investment adviser. So we do not provide investment advice. Rather, brokerage services are provided to clients of Tradepoint.io by independent SEC-registered broker-dealers and members of FINRA/SIPC. Every form of investing carries some risk and past performance is not a guarantee of future results. “Tradepoint.io“, “Instant Investing” and “My Trading Tools” are registered trademarks of Apperbuild, LLC.

This website is operated by Apperbuild, LLC. We have no link to any brokerage firm and we do not provide investment advice. Every information and resource we provide is solely for the education of our readers. © 2020 Apperbuild, LLC. All rights reserved.

No Result
View All Result
  • Main
  • AI & Technology
  • Stock Charts
  • Market & News
  • Business
  • Finance Tips
  • Trade Tube
  • Blog
  • Shop

© 2023 - TradePoint.io - All Rights Reserved!