• kpk ETH Primekpk ETH Prime(KPK ETH PRIME)$2,034.900.01%
  • bitcoinBitcoin(BTC)$69,570.00-1.70%
  • ethereumEthereum(ETH)$2,027.22-1.72%
  • kpk ETH Yieldkpk ETH Yield(KPK ETH YIELD)$2,030.62-0.04%
  • tetherTether(USDT)$1.000.00%
  • binancecoinBNB(BNB)$639.37-1.24%
  • rippleXRP(XRP)$1.38-1.76%
  • usd-coinUSDC(USDC)$1.000.00%
  • solanaSolana(SOL)$85.27-2.04%
  • tronTRON(TRX)$0.2882221.02%
  • Figure HelocFigure Heloc(FIGR_HELOC)$1.04-0.53%
  • dogecoinDogecoin(DOGE)$0.092193-3.39%
  • whitebitWhiteBIT Coin(WBT)$55.16-1.74%
  • USDSUSDS(USDS)$1.000.00%
  • cardanoCardano(ADA)$0.259285-3.01%
  • bitcoin-cashBitcoin Cash(BCH)$448.950.11%
  • leo-tokenLEO Token(LEO)$9.180.58%
  • HyperliquidHyperliquid(HYPE)$34.58-0.28%
  • moneroMonero(XMR)$350.380.00%
  • chainlinkChainlink(LINK)$8.94-1.73%
  • Ethena USDeEthena USDe(USDE)$1.000.04%
  • CantonCanton(CC)$0.147760-0.07%
  • stellarStellar(XLM)$0.156726-2.77%
  • USD1USD1(USD1)$1.000.01%
  • RainRain(RAIN)$0.0090982.49%
  • daiDai(DAI)$1.000.01%
  • litecoinLitecoin(LTC)$53.92-0.44%
  • avalanche-2Avalanche(AVAX)$9.580.55%
  • paypal-usdPayPal USD(PYUSD)$1.000.00%
  • hedera-hashgraphHedera(HBAR)$0.093794-3.17%
  • suiSui(SUI)$0.95-2.32%
  • zcashZcash(ZEC)$212.41-5.57%
  • shiba-inuShiba Inu(SHIB)$0.000006-1.27%
  • the-open-networkToncoin(TON)$1.29-3.16%
  • crypto-com-chainCronos(CRO)$0.075437-1.68%
  • tether-goldTether Gold(XAUT)$5,147.010.16%
  • World Liberty FinancialWorld Liberty Financial(WLFI)$0.100642-1.20%
  • pax-goldPAX Gold(PAXG)$5,188.130.13%
  • polkadotPolkadot(DOT)$1.50-1.43%
  • MemeCoreMemeCore(M)$1.43-2.05%
  • uniswapUniswap(UNI)$3.87-1.84%
  • mantleMantle(MNT)$0.690.01%
  • Pi NetworkPi Network(PI)$0.2305365.48%
  • Circle USYCCircle USYC(USYC)$1.120.00%
  • okbOKB(OKB)$95.41-2.36%
  • BlackRock USD Institutional Digital Liquidity FundBlackRock USD Institutional Digital Liquidity Fund(BUIDL)$1.000.00%
  • BittensorBittensor(TAO)$197.24-1.25%
  • Falcon USDFalcon USD(USDF)$1.00-0.06%
  • SkySky(SKY)$0.074834-4.38%
  • AsterAster(ASTER)$0.69-1.75%
TradePoint.io
  • Main
  • AI & Technology
  • Stock Charts
  • Market & News
  • Business
  • Finance Tips
  • Trade Tube
  • Blog
  • Shop
No Result
View All Result
TradePoint.io
No Result
View All Result

Google AI Proposes ReasoningBank: A Strategy-Level I Agent Memory Framework that Makes LLM Agents Self-Evolve at Test Time

October 1, 2025
in AI & Technology
Reading Time: 6 mins read
A A
Google AI Proposes ReasoningBank: A Strategy-Level I Agent Memory Framework that Makes LLM Agents Self-Evolve at Test Time
ShareShareShareShareShare

How do you make an LLM agent actually learn from its own runs—successes and failures—without retraining? Google Research proposes ReasoningBank, an AI agent memory framework that converts an agent’s own interaction traces—both successes and failures—into reusable, high-level reasoning strategies. These strategies are retrieved to guide future decisions, and the loop repeats so the agent self-evolves. Coupled with memory-aware test-time scaling (MaTTS), the approach delivers up to +34.2% relative effectiveness gains and –16% fewer interaction steps across web and software-engineering benchmarks compared to prior memory designs that store raw trajectories or success-only workflows.

https://arxiv.org/pdf/2509.25140

So, what is the problem?

LLM agents tackle multi-step tasks (web browsing, computer use, repo-level bug fixing) but generally fail to accumulate and reuse experience. Conventional “memory” tends to hoard raw logs or rigid workflows. Those are brittle across environments and often ignore useful signals from failures—where a lot of actionable knowledge lives. ReasoningBank reframes memory as compact, human-readable strategy items that are easier to transfer between tasks and domains.

YOU MAY ALSO LIKE

Meta rolls out new features for scam protection

Google AI Introduces Gemini Embedding 2: A Multimodal Embedding Model that Lets Your Bring Text, Images, Video, Audio, and Docs into the Embedding Space

Then how does it tackle?

Each experience is distilled into a memory item with a title, one-line description, and content containing actionable principles (heuristics, checks, constraints). Retrieval is embedding-based: for a new task, top-k relevant items are injected as system guidance; after execution, new items are extracted and consolidated back. The loop is intentionally simple—retrieve → inject → judge → distill → append—so improvements can be attributed to the abstraction of strategies, not heavy memory management.

Why it transfers: items encode reasoning patterns (“prefer account pages for user-specific data; verify pagination mode; avoid infinite scroll traps; cross-check state with task spec”), not website-specific DOM steps. Failures become negative constraints (“do not rely on search when the site disables indexing; confirm save state before navigation”), which prevents repeated mistakes.

https://arxiv.org/pdf/2509.25140

Memory-aware test-time scaling (MaTTS) proposed as well!

Test-time scaling (running more rollouts or refinements per task) is effective only if the system can learn from the extra trajectories. The research team also propsoed Memory-aware test-time scaling (MaTTS) that integrates scaling with ReasoningBank:

  • Parallel MaTTS: generate (k) rollouts in parallel, then self-contrast them to refine strategy memory.
  • Sequential MaTTS: iteratively self-refine a single trajectory, mining intermediate notes as memory signals.

The synergy is two-way: richer exploration produces better memory; better memory steers exploration toward promising branches. Empirically, MaTTS yields stronger, more monotonic gains than vanilla best-of-N without memory.

So, how good are these proposed research frameworks?

  • Effectiveness: ReasoningBank + MaTTS improves task success up to 34.2% (relative) over no-memory and outperforms prior memory designs that reuse raw traces or success-only routines.
  • Efficiency: Interaction steps drop by 16% overall; further analysis shows the largest reductions on successful trials, indicating fewer redundant actions rather than premature aborts.
https://arxiv.org/pdf/2509.25140

Where does this fits in the agent stack?

ReasoningBank is a plug-in memory layer for interactive agents that already use ReAct-style decision loops or best-of-N test-time scaling. It doesn’t replace verifiers/planners; it amplifies them by injecting distilled lessons at the prompt/system level. On web tasks, it complements BrowserGym/WebArena/Mind2Web; on software tasks, it layers atop SWE-Bench-Verified setups.


Check out the Paper here. Feel free to check out our GitHub Page for Tutorials, Codes and Notebooks. Also, feel free to follow us on Twitter and don’t forget to join our 100k+ ML SubReddit and Subscribe to our Newsletter. Wait! are you on telegram? now you can join us on telegram as well.

The post Google AI Proposes ReasoningBank: A Strategy-Level I Agent Memory Framework that Makes LLM Agents Self-Evolve at Test Time appeared first on MarkTechPost.

Credit: Source link

ShareTweetSendSharePin

Related Posts

Meta rolls out new features for scam protection
AI & Technology

Meta rolls out new features for scam protection

March 11, 2026
Google AI Introduces Gemini Embedding 2: A Multimodal Embedding Model that Lets Your Bring Text, Images, Video, Audio, and Docs into the Embedding Space
AI & Technology

Google AI Introduces Gemini Embedding 2: A Multimodal Embedding Model that Lets Your Bring Text, Images, Video, Audio, and Docs into the Embedding Space

March 11, 2026
Google starts rolling out Gemini in Chrome to users in Canada, India and New Zealand
AI & Technology

Google starts rolling out Gemini in Chrome to users in Canada, India and New Zealand

March 11, 2026
Social Security watchdog investigating claims that DOGE engineer copied its databases
AI & Technology

Social Security watchdog investigating claims that DOGE engineer copied its databases

March 10, 2026
Next Post
Powerful earthquake in central Philippines kills at least 69 – The Washington Post

Powerful earthquake in central Philippines kills at least 69 - The Washington Post

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Search

No Result
View All Result
U.S. soldiers killed in Operation Epic Fury named

U.S. soldiers killed in Operation Epic Fury named

March 6, 2026
Cuban troops open fire on U.S.-registered speedboat

Cuban troops open fire on U.S.-registered speedboat

March 11, 2026
Trump warns Iran ‘better not’ retaliate further following U.S. and Israeli strikes

Trump warns Iran ‘better not’ retaliate further following U.S. and Israeli strikes

March 8, 2026

About

Learn more

Our Services

Legal

Privacy Policy

Terms of Use

Bloggers

Learn more

Article Links

Contact

Advertise

Ask us anything

©2020- TradePoint.io - All rights reserved!

Tradepoint.io, being just a publishing and technology platform, is not a registered broker-dealer or investment adviser. So we do not provide investment advice. Rather, brokerage services are provided to clients of Tradepoint.io by independent SEC-registered broker-dealers and members of FINRA/SIPC. Every form of investing carries some risk and past performance is not a guarantee of future results. “Tradepoint.io“, “Instant Investing” and “My Trading Tools” are registered trademarks of Apperbuild, LLC.

This website is operated by Apperbuild, LLC. We have no link to any brokerage firm and we do not provide investment advice. Every information and resource we provide is solely for the education of our readers. © 2020 Apperbuild, LLC. All rights reserved.

No Result
View All Result
  • Main
  • AI & Technology
  • Stock Charts
  • Market & News
  • Business
  • Finance Tips
  • Trade Tube
  • Blog
  • Shop

© 2023 - TradePoint.io - All Rights Reserved!