• kpk ETH Primekpk ETH Prime(KPK ETH PRIME)$2,034.900.01%
  • bitcoinBitcoin(BTC)$69,669.00-1.99%
  • ethereumEthereum(ETH)$2,023.34-2.22%
  • kpk ETH Yieldkpk ETH Yield(KPK ETH YIELD)$2,030.62-0.04%
  • tetherTether(USDT)$1.00-0.01%
  • binancecoinBNB(BNB)$641.53-0.98%
  • rippleXRP(XRP)$1.38-2.71%
  • usd-coinUSDC(USDC)$1.000.00%
  • solanaSolana(SOL)$85.37-2.15%
  • tronTRON(TRX)$0.2878470.80%
  • Figure HelocFigure Heloc(FIGR_HELOC)$1.04-0.53%
  • dogecoinDogecoin(DOGE)$0.091924-3.33%
  • whitebitWhiteBIT Coin(WBT)$55.12-2.10%
  • USDSUSDS(USDS)$1.00-0.01%
  • cardanoCardano(ADA)$0.257977-3.33%
  • bitcoin-cashBitcoin Cash(BCH)$448.95-0.35%
  • leo-tokenLEO Token(LEO)$9.170.66%
  • HyperliquidHyperliquid(HYPE)$34.46-1.03%
  • moneroMonero(XMR)$349.530.03%
  • chainlinkChainlink(LINK)$8.92-2.26%
  • Ethena USDeEthena USDe(USDE)$1.000.03%
  • CantonCanton(CC)$0.147873-0.55%
  • stellarStellar(XLM)$0.156201-3.09%
  • USD1USD1(USD1)$1.000.00%
  • RainRain(RAIN)$0.008973-0.21%
  • daiDai(DAI)$1.000.02%
  • litecoinLitecoin(LTC)$53.85-1.05%
  • avalanche-2Avalanche(AVAX)$9.54-0.12%
  • paypal-usdPayPal USD(PYUSD)$1.00-0.02%
  • hedera-hashgraphHedera(HBAR)$0.093610-3.31%
  • suiSui(SUI)$0.95-2.20%
  • zcashZcash(ZEC)$213.46-4.23%
  • shiba-inuShiba Inu(SHIB)$0.000006-1.12%
  • the-open-networkToncoin(TON)$1.30-2.78%
  • crypto-com-chainCronos(CRO)$0.075363-1.89%
  • tether-goldTether Gold(XAUT)$5,151.490.11%
  • World Liberty FinancialWorld Liberty Financial(WLFI)$0.100939-1.28%
  • pax-goldPAX Gold(PAXG)$5,192.510.17%
  • polkadotPolkadot(DOT)$1.49-2.57%
  • MemeCoreMemeCore(M)$1.42-2.37%
  • uniswapUniswap(UNI)$3.86-2.85%
  • mantleMantle(MNT)$0.690.37%
  • Pi NetworkPi Network(PI)$0.2299535.34%
  • Circle USYCCircle USYC(USYC)$1.120.00%
  • okbOKB(OKB)$95.71-2.40%
  • BlackRock USD Institutional Digital Liquidity FundBlackRock USD Institutional Digital Liquidity Fund(BUIDL)$1.000.00%
  • BittensorBittensor(TAO)$196.17-2.05%
  • Falcon USDFalcon USD(USDF)$1.00-0.03%
  • SkySky(SKY)$0.074877-4.22%
  • Global DollarGlobal Dollar(USDG)$1.000.00%
TradePoint.io
  • Main
  • AI & Technology
  • Stock Charts
  • Market & News
  • Business
  • Finance Tips
  • Trade Tube
  • Blog
  • Shop
No Result
View All Result
TradePoint.io
No Result
View All Result

New ETH Zurich Study Proves Your AI Coding Agents are Failing Because Your AGENTS.md Files are too Detailed

February 26, 2026
in AI & Technology
Reading Time: 6 mins read
A A
New ETH Zurich Study Proves Your AI Coding Agents are Failing Because Your AGENTS.md Files are too Detailed
ShareShareShareShareShare

In the high-stakes world of AI, ‘Context Engineering’ has emerged as the latest frontier for squeezing performance out of LLMs. Industry leaders have touted AGENTS.md (and its cousins like CLAUDE.md) as the ultimate configuration point for coding agents—a repository-level ‘North Star’ injected into every conversation to guide the AI through complex codebases.

But a recent study from researchers at ETH Zurich just dropped a massive reality check. The findings are quite clear: if you aren’t deliberate with your context files, you are likely sabotaging your agent’s performance while paying a 20% premium for the privilege.

YOU MAY ALSO LIKE

Google AI Introduces Gemini Embedding 2: A Multimodal Embedding Model that Lets Your Bring Text, Images, Video, Audio, and Docs into the Embedding Space

Google starts rolling out Gemini in Chrome to users in Canada, India and New Zealand

https://arxiv.org/pdf/2602.11988

The Data: More Tokens, Less Success

The ETH Zurich research team analyzed coding agents like Sonnet-4.5, GPT-5.2, and Qwen3-30B across established benchmarks and a novel set of real-world tasks called AGENTBENCH. The results were surprisingly lopsided:

  • The Auto-Generated Tax: Automatically generated context files actually reduced success rates by roughly 3%.
  • The Cost of ‘Help‘: These files increased inference costs by over 20% and necessitated more reasoning steps to solve the same tasks.
  • The Human Margin: Even human-written files only provided a marginal 4% performance gain.
  • The Intelligence Cap: Interestingly, using stronger models (like GPT-5.2) to generate these files did not yield better results. Stronger models often have enough ‘parametric knowledge’ of common libraries that the extra context becomes redundant noise.

Why ‘Good’ Context Fails

The research team highlights a behavioral trap: AI agents are too obedient. Coding agents tend to respect the instructions found in context files, but when those requirements are unnecessary, they make the task harder.

For instance, the researchers found that codebase overviews and directory listings—a staple of most AGENTS.md files—did not help agents navigate faster. Agents are surprisingly good at discovering file structures on their own; reading a manual listing just consumes reasoning tokens and adds ‘mental’ overhead. Furthermore, LLM-generated files are often redundant if you already have decent documentation elsewhere in the repo.

https://arxiv.org/pdf/2602.11988

The New Rules of Context Engineering

To make context files actually helpful, you need to shift from ‘comprehensive documentation’ to ‘surgical intervention.’

1. What to Include (The ‘Vital Few’)

  • The Technical Stack & Intent: Explain the ‘What’ and the ‘Why.’ Help the agent understand the purpose of the project and its architecture (e.g., a monorepo structure).
  • Non-Obvious Tooling: This is where AGENTS.md shines. Specify how to build, test, and verify changes using specific tools like uv instead of pip or bun instead of npm.
  • The Multiplier Effect: The data shows that instructions are followed; tools mentioned in a context file are used significantly more often. For example, the tool uv was used 160x more frequently (1.6 times per instance vs. 0.01) when explicitly mentioned.+1

2. What to Exclude (The ‘Noise’)

  • Detailed Directory Trees: Skip them. Agents can find the files they need without a map.
  • Style Guides: Don’t waste tokens telling an agent to “use camelCase.” Use deterministic linters and formatters instead—they are cheaper, faster, and more reliable.
  • Task-Specific Instructions: Avoid rules that only apply to a fraction of your issues.
  • Unvetted Auto-Content: Don’t let an agent write its own context file without a human review. The study proves that ‘stronger’ models don’t necessarily make better guides.

3. How to Structure It

  • Keep it Lean: The general consensus for high-performance context files is under 300 lines. Professional teams often keep theirs even tighter—under 60 lines. Every line counts because every line is injected into every session.
  • Progressive Disclosure: Don’t put everything in the root file. Use the main file to point the agent to separate, task-specific documentation (e.g., agent_docs/testing.md) only when relevant.
  • Pointers Over Copies: Instead of embedding code snippets that will eventually go stale, use pointers (e.g., file:line) to show the agent where to find design patterns or specific interfaces.

Key Takeaways

  • Negative Impact of Auto-Generation: LLM-generated context files tend to reduce task success rates by approximately 3% on average compared to providing no repository context at all.
  • Significant Cost Increases: Including context files increases inference costs by over 20% and leads to a higher number of steps required for agents to complete tasks.
  • Minimal Human Benefit: While human-written (developer-provided) context files perform better than auto-generated ones, they only offer a marginal improvement of about 4% over using no context files.
  • Redundancy and Navigation: Detailed codebase overviews in context files are largely redundant with existing documentation and do not help agents find relevant files any faster.
  • Strict Instruction Following: Agents generally respect the instructions in these files, but unnecessary or overly restrictive requirements often make solving real-world tasks harder for the model.

Check out the Paper. Also, feel free to follow us on Twitter and don’t forget to join our 120k+ ML SubReddit and Subscribe to our Newsletter. Wait! are you on telegram? now you can join us on telegram as well.

The post New ETH Zurich Study Proves Your AI Coding Agents are Failing Because Your AGENTS.md Files are too Detailed appeared first on MarkTechPost.

Credit: Source link

ShareTweetSendSharePin

Related Posts

Google AI Introduces Gemini Embedding 2: A Multimodal Embedding Model that Lets Your Bring Text, Images, Video, Audio, and Docs into the Embedding Space
AI & Technology

Google AI Introduces Gemini Embedding 2: A Multimodal Embedding Model that Lets Your Bring Text, Images, Video, Audio, and Docs into the Embedding Space

March 11, 2026
Google starts rolling out Gemini in Chrome to users in Canada, India and New Zealand
AI & Technology

Google starts rolling out Gemini in Chrome to users in Canada, India and New Zealand

March 11, 2026
Social Security watchdog investigating claims that DOGE engineer copied its databases
AI & Technology

Social Security watchdog investigating claims that DOGE engineer copied its databases

March 10, 2026
NVIDIA AI Releases Nemotron-Terminal: A Systematic Data Engineering Pipeline for Scaling LLM Terminal Agents
AI & Technology

NVIDIA AI Releases Nemotron-Terminal: A Systematic Data Engineering Pipeline for Scaling LLM Terminal Agents

March 10, 2026
Next Post
Challenges in the Nancy Guthrie investigation

Challenges in the Nancy Guthrie investigation

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Search

No Result
View All Result
After U.S.-Israeli offensive, Iran unleashes attacks on multiple Arab countries

After U.S.-Israeli offensive, Iran unleashes attacks on multiple Arab countries

March 8, 2026
How Congress is reacting to the strikes in Iran

How Congress is reacting to the strikes in Iran

March 9, 2026
Live updates: Aaron Judge, Roman Anthony homers surge Team USA baseball into lead vs. Mexico in WBC 2026 – Times Union

Live updates: Aaron Judge, Roman Anthony homers surge Team USA baseball into lead vs. Mexico in WBC 2026 – Times Union

March 10, 2026

About

Learn more

Our Services

Legal

Privacy Policy

Terms of Use

Bloggers

Learn more

Article Links

Contact

Advertise

Ask us anything

©2020- TradePoint.io - All rights reserved!

Tradepoint.io, being just a publishing and technology platform, is not a registered broker-dealer or investment adviser. So we do not provide investment advice. Rather, brokerage services are provided to clients of Tradepoint.io by independent SEC-registered broker-dealers and members of FINRA/SIPC. Every form of investing carries some risk and past performance is not a guarantee of future results. “Tradepoint.io“, “Instant Investing” and “My Trading Tools” are registered trademarks of Apperbuild, LLC.

This website is operated by Apperbuild, LLC. We have no link to any brokerage firm and we do not provide investment advice. Every information and resource we provide is solely for the education of our readers. © 2020 Apperbuild, LLC. All rights reserved.

No Result
View All Result
  • Main
  • AI & Technology
  • Stock Charts
  • Market & News
  • Business
  • Finance Tips
  • Trade Tube
  • Blog
  • Shop

© 2023 - TradePoint.io - All Rights Reserved!