Augment Code tests the impact on coding agents in the monorepo through AGENTS.md. The results show that high-quality AGENTS.md can significantly improve output, sometimes even worse than having none; the best practice is a main file of 100–150 lines, a small focused reference document, a six-step process, and a decision table, improving by 10%–25%. Excessive or unreferenced documentation can drag down performance, and AGENTS.md is the only essential document source that agents must read.

MeNews

2026-04-23 06:40:33

Abstract generation in progress

ME News Report, April 23 (UTC+8), according to Beating Monitoring, AI programming tool company Augment Code extracted dozens of AGENTS.md files from their monorepo and used their internal evaluation suite, AuggieBench, to measure their actual impact on code generation agents’ output. The method was to use a high-quality merged PR as a baseline, then have the agent redo the same task under two conditions: with and without AGENTS.md, and compare the scores. The difference was much larger than expected. The best AGENTS.md files provided a quality boost equivalent to upgrading the model from Haiku to Opus, while the worst ones were no better than having no AGENTS.md at all. Moreover, the same file could have opposite effects on different tasks: it increased the compliance of a bug fix specification by 25%, but reduced the completion rate of a complex feature in the same module by 30%. Effective writing practices include: keeping the main file between 100 and 150 lines, providing several focused reference documents, which can bring a 10% to 15% overall improvement in medium-sized modules with about a hundred core files. Structuring the process into numbered steps yields the best results; a six-step deployment process reduced PRs missing files from 40% to 10%, increasing accuracy by 25%. Using decision tables to help agents choose the correct plan before acting also improved compliance by 25%. When writing prohibitions, they must be paired with alternative solutions; simply writing “do not” causes agents to hesitate, and more than 15 consecutive warnings significantly worsen performance. The most common pitfall is having too much documentation. Once agents are pulled into a large number of architecture documents, loading hundreds of thousands of tokens, their output actually deteriorates. One module accumulated 226 documents exceeding 2MB, and even the best AGENTS.md would be useless. Additionally, AGENTS.md is the only document location that agents will read 100% of the time; unreferenced documents under _docs/ are discovered less than 10% of the time. (Source: BlockBeats)

View Original

This page may contain third-party content, which is provided for information purposes only (not representations/warranties) and should not be considered as an endorsement of its views by Gate, nor as financial or professional advice. See Disclaimer for details.

Reward
like
Comment
Repost
Share

Comment

Add a comment

No comments

Trending Topics
View More
#
Gate13thAnniversaryLive
1.08M Popularity
#
WCTCTradingChallengeShare8MUSDT
797.12K Popularity
#
BitcoinBouncesBack
211.85K Popularity
#
EthereumMemeSeasonReturns
2M Popularity
#
USIranTalksProgress
751.18K Popularity

Sitemap

Augment Code practical testing of AGENTS.md's impact on code generation: the best is equivalent to a model upgrade by one level, the worst is not writing it at all

Trending Topics

Gate13thAnniversaryLive

WCTCTradingChallengeShare8MUSDT

BitcoinBouncesBack

EthereumMemeSeasonReturns

USIranTalksProgress

Pin