𝗠𝗶𝗰𝗿𝗼𝘀𝗼𝗳𝘁 𝗙𝗮𝘀𝘁𝗖𝗼𝗻𝘁𝗲𝘅 𝗖𝘂𝘁𝘀 𝗖𝗼𝗱𝗶𝗻𝗴 𝗔𝗴𝗲𝗻𝘁 𝗧𝗼𝗸𝗲𝗻𝘀 𝗯𝘆 𝟲𝟬%

Coding agents waste too much time searching for code.

When an agent searches a repository, it often pulls every file into its own context window. This fills the "desk" with raw data before the agent even starts coding.

Microsoft researchers studied GPT-5.4 traces and found a massive problem:

Most of this data is low signal. The agent only needs a few lines, but it carries the whole file.

Microsoft released FastContext to solve this.

Instead of the main agent doing the searching, it uses a dedicated explorer subagent. Think of this like a librarian. You stay at your desk, and you send a librarian into the stacks to find information.

How it works:

The main agent gets the exact location without the bulky text.

The results are significant:

The explorer model (4B-30B) undergoes two training stages. First, supervised fine-tuning teaches it how to explore. Second, task-grounded reinforcement learning ensures it finds evidence that actually helps the main agent solve the problem.

By offloading the "haystack" to a subagent, the main agent keeps its context window clean for actual reasoning and coding.

Source: https://dev.to/pueding/microsoft-fastcontext-a-repo-explorer-subagent-cuts-coding-agent-tokens-60-explorer-subagent-2lpk

Optional learning community: https://t.me/GyaanSetuAi