Link Blog


What is an AI Agent and how is it different from an LLM?

In simple terms, an AI agent uses an LLM to reason, iterates using tool calls to gather or validate knowledge (in the form of files/docs), and combines this with its system prompt to make autonomous decisions.

LLM: Takes an input (prompt) and produces an output. It’s reactive and does not act autonomously beyond generating text.

AI Agent: Uses an LLM as a reasoning engine, but goes further. It can:

  1. Take autonomous decisions based on goals.
  2. Iterate over multiple steps, using tools, files, or APIs to gather or verify information.
  3. Update its “context” (working memory) with new information after each step; a rough sketch of this loop follows.
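
A minimal sketch of that loop, assuming two hypothetical helpers — `call_llm` for the model API and `run_tool` for executing a tool — neither of which comes from the article:

```python
# Minimal agent loop sketch. call_llm and run_tool are hypothetical helpers
# standing in for a real model API and a tool dispatcher.

def run_agent(goal: str, max_steps: int = 10) -> str:
    context = [{"role": "user", "content": goal}]   # the agent's working memory
    for _ in range(max_steps):
        decision = call_llm(context)                # LLM reasons over the current context
        if decision["type"] == "final_answer":
            return decision["content"]              # goal reached, stop iterating
        # The LLM asked for a tool: run it and feed the result back into context
        result = run_tool(decision["tool"], decision["arguments"])
        context.append({"role": "assistant", "content": str(decision)})
        context.append({"role": "tool", "content": result})
    return "Step limit reached without a final answer."
```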

Why context engineering over prompt engineering?

We know:

  1. AI agents make autonomous decisions and use tool calls in loops to gather knowledge and information.
  2. Each decision is based on context, which may update or change in every iteration.
  3. Prompt engineering falls short because it isn't iterative: a single static prompt can't account for what the agent learns along the way.

So how should we provide relevant and useful information to the agent, in a minimal set of tokens, as its "working memory" for making the next decision in each iteration?

[Figure from the article: a static system prompt vs. an evolving context window]

The figure from the article shows that a system prompt remains static over the course of the agent's iterations, whereas with the context window we keep the essential information (tool-call results, files, and APIs) that can effectively answer the following questions for the agent at any iteration (a small sketch of question 3 follows the list):

  1. What more do I need to make a decision?
  2. Which resources in my context window will help me reach the goal?
  3. Having used a resource from the window, do I now need to update the window with newer results and evict any previously stored context?
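
As an illustration of that third question, here is a rough Python sketch of updating the window under a token budget. The entry format, the `pinned` flag, and the 4-characters-per-token estimate are all assumptions for the example, not details from the article:

```python
# Illustrative context-window update: add the newest result, then evict the
# oldest non-pinned entries until the window fits a rough token budget.

def update_window(window: list[dict], new_entry: dict, budget_tokens: int = 8000) -> list[dict]:
    window = window + [new_entry]

    def estimated_tokens(entry: dict) -> int:
        return len(entry["content"]) // 4           # ~4 characters per token, rough rule of thumb

    while sum(estimated_tokens(e) for e in window) > budget_tokens:
        evictable = [e for e in window if not e.get("pinned")]  # instructions stay pinned
        if not evictable:
            break
        window.remove(evictable[0])                 # drop the oldest evictable entry
    return window
```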

What is context?

A window that holds relevant docs/files, instructions, tool results, and message history which, in a minimum set of tokens, is enough for the agent to:

  1. Use some knowledge from this window.
  2. Determine what to do next (make a decision).
  3. Produce the desired results (a toy example follows this list).
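
Concretely, the window is just the set of entries handed to the model on each call. A toy example of what it might hold mid-task (all contents invented for illustration):

```python
# Toy snapshot of a context window at some iteration of a coding agent.
context_window = [
    {"role": "system", "pinned": True,
     "content": "You are a coding agent. Prefer small, verified changes."},
    {"role": "user", "content": "Fix the failing test in tests/test_parser.py."},
    {"role": "tool", "content": "pytest output: 1 failed - test_parse_empty_string (AssertionError)"},
    {"role": "tool", "content": "parser.py, lines 40-55: the function under test"},
    {"role": "assistant", "content": "The bug is in empty-input handling; next step: edit parser.py."},
]
```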

How is context different from system prompt?

  1. A system prompt is static; it doesn't change between iterations of the agent.
  2. It can't carry, say, the results of the 3rd iteration into the 7th iteration, which the context window can.

How to do context engineering effectively?

My advice: think of it like teaching a kid how to make better decisions using the resources they have, without forcing them in a particular direction, and without overwhelming them so much that they can't determine what to do next even with so many resources at hand.

So ask yourself:

  1. How should the agent use the resources?
  2. When should it use which resource?
  3. Which knowledge should be preserved, and which let go?
  4. What should the ideal behaviour be? (don't let it become disruptive)
  5. Structure your instructions (a sketch follows this list).
  6. Tell the kid the expected behaviour rather than giving hard-coded instructions (because kids can THINK, and so do agents!).
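
To make points 5 and 6 concrete, here is one way instructions could be structured around expected behaviour instead of hard-coded steps. The section names and the tools (`search_docs`, `fetch_url`) are made up for the example:

```python
# Illustrative structure for agent instructions: sections that describe the role,
# the expected behaviour, and when to use which resource.
SYSTEM_PROMPT = """
## Role
You are a research assistant that answers questions using the tools provided.

## Expected behaviour
- Prefer searching the attached docs before calling external APIs.
- Keep only the findings you actually need; summarise long tool outputs.
- If you are unsure, say so instead of guessing.

## Tools
- search_docs(query): look up the local documentation.
- fetch_url(url): retrieve a web page when the docs are not enough.
"""
```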

The article covers this in detail, with examples of how Anthropic uses context engineering in Claude Code to perform better. My goal here is to simplify it without any technical jargon, just common sense.

How to solve problems that require too much context?

  1. Compaction: Summarise the current context window and re-initialize the next window with this summary (ideally keeping only a summary of historic events); a sketch follows this list.
  2. Structured note-taking: Write notes to persistent memory outside the context window, then pull them back in if the context window proves insufficient at some point.
  3. Multi-agent architecture: Run many agents simultaneously, each with its own context window; offload tasks to them and use a lead agent to keep the processes running and combine their outputs.
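
A minimal sketch of compaction, assuming a hypothetical `summarize` helper that asks the model for a concise summary of the history; pinned entries (e.g. instructions) carry over unchanged:

```python
# Compaction sketch: when the window nears its budget, replace the history with a
# model-written summary and start the next window from pinned entries + summary.
# summarize() is a hypothetical LLM-backed helper.

def compact(window: list[dict], budget_tokens: int = 8000) -> list[dict]:
    used = sum(len(e["content"]) // 4 for e in window)   # rough token estimate
    if used < 0.8 * budget_tokens:
        return window                                     # enough headroom, no compaction
    pinned = [e for e in window if e.get("pinned")]
    history = [e for e in window if not e.get("pinned")]
    summary = summarize(history)                          # hypothetical helper
    return pinned + [{"role": "system", "content": "Summary of earlier steps: " + summary}]
```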

What's Next?

  1. Find where context engineering is being applied and how, perhaps in an open-source project?
  2. If I apply the effective context engineering methods from the article, how can I assess the agent's performance and iterate toward an effective context?