Where does caching live in an agent? At every layer — the browser, the API gateway, a semantic cache before the LLM, the tool result, and the database. Toggle a cache at each layer and watch where the request gets served, the cost and latency collapse, and why caching earlier wins. Then change the upstream data and meet caching's one danger: an agent that confidently serves a stale answer, and the TTL that bounds it.